Earticle

현재 위치 Home

Convergence of Internet, Broadcasting and Communication

Intelligent Warehousing: Comparing Cooperative MARL Strategies

첫 페이지 보기
  • 발행기관
    국제인공지능학회(구 한국인터넷방송통신학회) 바로가기
  • 간행물
    International Journal of Internet, Broadcasting and Communication 바로가기
  • 통권
    Vol.16 No.3 (2024.08)바로가기
  • 페이지
    pp.205-211
  • 저자
    Yosua Setyawan Soekamto, Dae-Ki Kang
  • 언어
    영어(ENG)
  • URL
    https://www.earticle.net/Article/A456107

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

원문정보

초록

영어
Effective warehouse management requires advanced resource planning to optimize profits and space. Robots offer a promising solution, but their effectiveness relies on embedded artificial intelligence. Multi-agent reinforcement learning (MARL) enhances robot intelligence in these environments. This study explores various MARL algorithms using the Multi-Robot Warehouse Environment (RWARE) to determine their suitability for warehouse resource planning. Our findings show that cooperative MARL is essential for effective warehouse management. IA2C outperforms MAA2C and VDA2C on smaller maps, while VDA2C excels on larger maps. IA2C’s decentralized approach, focusing on cooperation over collaboration, allows for higher reward collection in smaller environments. However, as map size increases, reward collection decreases due to the need for extensive exploration. This study highlights the importance of selecting the appropriate MARL algorithm based on the specific warehouse environment's requirements and scale.

목차

Abstract
1. Introduction
2. Literature Review
2.1 Multi-agent Reinforcement Learning (MARL)
2.2 Cooperative and Competitive Behavior in MARL
2.3 Distinguishing Collaboration and Cooperation Goals in Cooperative Behavior
3. Methodology
3.1 System and Environment
3.2 Independent Synchronous Advantage Actor-Critic
3.3 Multi-Agent Advantage Actor-Critic
3.4 Value Decomposition Advantage Actor-Critic
4. Result and Discussion
5. Conclusion
Acknowledgment
References

키워드

Multi-Agent Reinforcement Learning Multi-Robot Warehouse Environment Independent Advantage Actor-Critic Multi-Agent Advantage Actor-Critic Value Decomposition Advantage Actor-Critic.

저자

  • Yosua Setyawan Soekamto [ PhD Student, Department of Computer Engineering, Dongseo University, Busan, South Korea, Lecturer, Department of Information Systems, Universitas Ciputra Surabaya ]
  • Dae-Ki Kang [ Professor, Department of Computer Engineering, Dongseo University, Busan, South Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

  • 발행기관명
    국제인공지능학회(구 한국인터넷방송통신학회) [The International Association for Artificial Intelligence]
  • 설립연도
    2000
  • 분야
    공학>전자/정보통신공학
  • 소개
    인터넷방송, 인터넷 TV , 방송 통신 네트워크 및 관련 분야에 대한 국내는 물론 국제적인 학술, 기술의 진흥발전에 공헌하고 지식 정보화 사회에 기여하고자 한다.

간행물

  • 간행물명
    International Journal of Internet, Broadcasting and Communication
  • 간기
    계간
  • pISSN
    2288-4920
  • eISSN
    2288-4939
  • 수록기간
    2009~2025
  • 십진분류
    KDC 326 DDC 380

이 권호 내 다른 논문 / International Journal of Internet, Broadcasting and Communication Vol.16 No.3

    피인용수 : 0(자료제공 : 네이버학술정보)

    함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

      페이지 저장