Combinatorial Game Theory Meets Deep Learning : Efficient Endgame Analysis in Go

Stanisław Frejlak

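Publisher: International Society of Go Studies (국제바둑학회, formerly 한국바둑학회)
Journal: Journal of Go Studies (바둑학연구)
Issue: Vol. 18, No. 2, Serial No. 32 (December 2024)
Pages: 17-50
Author: Stanisław Frejlak
Language: English (ENG)
URL: https://www.earticle.net/Article/A463234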

Article information

Abstract

The endgame stage of Go presents a unique challenge for scientific research. Unlike in earlier stages of the game, the key to successful endgame analysis is decomposing the board into smaller, independent local positions. Go players typically analyze these positions separately and prioritize moves based on their value. In this paper, I introduce a novel program that automates this decomposition-based analysis for the endgame stage of Go.

AlphaZero revolutionized Computer Go by applying a generic move-selection mechanism based on neural network judgments and the MCTS search algorithm. However, it does not specifically address the complexity of the endgame in the aforementioned manner. By leveraging decomposition-based analysis, my program instead reaches endgame decisions with relatively little computation. Additionally, it offers insights for Go practitioners by providing accurate move value evaluations.

Notable prior work on automated endgame analysis was done by Martin Müller (1995). His program Explorer checked all possible variations in every undecided position and aggregated the results using an algorithm inspired by Combinatorial Game Theory (CGT). However, because the number of variations grows exponentially, Explorer's application was limited to small, tightly bounded local positions. In contrast, my program uses a neural network to predict optimal local moves, dramatically reducing the number of variations that need to be explored. Provided that the neural network's predictions are correct, the program can accurately evaluate move values by considering relatively few variations, just as human Go experts do. Thanks to this approach, it is the first program capable of analyzing large, unbounded local positions, which are commonly encountered in real games.

The neural network was fine-tuned from a pre-trained AlphaZero reimplementation on the task of optimal local move prediction. Training data was gathered from KataGo self-play games, utilizing KataGo's network to perform board decomposition.
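To make the idea of prioritizing local positions by move value concrete, here is a minimal Python sketch. It is not the paper's program and uses no neural network. It assumes every local position has already been reduced to a simple CGT switch {b | w}, where b is the local score if Black plays first and w the score if White plays first. For such a switch, the standard CGT mean is (b + w) / 2 and the temperature (the move value) is (b - w) / 2. The `Switch` class, the `hottest_first` helper, and the example positions are all hypothetical names and data chosen for illustration.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class Switch:
    """Hypothetical local position {black_score | white_score}."""
    name: str
    black_score: int  # local result (Black's perspective) if Black plays first
    white_score: int  # local result (Black's perspective) if White plays first

    def __post_init__(self):
        # A switch is only "hot" (worth playing in) when moving first helps.
        if self.black_score < self.white_score:
            raise ValueError("expected black_score >= white_score")

    def mean(self) -> Fraction:
        """Expected local score: (b + w) / 2."""
        return Fraction(self.black_score + self.white_score, 2)

    def temperature(self) -> Fraction:
        """Value of moving first here: (b - w) / 2."""
        return Fraction(self.black_score - self.white_score, 2)


def hottest_first(positions):
    """Order local positions by decreasing temperature."""
    return sorted(positions, key=lambda p: p.temperature(), reverse=True)


# Made-up decomposition of a board into three independent local positions.
board = [
    Switch("upper-left", 4, 0),
    Switch("right side", 7, -3),
    Switch("lower edge", 1, 0),
]

order = [p.name for p in hottest_first(board)]
estimate = sum(p.mean() for p in board)

print(order)     # play order: largest move value first
print(estimate)  # sum of the local means
```

Playing the hottest position first is optimal for sums of simple switches like these. Real endgame positions are deeper game trees with forcing moves, which is where the temperature calculation described in the paper becomes necessary.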

Contents

Abstract
I. Introduction
1. Mathematical approach to endgame
2. AlphaZero mode of operation
3. AI estimating move values
II. Related work
III. Goal of this work
1. Canonical forms vs. temperature theory
2. Forcing moves in light of CGT
3. Chosen approach
IV. Methods
1. Information to be predicted by the network
2. Model architecture
3. Training data construction
4. Data augmentation and sampling
5. Training procedure
6. Calculating temperature
V. Results
1. Comparison with baseline
2. Qualitative analysis
VI. Future work
Conclusion
References

Keywords

AlphaZero, Fine-Tuning, Combinatorial Game Theory, Temperature, Move Values

Author

Stanisław Frejlak [University of Warsaw, Poland]


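Publication information

Publisher

Name: 국제바둑학회 (formerly 한국바둑학회) [International Society of Go Studies]
Founded: 2003
Field: Arts and Physical Education > Other Arts and Physical Education

Journal

Name: 바둑학연구 [Journal of Go Studies]
Frequency: Semiannual
pISSN: 1738-3730
Coverage: 2004~2025
Decimal classification: KDC 691, DDC 794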
