A Copula-Based CNN-LSTM Deep Q-Network Framework for Multi-Stage Sequential Treatment Optimization in Pseudo-Temporal Environments

Jong-Min Kim; Jinhwa Kim

216.73.216.207

개인회원 가입

개인회원
기관회원

개인회원 로그인

개인회원 가입으로 더욱 편리하게 이용하세요. 개인회원 가입

아이디/비밀번호를 잊으셨나요? 아이디/비밀번호 찾기

기관회원 로그인

소속기관에서 검색되지 않는 기관은 무료원문다운이 불가능합니다. 개인회원 가입 후 유료구매를 하시거나 소속기관 도서관에 이용문의해 주세요.

Home

A Copula-Based CNN-LSTM Deep Q-Network Framework for Multi-Stage Sequential Treatment Optimization in Pseudo-Temporal Environments

발행기관

대한산업경영학회 바로가기
간행물

International Journal of Intelligent Technologies and Innovative Practices 바로가기
통권

Vol. 1 No. 2 (2026.04)바로가기
페이지

pp.1-20
저자

Jong-Min Kim, Jinhwa Kim
언어

영어(ENG)
URL

https://www.earticle.net/Article/A484934

※ 기관로그인 시 무료 이용이 가능합니다.

5,500원

원문정보

초록

영어: Sequential decision-making in dynamic, heterogeneous environments is often hindered by multivariate and correlated outcomes. This study introduces a unified Copula-CNN-LSTM Deep Q-Network (DQN) framework for multi-stage individualized policy learning in pseudo temporal settings. Motivated by the need for agents that account for inter-outcome dependencies, we extend static covariates from benchmark datasets (Boston Housing and Wine Quality Red) into pseudo-temporal sequences to emulate state transitions. Multivariate rewards with controlled correlations (ρ = 0.5 and ρ = −0.5) are standardized via an empirical copula transformation to assess policy robustness under varying dependency structures. The DQN agent optimizes policies using experience replay and temporal discounting of state-action reward trajectories. The framework demonstrates stable convergence in average rewards across both datasets under positive and negative correlation structures. Analysis of the resulting dynamic conditional average treatment effects (CATEs) across outcome dimensions highlights the model’s ability to discern heterogeneous treatment impacts. Furthermore, learned policy matrices and dynamic Directed Acyclic Graphs (DAGs) reveal interpretable temporal dependencies, with edge structures reflecting the complex multivariate nature of the optimal policy. Overall, the proposed framework effectively captures inter-temporal dependencies and adapts to correlated rewards, providing a scalable and interpretable solution for sequential decision making in complex environments.

Abstract
1. INTRODUCTION
2. METHODS
2.1. Sequential Decision-Making Framework
2.2. Treatment Assignment Model
2.3. Multistage Outcome Generation with Heterogeneous Treatment Effects
2.4. Empirical Copula Transformation for Normalization
2.5. Dynamic Conditional Average Treatment Effect (CATE)
2.6. Deep Q-Network (DQN) for Multistage Policy Learning
2.7. Theoretical Justification and Policy Learning
2.8. Learned Dynamic Directed Acyclic Graph (DAG)
2.9. Performance Metrics
3. DATA ANALYSIS
3.1. Data Sets
3.2. Descriptive Statistics
3.3. Experimental Setup
3.4. Outcome Simulation and Transformation
3.5. Policy Learning and Replay Memory
3.6. Dynamic DAG Estimation
3.7. Evaluation Metrics
3.8. Results Overview
3.9. Visualization
3.10. Analysis of Temporal Model Structure and Dynamics
4. CONCLUSION
DATA AVAILABILITY
REFERENCES

키워드

Deep Q-Network Sequential Decision-Making Multivariate Outcomes Copula CNNLSTM Conditional Average Treatment Effect

저자

Jong-Min Kim [ Statistics Discipline, Division of Science and Mathematics, University of Minnesota-Morris, Morris, MN 56267, USA / EGADE Business School, Tecnologico de Monterrey, Ave. Rufino Tamayo, Garza Garcia, NL,´ CP. 66269, Mexico ]
Jinhwa Kim [ School of Business, Sogang University, Seoul, South Korea ] Corresponding Author

참고문헌

자료제공 : 네이버학술정보

간행물 정보

발행기관

발행기관명

대한산업경영학회 [Dae Han Society of Industrial Management]
설립연도
2003
분야
복합학>과학기술학
소개
본 학회는 산업체·학계·연구소 등의 회원 상호간에 정보교환 및 지원을 통하여 산업경영에 관한 학문발전을 도모하고 산학에 관한 긴밀한 네트워크를 형성하여 기업의 경쟁력을 강화시키는데 그 설립 목적을 두고 있다.

간행물

간행물명

International Journal of Intelligent Technologies and Innovative Practices
간기
계간
eISSN
3092-412X
수록기간
2026~2026
십진분류
KDC 323 DDC 338

이 권호 내 다른 논문 / International Journal of Intelligent Technologies and Innovative Practices Vol. 1 No. 2

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.

출처 : 네이버학술정보

0개의 논문이 장바구니에 담겼습니다.

페이지 저장

소속기관 조회

이용자님의 소속기관(단체)이 서비스에 가입되어 있는지 확인해 보십시오.
기관회원에 소속되어 있는 이용자는 원문을 무료로 이용할 수 있습니다.

상호: 주식회사 학술교육원 I 대표: 노방용 I 사업자등록번호: 122-81-88227 I 통신판매업신고번호: 제2008-인천부평-00176호 I 정보보호책임자: 이두영
주소: (21319)인천광역시 부평구 영성중로 50 미래타워 701호 I 전화: 0505-555-0740 I 팩스: 0505-555-0741 I 이메일: earticle@earticle.net

음성지원 및 돋보기 서비스

Earticle

A Copula-Based CNN-LSTM Deep Q-Network Framework for Multi-Stage Sequential Treatment Optimization in Pseudo-Temporal Environments

원문정보

초록

목차

키워드

저자

참고문헌

간행물 정보

발행기관

간행물

이 권호 내 다른 논문 / International Journal of Intelligent Technologies and Innovative Practices Vol. 1 No. 2

피인용수 : 0건 (자료제공 : 네이버학술정보)

함께 이용한 논문 이 논문을 다운로드한 분들이 이용한 다른 논문입니다.