当前位置: X-MOL 学术J. Am. Stat. Assoc. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Off-policy Evaluation in Doubly Inhomogeneous Environments
Journal of the American Statistical Association ( IF 3.0 ) Pub Date : 2024-09-09 , DOI: 10.1080/01621459.2024.2395593
Zeyu Bian 1 , Chengchun Shi 2 , Zhengling Qi 3 , Lan Wang 1
Affiliation  

This work aims to study off-policy evaluation (OPE) under scenarios where two key reinforcement learning (RL) assumptions – temporal stationarity and individual homogeneity are both violated. To ha...

中文翻译:


双非均匀环境中的离策略评估



这项工作旨在研究在两个关键的强化学习(RL)假设(时间平稳性和个体同质性)都被违反的情况下的离策略评估(OPE)。至哈...
更新日期:2024-09-12
down
wechat
bug