Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

Jan 2, 2023·

Vincent Liu

Vincent Liu

,

Yash Chandak

,

Philip Thomas

,

Martha White

· 0 min read

Type

Conference paper

Publication

International Conference on Artificial Intelligence and Statistics

Last updated on Jan 2, 2023

Vincent Liu

Authors

Machine Learning Researcher

← Exploiting action impact regularity and exogenous state variables for offline reinforcement learning Jan 3, 2023

Measuring and Mitigating Interference in Reinforcement Learning Jan 1, 2023 →