Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments

Publication
International Conference on Artificial Intelligence and Statistics
Vincent Liu
Vincent Liu
PhD Candidate

I am a PhD candidate working on reinforcement learning.