Logarithmic Regret Bounds for Continuous-Time Average-Reward Markov Decision Processes
Article in SIAM Journal on Control and Optimization (September 2024)
The most recent citing publications are shown below. View all 259 publications that cite this research output on Dimensions.
Article in SIAM Journal on Control and Optimization (September 2024)
Preprint in arXiv (August 2024)
Article in Applied Mathematics & Optimization (June 2024)