Trust region policy optimization via entropy regularization for Kullback–Leibler divergence constraint
Article in Neurocomputing (July 2024)
The most recent citing publications are shown below; Dimensions lists 113 publications that cite this research output.
Article in Neurocomputing (July 2024)
Article in Energy and AI (May 2024)
Article in Journal of Infrastructure Preservation and Resilience (April 2024)