Trust region policy optimization via entropy regularization for Kullback–Leibler divergence constraint
Article in Neurocomputing (July 2024)
The most recent citing publications are listed below. Dimensions records 117 publications that cite this research output.
Article in Neurocomputing (July 2024)
Conference proceeding (May 2024)
Article in Quantum Information Processing (May 2024)