arXiv Xidong Feng, Bo Liu, Yan Song, Haotian Fu, Ziyu Wan, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang
arXiv preprint • 2024 • Preprint
Feng, X., Liu, B., Song, Y., Fu, H., Wan, Z., Koushik, G. A., Hu, Z., Yang, M., Wen, Y., & Wang, J. (2024). Natural Language Reinforcement Learning. arXiv preprint arXiv:2411.14251.