Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is also available for digesting.

Pages

Posts

portfolio

publications

Natural Language Reinforcement Learning

Submitted to arXiv as a preprint, 2024

Xidong Feng, Bo Liu, Yan Song, Haotian Fu, Ziyu Wan, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang

Recommended citation: Feng, X., Liu, B., Song, Y., Fu, H., Wan, Z., Koushik, G. A., Hu, Z., Yang, M., Wen, Y., & Wang, J. (2024). Natural Language Reinforcement Learning. arXiv preprint arXiv:2411.14251.
Download Paper

Efficient Reinforcement Learning with Large Language Model Priors

Published in ICLR 2025, 2025

Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou Ammar, Jun Wang

Recommended citation: Yan, X., Song, Y., Feng, X., Yang, M., Zhang, H., Bou Ammar, H., & Wang, J. (2025). Efficient Reinforcement Learning with Large Language Model Priors. International Conference on Learning Representations (ICLR).
Download Paper

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

Published in ECML-PKDD 2025, 2025

Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang

Recommended citation: Yan, X., Song, Y., Cui, X., Christianos, F., Zhang, H., Mguni, D. H., & Wang, J. (2025). Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD).
Download Paper

REMA: Learning to Meta-Think for LLMs with Multi-Agent Reinforcement Learning

Published in NeurIPS 2025, 2025

Ziyu Wan, Yunxiang Li, Xiaoyu Wen, Yan Song, Hanjing Wang, Linyi Yang, Mark Schmidt, Jun Wang, Weinan Zhang, Shuyue Hu, Ying Wen

Recommended citation: Wan, Z., Li, Y., Wen, X., Song, Y., Wang, H., Yang, L., Schmidt, M., Wang, J., Zhang, W., Hu, S., & Wen, Y. (2025). REMA: Learning to Meta-Think for LLMs with Multi-Agent Reinforcement Learning. Advances in Neural Information Processing Systems (NeurIPS).
Download Paper

talks

teaching