About Me
I’m a first-year Ph.D. student in the Department of Computer Science at University College London (UCL), advised by Professor Jun Wang. After three years in industry, I have returned to academia to pursue my passion for research. I previously received my M.Sc. in Computational Statistics and Machine Learning (CSML) from UCL.
My research interests lie in Reinforcement Learning, Multi-Agent Systems, and Large Language Models. And yes, I proudly identify as a Bayesianist!
If you’d like to discuss potential collaborations or shared research interests, feel free to contact me at yan.song.24[at]ucl.ac.uk.
News
[2025.03] RL can now interactively train two LLM agents to reason ! Our paper ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning is available on Arxiv !
[2025.01] Our paper Efficient Reinforcement Learning with Large Language Model Priors got accepted by ICLR 2025 !
[2024.10] We release our LLM reasoning framework – OpenR !