About Me

I’m a Ph.D. student in the Department of Computer Science at University College London (UCL), advised by Professor Jun Wang. After years in industry, I have returned to academia to pursue my passion for research. I previously received my M.Sc. in Computational Statistics and Machine Learning (CSML) from UCL.

My research interests lie in Reinforcement Learning, Multi-Agent Systems, and Large Language Models.

If you’d like to discuss potential collaborations or shared research interests, feel free to contact me at yan.song.24[at]ucl.ac.uk.

News

[2026.02] We have been closely collaborating with Li Auto on several research topics. Now we have released our first joint paper: Hardware Co-Design Scaling Laws via Roofline Modelling for On-Device LLMs. Well Done Guys! Stay tuned for more to come out!
[2025.05] Our paper Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models got accepted by ECML-PKDD 2025. A Testament to Persistence!
[2025.05] We have successfully held the AAMAS 2025 Online AI Competitions!
[2025.03] RL can now interactively train two LLM agents to reason ! Our paper ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning is available on Arxiv ! (Neurips 2025)
[2025.01] Our paper Efficient Reinforcement Learning with Large Language Model Priors got accepted by ICLR 2025 !
[2024.10] We release our LLM reasoning framework – OpenR !

Yan Song

News