I'm a first-year PhD student at the KEG Lab, Department of Computer Science, Tsinghua University. My research focuses on reinforcement learning for large language models (LLM RL) and deep research agents (Deep Search Agent). My recent work includes DeepDive (first author, GitHub ~200 Stars 🌟, Hugging Face ~2k Downloads), GLM-4.5 (core contributor, GitHub ~3k Stars), TreeRL (co-first author, ACL 2025 Main), and AgentTuning (co-first author, ACL 2024 Findings, GitHub ~1.5k Stars 🌟, Hugging Face ~20k Downloads). Feel free to connect with me on WeChat (ID: learning_rate); I'm happy to chat! 😊