DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Bio
I am currently a M.S. student at the University of Washington. At UW, I am fortunate to work with Ph.D. Student Rulin Shao, Prof. Pang Wei Koh, and Prof. Akari Asai. I used to intern at ByteDance Seed and Shanghai AI Laboratory. I obtained my B.S. in Computer Science from Jilin University.
Research Interests
- Deep Research Agents
- Test time scaling
- Human LLM collaboration
News
-
Feb. 2026
Our DR Tulu demo is released, please check it out!
-
Nov. 2025
Our DR Tulu paper is released on arXiv, this is the first fully open source deep research agent!
-
Sep. 2025
Started my M.S. journey at UW!
Publications
Projects
A reinforcement learning framework for deep research that uses evolving rubrics to improve planning, evidence grounding, and long-form report quality.
A strong embedding model focused on retrieval and reasoning quality, designed for robust performance on benchmarks such as MTEB and BRIGHT.
An open large language model family with training, alignment, and evaluation components for general-purpose NLP and agent-style use cases.
An extensible LLM evaluation platform that supports diverse benchmarks across code, agents, long context, math, and instruction following.
Education
-
M.S. in ECE
University of Washington
Sep. 2025 - Present
-
B.S. in CS
Jilin University
Sep. 2020 - Jun. 2024
Experience
-
Research Assistant
UW NLP
Working on deep research agents.
Sep. 2025 - Present
-
Research Intern
ByteDance Seed
Worked on reasoning-intensive retrieval.
Mar. 2025 - Sep. 2025
-
Research Intern
Shanghai AI Laboratory
Worked on LLM post-training and evaluation.
Sep. 2023 - May. 2024
Service
-
Reviewer
ICLR, ICML, ACL, EMNLP, NAACL, COLING
Misc
- Piano
- Music
- Travel
- Snowboard