arxiv:2604.13016
Wenkai Yang
Keven16
AI & ML interests
None yet
Recent Activity
new activity 20 days ago
Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500:What is the data source used for training this model? authored a paper about 1 month ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe upvoted a paper about 2 months ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and RecipeOrganizations
None yet