WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 5 days ago • 38
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 4 days ago • 26
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published 8 days ago • 5
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Paper • 2605.23888 • Published 4 days ago • 8
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published 6 days ago • 92
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published 6 days ago • 97
EMO: Pretraining mixture of experts for emergent modularity Article • allenai • 18 days ago • 38
OlmoEarth v1.1: A more efficient family of Earth observation models Article • allenai • 7 days ago • 18
Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models Article • nvidia • 3 days ago • 17
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps Paper • 2605.16928 • Published 10 days ago • 89
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 6 days ago • 201
Quantitative Video World Model Evaluation for Geometric-Consistency Paper • 2605.15185 • Published 12 days ago • 3
RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO Paper • 2605.15190 • Published 12 days ago • 13