Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published Oct 25, 2025 • 86
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30, 2025 • 88
Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization Paper • 2510.25616 • Published Oct 29, 2025 • 107
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published Dec 4, 2025 • 84
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 91
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 305
Towards Truly Multilingual ASR: Generalizing Code-Switching ASR to Unseen Language Pairs Paper • 2606.05846 • Published 4 days ago • 4
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 10 days ago • 59
OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models Paper • 2604.00688 • Published Apr 1, 2026 • 16
PARCEL: Pool-Anchored Resampling with Conditioned Elastic Queries for Efficient Vision-Language Understanding Paper • 2605.30126 • Published 11 days ago • 10
Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching Paper • 2606.03911 • Published 6 days ago • 22
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 261
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 105
PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval Paper • 2603.01493 • Published Mar 2, 2026 • 21
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published Feb 25, 2026 • 45
Training a Student Expert via Semi-Supervised Foundation Model Distillation Paper • 2604.03841 • Published Apr 4, 2026 • 11
MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control Paper • 2604.06156 • Published Apr 7, 2026 • 11
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published Feb 26, 2026 • 18
How to Take a Memorable Picture? Empowering Users with Actionable Feedback Paper • 2602.21877 • Published Feb 25, 2026 • 17