Collections

Collections including paper arxiv:2504.08791
inference optimization
Collection by
Apr 26
Research • Archive
Long-term archive of papers, models, datasets, and tools worth revisiting. Curated for reference, replication, and future deep dives.
AI-paper
Collection by
Mar 4
Infra • Serving & Optimization
Inference engines, quantization, serving stacks, and perf tooling. Reference list for deployment and latency/cost work.
[papers] Distillation
Collection by
Feb 22
LLM
Collection by
Jan 28
video
Collection by
May 3, 2025