AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs Paper • 2505.11557 • Published May 15, 2025 • 7
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26, 2025 • 80
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 12 days ago • 60