Feature Extraction
Transformers
Safetensors
English
qwen3
speculative-decoding
dflash
speculative-decoding-drafter
kimi-linear
linear-attention
custom_code
Instructions to use Moonlight556/Kimi-Linear-48B-A3B-DFlash-240k with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Moonlight556/Kimi-Linear-48B-A3B-DFlash-240k with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="Moonlight556/Kimi-Linear-48B-A3B-DFlash-240k", trust_remote_code=True)# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("Moonlight556/Kimi-Linear-48B-A3B-DFlash-240k", trust_remote_code=True) model = AutoModel.from_pretrained("Moonlight556/Kimi-Linear-48B-A3B-DFlash-240k", trust_remote_code=True) - Notebooks
- Google Colab
- Kaggle
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!