Add model card: architecture, training, usage, limitations

by Yatsuiii - opened 17 days ago

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+152

-62

Yatsuiii

Lablab.ai AMD Developer Hackathon org 17 days ago

Documents the LoRA r=16 fine-tune over Qwen2.5-7B-Instruct on AMD MI300X.

Includes: architecture, training hyperparams (600 examples, 3 epochs, bf16), intended use, out-of-scope use, direct-inference code, vLLM serving, evaluation, limitations.

Add model card: architecture, training, usage, limitationsd0b31cf4

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment