Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
419.2
TFLOPS
93
77
184
Yaowei Zheng
hiyouga
Follow
JOIO's profile picture
awacke1's profile picture
ben81828's profile picture
3,497 followers
·
38 following
https://github.com/hiyouga
hiyouga_dev
hiyouga
hiyouga
AI & ML interests
LLM Training System
Recent Activity
posted
an
update
3 days ago
Follow my X account — I'll be sharing thoughts and findings on building open-source AI Agent projects, Agent Memory, and Observability. Thanks for connecting! https://x.com/hiyouga_dev
liked
a dataset
5 days ago
DEVAI-benchmark/DEVAI
updated
a Space
29 days ago
hiyouga/LLaMA-Board
View all activity
Organizations
hiyouga
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
5 days ago
DEVAI-benchmark/DEVAI
Preview
•
Updated
Oct 24, 2024
•
571
•
22
liked
a dataset
7 months ago
neulab/agent-data-collection
Preview
•
Updated
Mar 9
•
5.76k
•
112
liked
a model
9 months ago
microsoft/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
Jan 22
•
73.9k
•
2.39k
liked
a model
10 months ago
internlm/Intern-S1-mini
Image-Text-to-Text
•
9B
•
Updated
Mar 29
•
15.7k
•
115
liked
a dataset
10 months ago
nvidia/Llama-Nemotron-VLM-Dataset-v1
Viewer
•
Updated
Oct 22, 2025
•
2.86M
•
3.84k
•
166
liked
2 models
10 months ago
janhq/Jan-v1-4B
Text Generation
•
4B
•
Updated
Aug 23, 2025
•
1.32k
•
•
355
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
Sep 15, 2025
•
260k
•
465
liked
a dataset
10 months ago
allenai/WildChat-4.8M
Viewer
•
Updated
Aug 11, 2025
•
3.2M
•
5.51k
•
159
liked
a model
10 months ago
openai/gpt-oss-20b
Text Generation
•
22B
•
Updated
Aug 26, 2025
•
7.57M
•
•
4.69k
liked
a dataset
10 months ago
JT-LM/JIUTIAN-TReB
Updated
Sep 9, 2025
•
69
•
4
liked
a Space
11 months ago
Runtime error
19
Megatron Memory Estimator
👁
19
Estimate GPU memory usage for Megatron models
liked
a model
11 months ago
moonshotai/Kimi-K2-Instruct
Text Generation
•
1T
•
Updated
Apr 23
•
638k
•
•
2.37k
liked
a dataset
11 months ago
data-for-agents/insta-150k-v3
Viewer
•
Updated
May 28, 2025
•
146k
•
46
•
16
liked
a model
11 months ago
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
•
10B
•
Updated
Oct 25, 2025
•
511k
•
776
liked
a dataset
12 months ago
Saigyouji-Yuyuko1000/dapo17k
Viewer
•
Updated
Jun 23, 2025
•
17.9k
•
134
•
2
liked
2 models
12 months ago
reducto/RolmOCR
Image-Text-to-Text
•
8B
•
Updated
Apr 2, 2025
•
236k
•
586
nanonets/Nanonets-OCR-s
Image-Text-to-Text
•
4B
•
Updated
Jun 20, 2025
•
122k
•
1.59k
liked
a dataset
about 1 year ago
open-thoughts/OpenThoughts3-1.2M
Viewer
•
Updated
Jun 9, 2025
•
1.2M
•
25.2k
•
234
liked
2 models
about 1 year ago
open-thoughts/OpenThinker3-7B
Text Generation
•
8B
•
Updated
Jun 9, 2025
•
3.29k
•
•
135
ByteDance-Seed/BAGEL-7B-MoT
Any-to-Any
•
15B
•
Updated
Jan 9
•
871
•
1.2k
Load more