PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
liked a model 2 days ago
stepfun-ai/Step-3.7-Flash liked a dataset 3 months ago
stepfun-ai/Step-3.5-Flash-SFT liked a model 3 months ago
cerebras/Step-3.5-Flash-REAP-121B-A11BOrganizations
None yet