QINGHE WANG

Qinghew

https://qinghew.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 7 days ago • 21

liked a model 4 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 2 days ago • 134k • 962

upvoted a paper 11 days ago

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Paper • 2606.13432 • Published 15 days ago • 109

liked a model 22 days ago

jdopensource/JoyAI-Echo

Text-to-Video • Updated 8 days ago • 15.7k • 152

upvoted a paper 29 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published about 1 month ago • 75

updated a Space about 1 month ago

CharacterFactory

🖼

Generate consistent character images from text prompts

upvoted a paper about 1 month ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

upvoted 3 papers 2 months ago

liked a dataset 2 months ago

Haoyuwu/MultiWorldData

Updated Apr 20 • 381 • 7

upvoted a paper 2 months ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

liked a model 2 months ago

meituan-longcat/LongCat-Next

Any-to-Any • 74B • Updated Apr 10 • 788 • 179

upvoted an article 3 months ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Mar 5 • 167

liked a dataset 3 months ago

8ruceLi/VFXMaster_datasets

Viewer • Updated Apr 7 • 9.21k • 9 • 3

upvoted 3 papers 3 months ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 204

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 157

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

liked a model 3 months ago

KlingTeam/CamCloneMaster-Wan2.1

Updated Mar 25 • 13

upvoted a paper 3 months ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 36