Sports Video Understanding Benchmarks
AI & ML interests
Computer Vision; Video Understanding; Action Recognition
Recent Activity
Papers
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
SAM 2++: Tracking Anything at Any Granularity
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 2 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 82k • 50 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 24.7k • 47 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 1.34k • 7
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 377 • 67 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 469 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 839 • 22
Sports Video Understanding Benchmarks
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 2 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 82k • 50 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 24.7k • 47 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 1.34k • 7
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 377 • 67 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 469 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 839 • 22
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).