baojian1024's Collections
Audio
Updated 5 days ago
microsoft/VibeVoice-ASR • Automatic Speech Recognition • 9B • Updated Jan 27 • 577k • 1.12k
CohereLabs/cohere-transcribe-03-2026 • Automatic Speech Recognition • Updated 4 days ago • 256k • 933
JiongzeYu/SparkVSR • Updated Apr 4 • 1.06k • 58
smthem/SparkVSR-GGUF • 6B • Updated Mar 25 • 38 • 4
microsoft/VibeVoice-1.5B • Text-to-Speech • 3B • Updated Jan 22 • 259k • 2.37k
microsoft/VibeVoice-Realtime-0.5B • Text-to-Speech • 1B • Updated Dec 12, 2025 • 949k • 1.22k
meituan-longcat/LongCat-AudioDiT-3.5B • 4B • Updated Apr 3 • 5.14k • 69
openbmb/VoxCPM2 • Text-to-Speech • Updated 23 days ago • 175k • 1.29k
k2-fsa/OmniVoice • Text-to-Speech • Updated 1 day ago • 2.24M • 809
YJX-Xiaomi/ControlFoley • Text-to-Audio • Updated 17 days ago • 9