Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
baojian1024 's Collections
Video
Audio
Image
OCR
Comfyui
LTX-2.3
3D models

Audio

updated 5 days ago
Upvote
-

  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 577k • 1.12k

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 4 days ago • 256k • 933

  • JiongzeYu/SparkVSR

    Updated Apr 4 • 1.06k • 58

  • smthem/SparkVSR-GGUF

    6B • Updated Mar 25 • 38 • 4

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 259k • 2.37k

  • microsoft/VibeVoice-Realtime-0.5B

    Text-to-Speech • 1B • Updated Dec 12, 2025 • 949k • 1.22k

  • meituan-longcat/LongCat-AudioDiT-3.5B

    4B • Updated Apr 3 • 5.14k • 69

  • openbmb/VoxCPM2

    Text-to-Speech • Updated 23 days ago • 175k • 1.29k

  • k2-fsa/OmniVoice

    Text-to-Speech • Updated 1 day ago • 2.24M • 809

  • YJX-Xiaomi/ControlFoley

    Text-to-Audio • Updated 17 days ago • 9
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs