stepfun-ai/Step-Audio-R1
Audio-Text-to-Text • 33B • Updated • 37 • 144
Open source models with audio understanding. Tracking mostly vendor releases in the audio and text to text subclassification of multimodal.
View and compare audio model performance rankings