meta-llama/Llama-4-Scout-17B-16E-Instruct Image-to-Text • 109B • Updated May 22, 2025 • 204k • 1.2k
Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.68M • 1.42k
3loi/SER-Odyssey-Baseline-WavLM-Categorical Audio Classification • 0.3B • Updated Jun 12, 2024 • 8.02k • 10
3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes Audio Classification • 0.3B • Updated Jun 12, 2024 • 565 • 9
3loi/SER-Odyssey-Baseline-WavLM-Arousal Audio Classification • 0.3B • Updated Jun 12, 2024 • 177 • 2
3loi/SER-Odyssey-Baseline-WavLM-Valence Audio Classification • 0.3B • Updated Jun 12, 2024 • 31 • 1
3loi/SER-Odyssey-Baseline-WavLM-Dominance Audio Classification • 0.3B • Updated Jun 12, 2024 • 3 • 1
google/vit-base-patch16-224-in21k Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 817k • 393