GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning Paper • 2511.11653 • Published Nov 10, 2025 • 59
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 375k • 1.6k
SimpleRL-Zoo Collection The collection for the Paper "SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild" • 12 items • Updated Mar 2 • 8
Running 133 Open FinLLM Leaderboard 🥇 133 Explore and compare LLM performance on financial benchmarks
nishadsinghi/math7500_train_solutions_DeepSeek-R1-Distill-Qwen-7B_32K_tokens Viewer • Updated Feb 13, 2025 • 7.45k • 5 • 2