Josephgflowers/Phinance-Phi-4-mini-instruct-finance-v0.4-with-reasoning-gguf 4B • Updated Oct 24 • 101
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper • 2509.08755 • Published Sep 10 • 56