fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving • Paper • 2502.05370 • Published Feb 7
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss • Paper • 2512.23447 • Published 2 days ago