deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated about 4 hours ago • 138k • • 2.96k
DavidAU/Qwen3.6-27B-NEO-CODE-Di-IMatrix-MAX-GGUF Image-Text-to-Text • 27B • Updated 1 day ago • 17.8k • 26
Running Featured 201 Gemma 4 WebGPU 🚀 201 Run Gemma 4 locally in-browser on WebGPU w/ Transformers.js
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation Paper • 2604.08455 • Published 18 days ago • 47
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Paper • 2410.10813 • Published Oct 14, 2024 • 16
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 • 73
arcee-ai/Trinity-Large-Thinking Text Generation • 399B • Updated 18 days ago • 25.7k • • 164