FlexTok AR Models EPFL-VILAB/FlexAR-3B-T2I Text-to-Image • Updated 17 days ago • 60 EPFL-VILAB/FlexAR-113M-T2I Text-to-Image • Updated 17 days ago • 16 • 1 EPFL-VILAB/FlexAR-382M-T2I Text-to-Image • Updated 17 days ago • 8 EPFL-VILAB/FlexAR-1B-T2I Text-to-Image • Updated 17 days ago • 7
Vlm MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published Feb 2, 2025 • 24 DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published Feb 3, 2025 • 25 ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published Feb 3, 2025 • 21 The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published Feb 3, 2025 • 13
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published Feb 2, 2025 • 24
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published Feb 3, 2025 • 25
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published Feb 3, 2025 • 21
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published Feb 3, 2025 • 13
FlexTok AR Models EPFL-VILAB/FlexAR-3B-T2I Text-to-Image • Updated 17 days ago • 60 EPFL-VILAB/FlexAR-113M-T2I Text-to-Image • Updated 17 days ago • 16 • 1 EPFL-VILAB/FlexAR-382M-T2I Text-to-Image • Updated 17 days ago • 8 EPFL-VILAB/FlexAR-1B-T2I Text-to-Image • Updated 17 days ago • 7
Vlm MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published Feb 2, 2025 • 24 DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published Feb 3, 2025 • 25 ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published Feb 3, 2025 • 21 The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published Feb 3, 2025 • 13
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published Feb 2, 2025 • 24
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published Feb 3, 2025 • 25
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published Feb 3, 2025 • 21
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published Feb 3, 2025 • 13