-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 23 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2507.07105
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157 -
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper • 2507.07105 • Published • 105 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 93
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 18 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 62 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 19 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 78
-
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Paper • 2503.13434 • Published • 27 -
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Paper • 2503.13327 • Published • 29 -
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
Paper • 2503.13435 • Published • 18 -
MediaTek-Research/Llama-Breeze2-8B-Instruct
8B • Updated • 1.34k • 49
-
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 263 -
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 240 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper • 2409.02097 • Published • 34 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper • 2409.11406 • Published • 27 -
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 126 -
Segment Anything with Multiple Modalities
Paper • 2408.09085 • Published • 22
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 23 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277 -
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 263 -
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 240 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 157 -
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper • 2507.07105 • Published • 105 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 93
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 18 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 62 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 19 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 78
-
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Paper • 2503.13434 • Published • 27 -
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Paper • 2503.13327 • Published • 29 -
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
Paper • 2503.13435 • Published • 18 -
MediaTek-Research/Llama-Breeze2-8B-Instruct
8B • Updated • 1.34k • 49
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper • 2409.02097 • Published • 34 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper • 2409.11406 • Published • 27 -
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 126 -
Segment Anything with Multiple Modalities
Paper • 2408.09085 • Published • 22