-
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper • 2506.07491 • Published • 52 -
Story2Board: A Training-Free Approach for Expressive Storyboard Generation
Paper • 2508.09983 • Published • 70 -
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Paper • 2503.01710 • Published • 6 -
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Paper • 2507.21809 • Published • 143
Samuel Thio
sthio90
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models upvoted a paper about 1 hour ago
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning upvoted an article 4 days ago
Reachy Mini goes fully local