1 27 184

Joshua Chak

JoshuaChak

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

MediaTek-Research/Breeze-ASR-25

upvoted a paper 13 days ago

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

liked a model 13 days ago

Supertone/supertonic

View all activity

Organizations

upvoted a paper 13 days ago

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Paper • 2511.11007 • Published 23 days ago • 15

upvoted an article 23 days ago

Article

We’re open-sourcing our text-to-image model and the process behind it

25 days ago

•

upvoted 2 papers 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 158

upvoted a paper 4 months ago

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56

upvoted an article 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

upvoted a paper 5 months ago

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Paper • 2506.20639 • Published Jun 25 • 31

upvoted 2 papers 6 months ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9 • 30

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 38

upvoted a paper 9 months ago

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Paper • 2503.10625 • Published Mar 13 • 33

upvoted a collection 9 months ago

OLMo 2

Collection

Artifacts for the OLMo 2 release. • 35 items • Updated 8 days ago • 149

upvoted a paper 10 months ago

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3 • 24

upvoted a paper 12 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 158

upvoted 6 papers about 1 year ago

upvoted a paper over 1 year ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 34