1193 1192

Starstrek

Stars321123

Stars321

AI & ML interests

Recent Activity

upvoted a paper about 4 hours ago

What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

liked a model about 4 hours ago

easygoing0114/Z-Image_clear_vae

upvoted a paper about 4 hours ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

View all activity

Organizations

upvoted a paper about 4 hours ago

What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

Paper • 2512.00425 • Published 7 days ago • 45

liked a model about 4 hours ago

easygoing0114/Z-Image_clear_vae

Updated about 12 hours ago • 11

upvoted 2 papers about 4 hours ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 5 days ago • 48

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published 4 days ago • 25

upvoted 2 collections about 5 hours ago

Self-Calibration

Collection

Efficient Test-Time Scaling via Self-Calibration https://arxiv.org/abs/2503.00031 • 7 items • Updated Jun 8 • 3

PosS-Speculative-Decoding

Collection

This collection contains models of the paper "PosS:Position Specialist Generates Better Draft for Speculative Decoding" • 9 items • Updated Jun 5 • 2

upvoted 2 papers about 6 hours ago

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published 4 days ago • 47

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published 2 days ago • 28

upvoted a paper 1 day ago

PixelDiT: Pixel Diffusion Transformers for Image Generation

Paper • 2511.20645 • Published 11 days ago • 24

upvoted 4 articles 1 day ago

Article

Swift Transformers Reaches 1.0 – and Looks to the Future

Sep 26

•

Article

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

16 days ago

•

Article

We Got Claude to Fine-Tune an Open Source LLM

2 days ago

•

205

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

2 days ago

•

liked a Space 2 days ago

Huggingface Leaderboard

🏆

101

Generate Hugging Face leaderboard stats

liked 2 models 2 days ago

nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14 • 55.4k • 775

Qwen/Qwen2.5-Math-72B

Text Generation • 73B • Updated Sep 23, 2024 • 1.37k • 17

liked a dataset 2 days ago

nvidia/Nemotron-CrossThink

Preview • Updated May 1 • 310 • 112

upvoted 2 papers 2 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published 11 days ago • 23

Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

Paper • 2512.01030 • Published 6 days ago • 16

liked a Space 2 days ago

WER

🤗

Starstrek

AI & ML interests

Recent Activity

Organizations

Stars321123's activity

Swift Transformers Reaches 1.0 – and Looks to the Future

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

We Got Claude to Fine-Tune an Open Source LLM

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Huggingface Leaderboard

WER