Nagori's picture

Nagori

MohammedNaeem

·

Naeem_1144

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

upvoted a paper 1 day ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

upvoted a paper 1 day ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published 4 days ago • 114

upvoted 2 papers 1 day ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published 6 days ago • 123

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 3 days ago • 90

upvoted 2 papers 2 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published 3 days ago • 21

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 3 days ago • 28

upvoted 3 papers 3 days ago

Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning

Paper • 2601.19280 • Published 5 days ago • 7

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 54

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 6 days ago • 38

upvoted a paper 4 days ago

Towards Pixel-Level VLM Perception via Simple Points Prediction

Paper • 2601.19228 • Published 5 days ago • 15

upvoted a collection 4 days ago

HunyuanImage

4 items • Updated 4 days ago • 13

upvoted a paper 5 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 9 days ago • 170

upvoted a paper 7 days ago

Learning to Discover at Test Time

Paper • 2601.16175 • Published 10 days ago • 41

upvoted 2 collections 8 days ago

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 2 days ago • 36

RWKV7-Gxx-GGUF

GGUF of RWKV7-G series reasoning models • 6 items • Updated 8 days ago • 10

upvoted 2 papers 8 days ago

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published 12 days ago • 23

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 14 days ago • 186

upvoted a paper 9 days ago

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published 12 days ago • 23

upvoted a collection 9 days ago

Nex-N1.1

1 item • Updated 9 days ago • 1

upvoted a paper 9 days ago

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published 10 days ago • 56

upvoted a collection 11 days ago

Deepseek Papers

Deepseek papers collection • 29 items • Updated 3 days ago • 318