dhruva-sarma (Dhruvajyoti Sarma)

upvoted an article 4 months ago

Optimizing your LLM in production

Article • Sep 15, 2023 • 22

upvoted a paper 4 months ago

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 25

upvoted 2 articles 4 months ago

🪆 Introduction to Matryoshka Embedding Models

Article • Feb 23, 2024 • 187

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Article • Sep 10, 2025 • 109

upvoted a collection 6 months ago

Google's Gemma models family

Collection

332 items • Updated 22 days ago • 659

upvoted an article 8 months ago

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Article • Apr 29, 2025 • 43

upvoted a paper 8 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 185

upvoted 2 papers 11 months ago

MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm

Paper • 2502.02358 • Published Feb 4, 2025 • 19

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Paper • 2502.04306 • Published Feb 6, 2025 • 20

upvoted an article 11 months ago

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Article • Apr 15, 2024 • 191

upvoted 8 papers 12 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9, 2025 • 95

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10, 2025 • 75

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13, 2025 • 55

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90

Potential and Perils of Large Language Models as Judges of Unstructured Textual Data

Paper • 2501.08167 • Published Jan 14, 2025 • 6

upvoted 2 papers about 1 year ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 74

Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization

Paper • 2007.14895 • Published Jul 29, 2020 • 1

Dhruvajyoti Sarma

AI & ML interests

Organizations


MiniMax-01: Scaling Foundation Models with Lightning Attention

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Transformer^2: Self-adaptive LLMs
