AI & ML interests

Efficient machine learning for any model and hardware: pruning, quantization, compilation, and more.

PrunaAI

🌍 Join the Pruna AI community!

GitHub   Twitter/X   LinkedIn   Discord


💜 Make AI models faster, cheaper, smaller, greener!

Pruna AI makes AI models faster, cheaper, smaller, and greener with the pruna package.

  • It supports various models, including CV, NLP, audio, and graph models for both predictive and generative AI.
  • It supports various hardware, including GPUs, CPUs, and edge devices.
  • It supports various compression algorithms, including quantization, pruning, distillation, caching, recovery, compilation, and factorization.
  • You can combine algorithms to find the optimal configuration and smash (compress) your model.
  • You can evaluate reliable quality and efficiency metrics for your base model versus its smashed (compressed) version.
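As a rough back-of-the-envelope illustration of why one of these algorithms, quantization, makes models smaller, here is a toy pure-Python sketch of 8-bit affine quantization (this is only a conceptual example, not pruna's implementation):

```python
# Toy illustration: map float32 weights to 8-bit codes, cutting weight
# storage roughly 4x at the cost of a small, bounded rounding error.
def quantize_int8(weights):
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 255 or 1.0          # step size between 8-bit levels
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo

def dequantize(codes, scale, lo):
    return [c * scale + lo for c in codes]

weights = [0.1, -0.75, 0.5, 0.33, -0.2]
codes, scale, lo = quantize_int8(weights)

fp32_bytes = len(weights) * 4               # 4 bytes per float32 weight
int8_bytes = len(codes) * 1 + 8             # 1 byte per code + scale/offset
restored = dequantize(codes, scale, lo)
max_err = max(abs(a - b) for a, b in zip(weights, restored))

print(fp32_bytes, int8_bytes, round(max_err, 4))
```

The rounding error is bounded by half a quantization step, which is why quantized models usually stay close to their original quality while taking a fraction of the memory.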

Set it up in minutes and compress your first models in a few lines of code!


⏩ How to get started?

You can smash your own models by installing pruna with pip:

pip install pruna

You can start with these simple notebooks to experience the efficiency gains:

| Use Case | Free Notebook |
| --- | --- |
| 3x faster Stable Diffusion models | Smash for free |
| Making your LLMs 4x smaller | Smash for free |
| Smash your model with a CPU only | Smash for free |
| Transcribe 2 hours of audio in less than 2 minutes with Whisper | Smash for free |
| 100% faster Whisper transcription | Smash for free |
| Run your Flux model without an A100 | Smash for free |
| 2x smaller Sana in action | Smash for free |

For more details on installation and more free tutorials, check the Pruna AI documentation.


✨ Test our endpoints

Want to use our optimized models right away? Try them via our API for fast, easy access to Pruna-powered inference.

Try our models