Pruna 0.3.2: More OSS Algos, More Ways to Optimize
•
4
Efficient machine learning for any model and hardware: pruning, quantization, compilation, and more.
Pruna AI makes AI models faster, cheaper, smaller, and greener with the pruna package.
Set it up in minutes and compress your first models in a few lines of code!
You can smash your own models by installing pruna with pip:
pip install pruna
You can start with simple notebooks to experience efficiency gains with:
| Use Case | Free Notebooks |
|---|---|
| 3x Faster Stable Diffusion Models | ⏩ Smash for free |
| Making your LLMs 4x smaller | ⏩ Smash for free |
| Smash your model with a CPU only | ⏩ Smash for free |
| Transcribe 2 hours of audio in less than 2 minutes with Whisper | ⏩ Smash for free |
| 100% faster Whisper Transcription | ⏩ Smash for free |
| Run your Flux model without an A100 | ⏩ Smash for free |
| x2 smaller Sana in action | ⏩ Smash for free |
For more details on installation and free tutorials, check the Pruna AI documentation.
Want to use our optimized models right away? Try them via our API for fast, easy access to Pruna-powered inference.