AI Upscale Models for FFMPEGA
Pre-trained super-resolution models for use with ComfyUI-FFMPEGA's AI Upscale feature.
Models are automatically downloaded on first use โ no manual setup required.
Models
| File | Architecture | Scale | Size | VRAM | Best For |
|---|---|---|---|---|---|
RealESRGAN_x4plus.pth |
RRDBNet (GAN) | 4ร | 67 MB | ~2 GB | General real-world photos |
RealESRGAN_x4plus_anime_6B.pth |
RRDBNet (compact) | 4ร | 18 MB | ~1 GB | Anime, cartoon, illustration |
Real_HAT_GAN_SRx4.pth |
HAT (hybrid attention) | 4ร | 170 MB | ~4 GB | SOTA quality, fine detail |
003_realSR_BSRGAN_DFOWMFC_s64w8_SwinIR-L_x4_GAN.pth |
SwinIR-Large | 4ร | 48 MB | ~3 GB | Clean images, classical SR |
All models output 4ร resolution. For 2ร output, the upscaler runs at 4ร then applies high-quality Lanczos downscaling.
Usage in FFMPEGA
- Set
llm_modelโnone - Set
no_llm_modeโai_upscale - Choose
upscale_model(e.g.hat_x4for best quality) - Choose
upscale_scale(4or2) - Connect an image or video input and run
Model Loading
Models are loaded via spandrel, which auto-detects the architecture from the checkpoint file. No additional dependencies are needed beyond what ComfyUI already provides.
Credits
- Real-ESRGAN: xinntao/Real-ESRGAN โ BSD-3-Clause
- HAT: XPixelGroup/HAT โ MIT
- SwinIR: JingyunLiang/SwinIR โ Apache 2.0