π Qwen-Image-Edit-MeiTu
This model β Qwen-Image-Edit-MeiTu β is an improved variant of Qwen/Qwen-Image-Edit, built with DiT-based architecture fine-tuning to enhance visual consistency, aesthetic quality, and structural alignment in complex edits.
Developed by Valiant Cat AI Lab, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.
β¨ Key Improvements
Enhanced Consistency:
Utilizes DiT (Diffusion Transformer) fine-tuning to ensure structural stability between input and edited regions, maintaining global spatial coherence.Aesthetic Optimization:
Trained with aesthetic discriminators and curated aesthetic score datasets, producing more pleasing colors, contrast, and light balance.Better Detail Preservation:
Improved low-level reconstruction for fine details such as textures, faces, and typography.Broader Scene Adaptability:
Performs well on portraits, environments, product photos, and illustrations, supporting both semantic and appearance-based editing.
πΌοΈ Showcase
Below are examples of consistency and aesthetic improvement in complex editing scenarios:
| Input & Output |
|---|
![]() |
![]() |
![]() |
![]() |
![]() |
π¬ Recommended Prompts
Try these prompts to explore the modelβs strengths:
- βmake the lighting soft and cinematic with better balanceβ
- βenhance the photoβs composition and maintain realismβ
- βrefine skin tone and texture consistencyβ
- βimprove the global color tone and aesthetic harmonyβ
- βincrease photo realism and clarity without changing contentβ
π§© Integration with ComfyUI
This model works seamlessly with a modified ComfyUI Qwen-Image-Edit workflow.
Just use this model in the Unet node to workflow for edit image.
π₯ Download Model
Weights available in Safetensors format:
π Download Qwen-Image-Edit-MeiTu
π§ Training
This model was trained and optimized by the
AI Laboratory of Chongqing Valiant Cat Technology Co., LTD.
Visit https://vvicat.com/ for business collaborations or research partnerships.
π Related Paper
This model is part of the Qwen-Edit+ research line and is associated with the following preprint:
Fan Tang, Siyuan Li
Qwen-Edit+: Scaling Image Editing with VLM-Guided Consistency and Aesthetic Preference Distillation.
Research Square, Version 1, 08 April 2026.
DOI: 10.21203/rs.3.rs-9352857/v1
π Citation
If you use this model, please cite:
@article{tang2026qweneditplus,
author = {Fan Tang and Siyuan Li},
title = {Qwen-Edit+: Scaling Image Editing with VLM-Guided Consistency and Aesthetic Preference Distillation},
journal = {Research Square},
year = {2026},
doi = {10.21203/rs.3.rs-9352857/v1},
url = {https://doi.org/10.21203/rs.3.rs-9352857/v1}
}
π License
Licensed under Apache 2.0.
πΌ Join Us
We are hiring research engineers and creative ML practitioners at
Chongqing Valiant Cat Technology Co., LTD β reach out via
π§ tommy@vvicat.com
- Downloads last month
- 686
4-bit
5-bit
6-bit
8-bit




