Fast, multi-speaker TTS (44.1kHz) with voice cloning
MegaTTS 3 but with voice cloning!
Generate images with SD3.5