SecoustiCodec: Cross-Modal Aligned Streaming Single-Codecbook Speech Codec
Paper: arXiv:2508.02849
SecoustiCodec is a streaming speech codec that achieves good speech reconstruction quality at ultra-low bitrates (0.27 to 1 kbps). The model introduces several innovations, described in the paper cited below.
@article{qiang2025secousticodec,
title={SecoustiCodec: Cross-Modal Aligned Streaming Single-Codecbook Speech Codec},
author={Chunyu Qiang and Haoyu Wang and Cheng Gong and Tianrui Wang and Ruibo Fu and Tao Wang and Ruilong Chen and Jiangyan Yi and Zhengqi Wen and Chen Zhang and Longbiao Wang and Jianwu Dang and Jianhua Tao},
journal={arXiv preprint arXiv:2508.02849},
year={2025}
}