A finetune of OmniVoice that adds singing and emotional speech on top of state-of-the-art text-to-speech for 600+ languages:
[singing]
[happy]
[sad]
[angry]
[excited]
[calm]
[nervous]
[whisper]
Model: ModelsLab/omnivoice-singing · Built with OmniVoice by Xiaomi AI Lab Next-gen Kaldi team.
Prefix your text with a tag. Tags can also be combined, e.g. [singing] [sad] .... A Guidance Scale of 3.0 is recommended for pronounced tag behavior.
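The tag-prefix convention described above can be sketched as a small string helper. This is only an illustration of how a tagged prompt might be assembled before it is pasted into the demo or passed to the model: `KNOWN_TAGS`, `tag_prompt`, and `GUIDANCE_SCALE` are hypothetical names, not part of OmniVoice, and the sketch does not call the model. The tag order used here follows the example in the text; whether order matters is not stated.

```python
# Tags listed on this page; the tuple name itself is hypothetical.
KNOWN_TAGS = (
    "singing", "happy", "sad", "angry",
    "excited", "calm", "nervous", "whisper",
)

# Recommended value from the text above (hypothetical constant name).
GUIDANCE_SCALE = 3.0


def tag_prompt(text: str, *tags: str) -> str:
    """Return `text` prefixed with bracketed tags, e.g. '[singing] [sad] text'."""
    for tag in tags:
        if tag not in KNOWN_TAGS:
            raise ValueError(f"unknown tag: {tag!r}")
    prefix = " ".join(f"[{tag}]" for tag in tags)
    # With no tags, the text is returned unchanged.
    return f"{prefix} {text}" if prefix else text


if __name__ == "__main__":
    print(tag_prompt("Twinkle, twinkle, little star", "singing", "sad"))
    # prints: [singing] [sad] Twinkle, twinkle, little star
```

Validating against the listed tags catches typos such as `[hapy]` before synthesis, where an unrecognized tag would presumably be read as ordinary text.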
Leave the language set to Auto to have it detected automatically.