https://github.com/canopyai/Orpheus-TTS/issues/10#issuecomment-2740645470christophschuhmann left a comment (canopyai/Orpheus-TTS#10)Hey

Speech Technology

https://github.com/canopyai/Orpheus-TTS/issues/10#issuecomment-2740645470

christophschuhmann left a comment (canopyai/Orpheus-TTS#10)
Hey, Christoph from Laion here, the guy who made the Laion 5 billion data set. I have been making a voice acting data set with some donations from Intel with altogether 5,000 hours of high quality voice acting.
https://huggingface.co/datasets/laion/laions_got_talent_enhanced_flash_annotations_and_long_captions
https://huggingface.co/datasets/laion/laions_got_talent_raw
I was using HyperLab, which is a reseller for OpenAI API, so I never actually agreed to the OpenAI terms of service and then prompted the voice API to role play like an actor at a casting audition. This way I generated evenly distributed utterances over 40 emotion categories for all 11 OpenAI voices for English, German, French, and Spanish. The data is already online and I also have very detailed emotion captions. I will make an official release in the next few weeks, but you could already take it and tune German, Spanish, and French models on it. I would be very happy about a capable German model because I want to deploy voice assistants in schools in Germany. I'm doing all of this in my free time and I am still a high school teacher and want to keep it this way. In the following repository, the quality is the best, but unfortunately I lost the accent labels for the English samples. Some samples in the English part are with accents. In the second repository here, you find the unenhanced data, which is slightly lower from the recording quality, but you can find in the emotion entry of the JSON the corresponding accent. For English, I generated 14 different accents. German, Spanish, and French don't have any accents. Have fun!

GitHub

training / finetuning other languages · Issue #10 · canopyai/Orpheus-TTS

Amazing project! Would we be able to finetune on other languages? Could we just give finetune with a large dataset for another language or would it require other changes ? Would it handle other alp...

www.tg-me.com/us/Speech Technology/com.speechtech/2087

1.3K viewsMar 21 at 13:29

tg-me.com/speechtech/2087

Create: 2025-03-21
Last Update: 2025-07-03 17:16:06

BY Speech Technology

Warning: Undefined variable $i in /var/www/tg-me/post.php on line 283

Share with your friend now:
tg-me.com/speechtech/2087

Speech Technology Telegram | DID YOU KNOW?

Can I mute a Telegram group?

https://github.com/canopyai/Orpheus-TTS/issues/10#issuecomment-2740645470christophschuhmann left a comment (canopyai/Orpheus-TTS#10)Hey