AI Clones Your Voice After Listening for 5 Seconds
We describe a neural network-based system for text-to-speech (TTS) synthesis that is able to generate speech audio in the voice of many different speakers, including those unseen during training.
Learn more and listen to examples.