Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio
Text-to-speech model can preserve speaker's emotional tone and acoustic environment.

Text-to-speech model can preserve speaker's emotional tone and acoustic environment.
via ArsTechnica : lire l’article source