Benj Edwards / Ars Technica:
Microsoft unveils VALL-E, a text-to-speech AI model trained on 60K hours of English speech that can simulate a person’s voice from three seconds of sample audio — Text-to-speech model can preserve speaker’s emotional tone and acoustic environment. — On Thursday, Microsoft researchers announced …
Lees verder op Tech Meme