Microsoft’s AI Voice Cloning Tech Is So Good, You Can’t Use It

VALL-E-2 generates extremely convincing cloned voices with just three seconds of audio data—but don’t get too excited.