Text to Speech

 

VALL-E – Neural Codec Language Models are Zero (paper)