Aligntts: Efficient Feed-Forward Text-To-Speech System Without Explicit Alignment

This video program is a part of the Premium package:

Aligntts: Efficient Feed-Forward Text-To-Speech System Without Explicit Alignment


  • IEEE MemberUS $11.00
  • Society MemberUS $0.00
  • IEEE Student MemberUS $11.00
  • Non-IEEE MemberUS $15.00
Purchase

Aligntts: Efficient Feed-Forward Text-To-Speech System Without Explicit Alignment

0 views
  • Share
Create Account or Sign In to post comments
Targeting at both high efficiency and performance, we propose AlignTTS to predict the mel-spectrum in parallel. AlignTTS is based on a Feed-Forward Transformer which generates mel-spectrum from a sequence of characters, and the duration of each character
Targeting at both high efficiency and performance, we propose AlignTTS to predict the mel-spectrum in parallel. AlignTTS is based on a Feed-Forward Transformer which generates mel-spectrum from a sequence of characters, and the duration of each character