Zero-Shot Multi-Speaker Text-To-Speech With State-Of-The-Art Neural Speaker Embeddings



While speaker adaptation for end-to-end speech synthesis using speaker embeddings can produce good speaker similarity for speakers seen during training, there remains a gap for zero-shot adaptation to unseen speakers. We investigate multi-speaker modeling

