Mellotron: Multispeaker Expressive Voice Synthesis By Conditioning On Rhythm, Pitch And Global Style Tokens




Mellotron is a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data. By explicitly conditioning on rhythm and continuous pitch contours from an audio signal or music score...

