Asr Is All You Need: Cross-Modal Distillation For Lip Reading

This video program is a part of the Premium package:

Asr Is All You Need: Cross-Modal Distillation For Lip Reading

IEEE MemberUS $11.00
Society MemberUS $0.00
IEEE Student MemberUS $11.00
Non-IEEE MemberUS $15.00

Purchase

Asr Is All You Need: Cross-Modal Distillation For Lip Reading

0 views

Create Account or Sign In to post comments

The goal of this work is to train strong models for visual speech recognition without requiring human annotated ground truth data. We achieve this by distilling from an Automatic Speech Recognition (ASR) model that has been trained on a large-scale audio-

The goal of this work is to train strong models for visual speech recognition without requiring human annotated ground truth data. We achieve this by distilling from an Automatic Speech Recognition (ASR) model that has been trained on a large-scale audio-

Next Up

00:10:00

Anti-Jamming Routing For Internet Of Satellites: A Reinforcement Learning Approach

00:10:00

00:10:00

00:11:17

Article Production Process: Author Gateway and POPP - PoE 2020

00:05:01

Article Production Process: Service Levels & Workflow Options - PoE 2020

00:28:33

AWS Partner Solution Showcase presented by Scott Francis IoT Partner Solutions Architect at Amazon Web Services