End-To-End Multi-Person Audio/Visual Automatic Speech Recognition

This video program is a part of the Premium package:

End-To-End Multi-Person Audio/Visual Automatic Speech Recognition


  • IEEE MemberUS $11.00
  • Society MemberUS $0.00
  • IEEE Student MemberUS $11.00
  • Non-IEEE MemberUS $15.00
Purchase

End-To-End Multi-Person Audio/Visual Automatic Speech Recognition

0 views
  • Share
Traditionally, audio-visual automatic speech recognition has been studied under the assumption that the speaking face on the visual signal is the face matching the audio. However, in a more realistic setting, when multiple faces are potentially on screen
Traditionally, audio-visual automatic speech recognition has been studied under the assumption that the speaking face on the visual signal is the face matching the audio. However, in a more realistic setting, when multiple faces are potentially on screen