Self-Supervised Learning For Audio-Visual Speaker Diarization

Speaker diarization, the task of finding the speech segments of specific speakers, is widely used in human-centered applications such as video conferencing and human-computer interaction systems. In this paper, we propose a self-supervised audio-video sy