Vggsound: A Large-Scale Audio-Visual Dataset

This video program is a part of the Premium package:

Vggsound: A Large-Scale Audio-Visual Dataset


  • IEEE MemberUS $11.00
  • Society MemberUS $0.00
  • IEEE Student MemberUS $11.00
  • Non-IEEE MemberUS $15.00
Purchase

Vggsound: A Large-Scale Audio-Visual Dataset

0 views
  • Share
Create Account or Sign In to post comments
Our goal is to collect a large-scale audio-visual dataset with low label noise from videos `in the wild' using computer vision techniques. The resulting dataset can be used for training and evaluating audio recognition models. We make three contributions.
Our goal is to collect a large-scale audio-visual dataset with low label noise from videos `in the wild' using computer vision techniques. The resulting dataset can be used for training and evaluating audio recognition models. We make three contributions.