Android Audio and Visual Balance

Audio-Visual Target Speaker Extraction With Selective Auditory Attention

Abstract: Audio-visual target speaker extraction (AV-TSE) aims to extract the specific person's speech from the audio mixture given auxiliary visual cues. Previous methods usually search for the ...

IEEE

MAVAD: Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos

Abstract: This paper introduces the first audio-visual dataset for traffic anomaly detection called MAVAD, taken from real-world scenes, with a diverse range of illumination conditions. In addition, a ...
