The aim of this work is to detect the events in video sequences that are salient with respect to the audio signal. In particular, we focus on the audio analysis of a video, with the goal of finding which are the significant features to detect audio-salient events. In our work we have extracted the audio tracks from videos of different sport events. For each video, we have manually labeled the salient audio-events using the binary markings. On each frame, features in both time and frequency domains have been considered. These features have been used to train different classifiers: Classification and Regression Trees, Support Vector Machine, and k-Nearest Neighbor. The classification performances are reported in terms of confusion matrices.

Corchs, S., Ciocca, G., Fiori, M., Gasparini, F. (2014). Video salient event classification using audio features. In Imaging and Multimedia Analytics in a Web and Mobile World 2014. SPIE [10.1117/12.2039191].

Video salient event classification using audio features

Corchs, S;Ciocca, G;Gasparini, F
2014

Abstract

The aim of this work is to detect the events in video sequences that are salient with respect to the audio signal. In particular, we focus on the audio analysis of a video, with the goal of finding which are the significant features to detect audio-salient events. In our work we have extracted the audio tracks from videos of different sport events. For each video, we have manually labeled the salient audio-events using the binary markings. On each frame, features in both time and frequency domains have been considered. These features have been used to train different classifiers: Classification and Regression Trees, Support Vector Machine, and k-Nearest Neighbor. The classification performances are reported in terms of confusion matrices.
poster + paper
Audio analysis, audio saliency
English
Electronic Imaging - Imaging and Multimedia Analytics in a Web and Mobile World 2014
2014
Imaging and Multimedia Analytics in a Web and Mobile World 2014
978-081949944-8
2014
9027
90270P
none
Corchs, S., Ciocca, G., Fiori, M., Gasparini, F. (2014). Video salient event classification using audio features. In Imaging and Multimedia Analytics in a Web and Mobile World 2014. SPIE [10.1117/12.2039191].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/50632
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact