Home
Esportazione
Statistica
Opzioni
Visualizza tutti i metadati (visione tecnica)
Video event classification using bag of words and string kernels
Ballan, Lamberto
•
Bertini, Marco
•
Del Bimbo, Alberto
•
SERRA, Giuseppe
2009
conference object
Abstract
The recognition of events in videos is a relevant and challenging task of automatic semantic video analysis. At present one of the most successful frameworks, used for object recognition tasks, is the bag-of-words (BoW) approach. However this approach does not model the temporal information of the video stream. In this paper we present a method to introduce temporal information within the BoW approach. Events are modeled as a sequence composed of histograms of visual features, computed from each frame using the traditional BoW model. The sequences are treated as strings where each histogram is considered as a character. Event classification of these sequences of variable size, depending on the length of the video clip, are performed using SVM classifiers with a string kernel that uses the Needlemann-Wunsch edit distance. Experimental results, performed on two datasets, soccer video and TRECVID 2005, demonstrate the validity of the proposed approach. © 2009 Springer Berlin Heidelberg.
DOI
10.1007/978-3-642-04146-4_20
WOS
WOS:000279101900019
Archivio
http://hdl.handle.net/11390/1105598
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-76249100989
Diritti
metadata only access
Soggetti
video annotation
action classication
bag-of-word
string kernel
edit distance
Scopus© citazioni
13
Data di acquisizione
Jun 2, 2022
Vedi dettagli
Visualizzazioni
4
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Vedi dettagli