Journal of Shanghai Jiaotong University

• Automation Technique, Computer Technology • Previous Articles     Next Articles

Violent Videos Classification Algorithm Based on Bag of Audio Words

LI Rongjie1,JIANG Xinghao1,2,SUN Tanfeng1,2
  

  1. (1. School of Information Engineering Security, Shanghai Jiaotong University, Shanghai 200240, China; 2. Shanghai Information Security Management and Technology Research Key Lab, Shanghai 200240, China)
  • Received:2010-06-13 Revised:1900-01-01 Online:2011-02-28 Published:2011-02-28

Abstract: A new method to classify the violent videos by the bag of audio words was introduced. The MPEG7 audio descriptors are firstly extracted, including the low level features such as AudioSpectrumCentroid and AudioSpectrumSpread etc. After that, the audio words are built through the MPEG7 high level descriptor, the AudioSighnature, which is considered as the fingerprint of the audio stream. The support vector machine is used to classify the feature vectors into two genres, which are the violent and nonviolent. There are three experiments in this paper: the research on the different types of the audio words, the different size of words and the classification of the shots detected from the visual features. It is demonstrated from the experiment result that the proposed method achieves good recall accuracy.

CLC Number: