Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
1-2016
Abstract
In this paper, we introduce a novel framework called SASA (Smart Ambient Sound Analyser) to support different ambient audio mining tasks (e.g., audio classification and location estimation). To gain comprehensive ambient sound modelling, SASA extracts a variety of acoustic features from different sound components (e.g., music, voice and background), and translates them into structured information. This significantly enhances quality of audio content representation. Further, distinguished from existing approaches, SASA’s multilayered architecture seamlessly integrates mixture models and aPEGASOS (adaptive PEGASOS) SVM algorithm into a unified classification framework. The approach can leverage complimentary strengths of both models. Experimental results based on three large test collections demonstrate the SASA’s advantages over existing methods on various analysis tasks.
Keywords
Ambient intelligence, Environmental sound analysis
Discipline
Computer Sciences | Databases and Information Systems
Publication
MultiMedia Modeling: International Conference on Multimedia Modeling 2016: Miami, FL, January 4-6
First Page
231
Last Page
243
ISBN
9783319276731
Identifier
10.1007/978-3-319-27674-8_21
Publisher
Springer Verlag
City or Country
Cham
Citation
SHEN, Jialie; NIE, Liqiang; and CHUA, Tat Seng.
Smart ambient sound analysis via structured statistical modeling. (2016). MultiMedia Modeling: International Conference on Multimedia Modeling 2016: Miami, FL, January 4-6. 231-243.
Available at: https://ink.library.smu.edu.sg/sis_research/3543
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1007/978-3-319-27674-8_21