MPEG-7 audio and beyond : audio content indexing and retrieval /

Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data,...

Full description

Saved in:
Bibliographic Details
Main Author: Kim, Hyoung-Gook
Other Authors: Moreau, Nicolas, Sikora, Thomas
Format: Electronic eBook
Language:English
Published: Chichester, West Sussex, England ; Hoboken, NJ, USA : J. Wiley, ©2005.
Subjects:
Online Access:CONNECT

MARC

LEADER 00000nam a2200000 a 4500
001 in00006116699
006 m o d
007 cr cnu---unuuu
008 061117s2005 enka ob 001 0 eng d
005 20240125142204.2
020 |a 9780470093368 
020 |a 9780470093344  |q (print ed.) 
020 |a 047009334X  |q (print ed.) 
020 |a 0470093366  |q (electronic bk.) 
020 |a 0470093358  |q (electronic bk.) 
020 |a 9780470093351  |q (electronic bk.) 
020 |a 1280339829 
020 |a 9781280339820 
020 |a 9786610339822 
020 |a 6610339821 
024 3 |z 9780470093344 
024 7 |a 10.1002/0470093366  |2 doi 
035 |a (NhCcYBP)e80fa363d91f485b9a695405a0a7a4bb9780470093368 
035 |a 1wileyeba9780470093368 
037 |b OverDrive, Inc.  |n http://www.overdrive.com 
037 |a 77EE41BC-6C64-458F-83B6-4F0FA2A1BF4F  |b OverDrive, Inc.  |n http://www.overdrive.com 
040 |a NhCcYBP  |b eng  |c NhCcYBP 
042 |a dlr 
050 4 |a TK6680.5  |b .K56 2005 
082 0 4 |a 006.6/96  |2 22 
084 |a ST 325  |2 rvk 
084 |a ST 330  |2 rvk 
100 1 |a Kim, Hyoung-Gook. 
245 1 0 |a MPEG-7 audio and beyond :  |b audio content indexing and retrieval /  |c Hyoung-Gook Kim, Nicolas Moreau, Thomas Sikora. 
260 |a Chichester, West Sussex, England ;  |a Hoboken, NJ, USA :  |b J. Wiley,  |c ©2005. 
300 |a 1 online resource (xviii, 285 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
500 |a Wiley EBA  |5 TMurS 
504 |a Includes bibliographical references and index. 
588 0 |a Print version record. 
533 |a Electronic reproduction.  |b [Place of publication not identified] :  |c HathiTrust Digital Library,  |d 2010.  |5 MiAaHDL 
505 0 |a MPEG-7 Audio and Beyond; Contents; List of Acronyms; List of Symbols; 1 Introduction; 1.1 Audio Content Description; 1.2 MPEG-7 Audio Content Description -- An Overview; 1.2.1 MPEG-7 Low-Level Descriptors; 1.2.2 MPEG-7 Description Schemes; 1.2.3 MPEG-7 Description Definition Language (DDL); 1.2.4 BiM (Binary Format for MPEG-7); 1.3 Organization of the Book; 2 Low-Level Descriptors; 2.1 Introduction; 2.2 Basic Parameters and Notations; 2.2.1 Time Domain; 2.2.2 Frequency Domain; 2.3 Scalable Series; 2.3.1 Series of Scalars; 2.3.2 Series of Vectors; 2.3.3 Binary Series; 2.4 Basic Descriptors 
505 8 |a 2.4.1 Audio Waveform2.4.2 Audio Power; 2.5 Basic Spectral Descriptors; 2.5.1 Audio Spectrum Envelope; 2.5.2 Audio Spectrum Centroid; 2.5.3 Audio Spectrum Spread; 2.5.4 Audio Spectrum Flatness; 2.6 Basic Signal Parameters; 2.6.1 Audio Harmonicity; 2.6.2 Audio Fundamental Frequency; 2.7 Timbral Descriptors; 2.7.1 Temporal Timbral: Requirements; 2.7.2 Log Attack Time; 2.7.3 Temporal Centroid; 2.7.4 Spectral Timbral: Requirements; 2.7.5 Harmonic Spectral Centroid; 2.7.6 Harmonic Spectral Deviation; 2.7.7 Harmonic Spectral Spread; 2.7.8 Harmonic Spectral Variation; 2.7.9 Spectral Centroid 
505 8 |a 2.8 Spectral Basis Representations2.9 Silence Segment; 2.10 Beyond the Scope of MPEG-7; 2.10.1 Other Low-Level Descriptors; 2.10.2 Mel-Frequency Cepstrum Coefficients; References; 3 Sound Classification and Similarity; 3.1 Introduction; 3.2 Dimensionality Reduction; 3.2.1 Singular Value Decomposition (SVD); 3.2.2 Principal Component Analysis (PCA); 3.2.3 Independent Component Analysis (ICA); 3.2.4 Non-Negative Factorization (NMF); 3.3 Classification Methods; 3.3.1 Gaussian Mixture Model (GMM); 3.3.2 Hidden Markov Model (HMM); 3.3.3 Neural Network (NN); 3.3.4 Support Vector Machine (SVM) 
505 8 |a 3.4 MPEG-7 Sound Classification3.4.1 MPEG-7 Audio Spectrum Projection (ASP) Feature Extraction; 3.4.2 Training Hidden Markov Models (HMMs); 3.4.3 Classification of Sounds; 3.5 Comparison of MPEG-7 Audio Spectrum Projection vs. MFCC Features; 3.6 Indexing and Similarity; 3.6.1 Audio Retrieval Using Histogram Sum of Squared Differences; 3.7 Simulation Results and Discussion; 3.7.1 Plots of MPEG-7 Audio Descriptors; 3.7.2 Parameter Selection; 3.7.3 Results for Distinguishing Between Speech, Music and Environmental Sound; 3.7.4 Results of Sound Classification Using Three Audio Taxonomy Methods 
505 8 |a 3.7.5 Results for Speaker Recognition3.7.6 Results of Musical Instrument Classification; 3.7.7 Audio Retrieval Results; 3.8 Conclusions; References; 4 Spoken Content; 4.1 Introduction; 4.2 Automatic Speech Recognition; 4.2.1 Basic Principles; 4.2.2 Types of Speech Recognition Systems; 4.2.3 Recognition Results; 4.3 MPEG-7 SpokenContent Description; 4.3.1 General Structure; 4.3.2 SpokenContentHeader; 4.3.3 SpokenContentLattice; 4.4 Application: Spoken Document Retrieval; 4.4.1 Basic Principles of IR and SDR; 4.4.2 Vector Space Models; 4.4.3 Word-Based SDR 
520 |a Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data, by providing an established framework for effective multimedia management. MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval is a unique insight into the technology, covering the following topics:the fundamentals of MPEG-7 audio, principally low-level descriptors and soun. 
546 |a English. 
650 0 |a MPEG (Video coding standard) 
650 0 |a Multimedia systems. 
650 0 |a Sound  |x Recording and reproducing  |x Digital techniques  |x Standards. 
700 1 |a Moreau, Nicolas. 
700 1 |a Sikora, Thomas. 
730 0 |a WILEYEBA 
776 0 8 |i Print version:  |a Kim, Hyoung-Gook.  |t MPEG-7 audio and beyond.  |d Chichester, West Sussex, England ; Hoboken, NJ, USA : J. Wiley, ©2005  |z 047009334X  |w (DLC) 2005011807 
856 4 0 |u https://ezproxy.mtsu.edu/login?url=https://onlinelibrary.wiley.com/book/10.1002/0470093366  |z CONNECT  |3 Wiley  |t 0 
949 |a ho0 
975 |p Wiley UBCM Online Book All Titles thru 2023 
976 |a 6006612 
998 |a wi  |d z 
999 f f |s 2de78d48-7cdd-464c-8c90-b19599a3d43f  |i 2de78d48-7cdd-464c-8c90-b19599a3d43f  |t 0 
952 f f |a Middle Tennessee State University  |b Main  |c James E. Walker Library  |d Electronic Resources  |t 0  |e TK6680.5 .K56 2005  |h Library of Congress classification