MPEG-7 audio and beyond : audio content indexing and retrieval /
Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data,...
Main Author: | Kim, Hyoung-Gook |
---|---|
Other Authors: | Moreau, Nicolas; Sikora, Thomas |
Format: | Electronic eBook |
Language: | English |
Published: | Chichester, West Sussex, England ; Hoboken, NJ, USA : J. Wiley, ©2005. |
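Subjects: | MPEG (Video coding standard); Multimedia systems; Sound, Recording and reproducing, Digital techniques, Standards |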
Online Access: | CONNECT |
MARC
LEADER | 00000nam a2200000 a 4500 | ||
---|---|---|---|
001 | in00006116699 | ||
006 | m o d | ||
007 | cr cnu---unuuu | ||
008 | 061117s2005 enka ob 001 0 eng d | ||
005 | 20240125142204.2 | ||
020 | |a 9780470093368 | ||
020 | |a 9780470093344 |q (print ed.) | ||
020 | |a 047009334X |q (print ed.) | ||
020 | |a 0470093366 |q (electronic bk.) | ||
020 | |a 0470093358 |q (electronic bk.) | ||
020 | |a 9780470093351 |q (electronic bk.) | ||
020 | |a 1280339829 | ||
020 | |a 9781280339820 | ||
020 | |a 9786610339822 | ||
020 | |a 6610339821 | ||
024 | 3 | |z 9780470093344 | |
024 | 7 | |a 10.1002/0470093366 |2 doi | |
035 | |a (NhCcYBP)e80fa363d91f485b9a695405a0a7a4bb9780470093368 | ||
035 | |a 1wileyeba9780470093368 | ||
037 | |b OverDrive, Inc. |n http://www.overdrive.com | ||
037 | |a 77EE41BC-6C64-458F-83B6-4F0FA2A1BF4F |b OverDrive, Inc. |n http://www.overdrive.com | ||
040 | |a NhCcYBP |b eng |c NhCcYBP | ||
042 | |a dlr | ||
050 | 4 | |a TK6680.5 |b .K56 2005 | |
082 | 0 | 4 | |a 006.6/96 |2 22 |
084 | |a ST 325 |2 rvk | ||
084 | |a ST 330 |2 rvk | ||
100 | 1 | |a Kim, Hyoung-Gook. | |
245 | 1 | 0 | |a MPEG-7 audio and beyond : |b audio content indexing and retrieval / |c Hyoung-Gook Kim, Nicolas Moreau, Thomas Sikora. |
260 | |a Chichester, West Sussex, England ; |a Hoboken, NJ, USA : |b J. Wiley, |c ©2005. | ||
300 | |a 1 online resource (xviii, 285 pages) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
500 | |a Wiley EBA |5 TMurS | ||
504 | |a Includes bibliographical references and index. | ||
588 | 0 | |a Print version record. | |
533 | |a Electronic reproduction. |b [Place of publication not identified] : |c HathiTrust Digital Library, |d 2010. |5 MiAaHDL | ||
505 | 0 | |a MPEG-7 Audio and Beyond; Contents; List of Acronyms; List of Symbols; 1 Introduction; 1.1 Audio Content Description; 1.2 MPEG-7 Audio Content Description -- An Overview; 1.2.1 MPEG-7 Low-Level Descriptors; 1.2.2 MPEG-7 Description Schemes; 1.2.3 MPEG-7 Description Definition Language (DDL); 1.2.4 BiM (Binary Format for MPEG-7); 1.3 Organization of the Book; 2 Low-Level Descriptors; 2.1 Introduction; 2.2 Basic Parameters and Notations; 2.2.1 Time Domain; 2.2.2 Frequency Domain; 2.3 Scalable Series; 2.3.1 Series of Scalars; 2.3.2 Series of Vectors; 2.3.3 Binary Series; 2.4 Basic Descriptors | |
505 | 8 | |a 2.4.1 Audio Waveform; 2.4.2 Audio Power; 2.5 Basic Spectral Descriptors; 2.5.1 Audio Spectrum Envelope; 2.5.2 Audio Spectrum Centroid; 2.5.3 Audio Spectrum Spread; 2.5.4 Audio Spectrum Flatness; 2.6 Basic Signal Parameters; 2.6.1 Audio Harmonicity; 2.6.2 Audio Fundamental Frequency; 2.7 Timbral Descriptors; 2.7.1 Temporal Timbral: Requirements; 2.7.2 Log Attack Time; 2.7.3 Temporal Centroid; 2.7.4 Spectral Timbral: Requirements; 2.7.5 Harmonic Spectral Centroid; 2.7.6 Harmonic Spectral Deviation; 2.7.7 Harmonic Spectral Spread; 2.7.8 Harmonic Spectral Variation; 2.7.9 Spectral Centroid | |
505 | 8 | |a 2.8 Spectral Basis Representations; 2.9 Silence Segment; 2.10 Beyond the Scope of MPEG-7; 2.10.1 Other Low-Level Descriptors; 2.10.2 Mel-Frequency Cepstrum Coefficients; References; 3 Sound Classification and Similarity; 3.1 Introduction; 3.2 Dimensionality Reduction; 3.2.1 Singular Value Decomposition (SVD); 3.2.2 Principal Component Analysis (PCA); 3.2.3 Independent Component Analysis (ICA); 3.2.4 Non-Negative Factorization (NMF); 3.3 Classification Methods; 3.3.1 Gaussian Mixture Model (GMM); 3.3.2 Hidden Markov Model (HMM); 3.3.3 Neural Network (NN); 3.3.4 Support Vector Machine (SVM) | |
505 | 8 | |a 3.4 MPEG-7 Sound Classification; 3.4.1 MPEG-7 Audio Spectrum Projection (ASP) Feature Extraction; 3.4.2 Training Hidden Markov Models (HMMs); 3.4.3 Classification of Sounds; 3.5 Comparison of MPEG-7 Audio Spectrum Projection vs. MFCC Features; 3.6 Indexing and Similarity; 3.6.1 Audio Retrieval Using Histogram Sum of Squared Differences; 3.7 Simulation Results and Discussion; 3.7.1 Plots of MPEG-7 Audio Descriptors; 3.7.2 Parameter Selection; 3.7.3 Results for Distinguishing Between Speech, Music and Environmental Sound; 3.7.4 Results of Sound Classification Using Three Audio Taxonomy Methods | |
505 | 8 | |a 3.7.5 Results for Speaker Recognition; 3.7.6 Results of Musical Instrument Classification; 3.7.7 Audio Retrieval Results; 3.8 Conclusions; References; 4 Spoken Content; 4.1 Introduction; 4.2 Automatic Speech Recognition; 4.2.1 Basic Principles; 4.2.2 Types of Speech Recognition Systems; 4.2.3 Recognition Results; 4.3 MPEG-7 SpokenContent Description; 4.3.1 General Structure; 4.3.2 SpokenContentHeader; 4.3.3 SpokenContentLattice; 4.4 Application: Spoken Document Retrieval; 4.4.1 Basic Principles of IR and SDR; 4.4.2 Vector Space Models; 4.4.3 Word-Based SDR | |
520 | |a Advances in technology, such as MP3 players, the Internet and DVDs, have led to the production, storage and distribution of a wealth of audio signals, including speech, music and more general sound signals and their combinations. MPEG-7 audio tools were created to enable the navigation of this data, by providing an established framework for effective multimedia management. MPEG-7 Audio and Beyond: Audio Content Indexing and Retrieval is a unique insight into the technology, covering the following topics: the fundamentals of MPEG-7 audio, principally low-level descriptors and soun. | ||
546 | |a English. | ||
650 | 0 | |a MPEG (Video coding standard) | |
650 | 0 | |a Multimedia systems. | |
650 | 0 | |a Sound |x Recording and reproducing |x Digital techniques |x Standards. | |
700 | 1 | |a Moreau, Nicolas. | |
700 | 1 | |a Sikora, Thomas. | |
730 | 0 | |a WILEYEBA | |
776 | 0 | 8 | |i Print version: |a Kim, Hyoung-Gook. |t MPEG-7 audio and beyond. |d Chichester, West Sussex, England ; Hoboken, NJ, USA : J. Wiley, ©2005 |z 047009334X |w (DLC) 2005011807 |
856 | 4 | 0 | |u https://ezproxy.mtsu.edu/login?url=https://onlinelibrary.wiley.com/book/10.1002/0470093366 |z CONNECT |3 Wiley |t 0 |
949 | |a ho0 | ||
975 | |p Wiley UBCM Online Book All Titles thru 2023 | ||
976 | |a 6006612 | ||
998 | |a wi |d z | ||
999 | f | f | |s 2de78d48-7cdd-464c-8c90-b19599a3d43f |i 2de78d48-7cdd-464c-8c90-b19599a3d43f |t 0 |
952 | f | f | |a Middle Tennessee State University |b Main |c James E. Walker Library |d Electronic Resources |t 0 |e TK6680.5 .K56 2005 |h Library of Congress classification |