Semantic audio
Semantic audio is the extraction of symbols or meaning from an audio stream. Speech recognition is an important semantic audio application. But for speech, other semantic operations include language identification, speaker identification or gender identification. For more general audio or music, it includes identifying a piece of music (e.g. Shazam (service)) or a movie soundtrack.
Areas of research in semantic audio include the ability to label an audio waveform with where the harmonies change and what they are and where material is repeated and what instruments are playing.
External links
- The Audio Engineering Society Technical Council on Semantic Audio Analysis
- AES 42nd Conference on Semantic Audio
- AES 53rd Conference on Semantic Audio
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.