Re: speech/music characteristics (Lonce LaMar Wyse )


Subject: Re: speech/music characteristics
From:    Lonce LaMar Wyse  <lwyse(at)KRDL.ORG.SG>
Date:    Sun, 29 Mar 1998 10:09:52 +0800

> I'm working in speech recognition, and am trying to be able to distinguish > between speech and non-speech (especially music) sounds in an audio track. Going more for the music vs. "other" distinction, Mike Hawley's (1993) MIT PhD Thesis, "Structure out of Sound" used measurements of the length of constant-frequency peaks in the spectrum for discrimination. Steve Smoliar and I extended this work in Lonce Wyse and Steven W. Smoliar, ``Toward Content-Based Audio Indexing and Retrieval and a New Speaker Discrimination Technique''. (to appear in) D.F. Rosenthal & H.G. Okuno (eds.) Readings In Computational Auditory Scene Analysis. (Lawrence Erlbaum, Mahwah NJ) 1998. (linked from my homepage) Of course, using only information in the signal, there can be no perfect discrimination between music and other since the domain of music in general (e.g. electro-acoustic music) is all sound. - lonce


This message came from the mail archive
http://www.auditory.org/postings/1998/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University