Re: MFCC method (Matt Flax)


Subject: Re: MFCC method
From:    Matt Flax  <flatmax@xxxxxxxx>
Date:    Sat, 10 Jan 2009 10:03:37 +1100
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

Whilst we are on the topic of alternative scales for representing notions of pitch: another approach is to use the Equivalent Rectangular Bandwidth (ERB) scale. Further, in an attempt to match the difference in innervation between the afferent and efferent neural systems along the length of the organ of Corti (within the cochlea), a combination of mappings may be used.

In a preliminary study about 7 years ago, it was found that the mapping process reduces non-pitched noise to some degree and improves the salience of inter-note relations within chords [1].

You can see the block diagram of the processing algorithm and get example code from here (I think the original article is in the download):
http://mffmpitch.sourceforge.net/

[1] @xxxxxxxx{Flax:2002,
  author = {Flax, M.R.},
  title = {Afferent/Efferent Pitch Processing},
  booktitle = {Proceedings of the 9th Australian International Conference on Speech Science \& Technology Melbourne},
  year = {2002},
  month = {December},
  organization = {Australian Speech Science \& Technology Association Inc.}
}

thanks
Matt

On Thu, Jan 08, 2009 at 11:28:07PM -0800, Arturo Camacho wrote:
> Dear Dick,
>
> The Wikipedia page that you mention says that the Mel scale
> "approximates the human auditory system's response more closely than
> the linearly-spaced frequency bands used in the normal cepstrum." If
> that means that the Mel scale approximates better the tonotopic
> response of the cochlea than the linear scale, I wonder if it would
> not be an even better idea to use the Greenwood function (see entry in
> Wikipedia), which was explicitly created with that purpose. (Recall
> that the Mel scale was designed to represent equidistant steps in
> pitch, but that does not necessarily correspond with equidistant
> tonotopic steps.)
>
> Regards,
>
> Arturo
>
>
> On Thu, Jan 8, 2009 at 8:46 PM, Richard F. Lyon <DickLyon@xxxxxxxx> wrote:
> > Thanks Malcolm; now that you've told us, it's in wikipedia:
> > http://en.wikipedia.org/wiki/Mel-frequency_cepstrum#History
> > Including the connection to earlier work by Pols; I can share
> > a copy of Plomp, Pols, and van de Geer (1967) on request.
> >
> > Dick
> >
> > At 2:07 PM -0800 1/7/09, Malcolm Slaney wrote:
> >>
> >> On Jan 7, 2009, at 12:40 PM, James W. Beauchamp wrote:
> >>>
> >>> I'm looking for a (the?) seminal article on the MFCC method of
> >>> coding spectral envelopes. It could be a journal paper or a chapter
> >>> in a book. Also, who was the first to publish on this idea?
> >>
> >> These are the usual references, especially the 1980 paper.
> >>
> >> P. Mermelstein, Distance measures for speech recognition, psychological
> >> and instrumental, in Pattern Recognition and Artificial Intelligence, C. H.
> >> Chen, Ed., pp. 374-388. Academic, New York, 1976.
> >>
> >> S.B. Davis, and P. Mermelstein, Comparison of Parametric Representations
> >> for Monosyllabic Word Recognition in Continuously Spoken Sentences, in IEEE
> >> Transactions on Acoustics, Speech, and Signal Processing, vol. 28(4), 1980,
> >> pp. 357-366.
> >>
> >>
> >> But Mermelstein usually credits John Bridle's work for the idea
> >> JSRU Report No. 1003
> >> AN EXPERIMENTAL AUTOMATIC WORD-RECOGNITION SYSTEM:
> >> INTERIM REPORT
> >> J. S. Bridle and M. D. Brown
> >>
> >>
> >> I have copies of the early two if you need them.
> >>
> >> - Malcolm
> >
> >
> >
>
> --
> __________________________________________________
>
> Arturo Camacho, PhD
> Alumni
> Computer and Information Science and Engineering
> University of Florida
>
> E-mail: acamacho@xxxxxxxx
> Web page: www.cise.ufl.edu/~acamacho
> __________________________________________________

--
http://www.flatmaxstudios.com/
http://www.flatmax.org

Public Projects :
http://sourceforge.net/search/?type_of_search=soft&words=mffm
http://www.psysound.org
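
For readers comparing the scales named in this thread (Mel, ERB, Greenwood), the following is a minimal Python sketch that maps a frequency in Hz onto each of them. It is not the afferent/efferent mapping of [1], and nothing in it comes from the thread itself: the function names are hypothetical, and the constants are assumptions taken from commonly quoted published fits (the 2595/700 Mel formula, the Glasberg and Moore ERB-rate fit, and Greenwood's human parameters A = 165.4, a = 2.1, k = 0.88).

```python
import math

# All constants below are assumptions: commonly quoted published fits,
# not values given anywhere in this thread.


def hz_to_mel(f_hz):
    """Mel value from the widely used 2595 * log10(1 + f/700) formula."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def hz_to_erb_rate(f_hz):
    """ERB-rate (number of ERBs below f_hz), Glasberg and Moore style fit."""
    return 21.4 * math.log10(1.0 + 0.00437 * f_hz)


def hz_to_greenwood_position(f_hz, A=165.4, a=2.1, k=0.88):
    """Relative place along the basilar membrane (0 = apex, about 1 = base),
    obtained by inverting Greenwood's f = A * (10**(a*x) - k)."""
    return math.log10(f_hz / A + k) / a


def greenwood_position_to_hz(x, A=165.4, a=2.1, k=0.88):
    """Greenwood function: frequency in Hz at relative position x."""
    return A * (10.0 ** (a * x) - k)


if __name__ == "__main__":
    # Print the three mappings side by side for a few test frequencies.
    print("   Hz      mel  ERB-rate  Greenwood x")
    for f in (100.0, 500.0, 1000.0, 4000.0, 8000.0):
        print("%5.0f %8.1f %9.2f %12.3f" % (
            f, hz_to_mel(f), hz_to_erb_rate(f), hz_to_greenwood_position(f)))
```

Equal steps in the Greenwood column correspond to equal distances along the cochlea under that fit, which is the property Arturo contrasts with the Mel scale's equal steps in pitch.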


This message came from the mail archive
http://www.auditory.org/postings/2009/
maintained by:
Dan Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University