[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Origin of the Mel frequency scale equation?



An earlier paper describing the Mel frequency is:
Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL. ASSP-28, NO. 4, AUGUST 1980 pp.357

Nengheng
______________________________________________
Nengheng Zheng, Ph.D.
Digital Signal Processing and Speech Technology Laboratory
Department of Electronic Engineering
The Chinese University of Hong Kong
Email: nhzheng@xxxxxxxxxxxxxx

----- Original Message ----- From: "Arturo Camacho" <acamacho@xxxxxxxxxxxx>
To: <AUDITORY@xxxxxxxxxxxxxxx>
Sent: Monday, March 10, 2008 2:09 PM
Subject: Origin of the Mel frequency scale equation?


Dear members of the list,

I am looking for the reference of first use of the equation

m = C log(1+f/700)

known as mel frequency scale transformation. In Wikipedia says that the
scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust.
Soc. Am 8(3) 185--190), but the paper only has tabulated data and no
equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison of
parametric representations for monosyllabic word recognition in
continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usually
cited in the speech recognition community as origin of MFCCs, but the
equation is absent there as well.

Thanks,

Arturo

--
__________________________________________________

Arturo Camacho, PhD
Alumni
Computer and Information Science and Engineering
University of Florida

E-mail: acamacho@xxxxxxxxxxxx
Web page: www.cise.ufl.edu/~acamacho
__________________________________________________