[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Origin of the Mel frequency scale equation?

An earlier paper describing the Mel frequency is:
Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL. ASSP-28, NO. 4, AUGUST 1980 pp.357

Nengheng Zheng, Ph.D.
Digital Signal Processing and Speech Technology Laboratory
Department of Electronic Engineering
The Chinese University of Hong Kong
Email: nhzheng@xxxxxxxxxxxxxx

----- Original Message ----- From: "Arturo Camacho" <acamacho@xxxxxxxxxxxx>
To: <AUDITORY@xxxxxxxxxxxxxxx>
Sent: Monday, March 10, 2008 2:09 PM
Subject: Origin of the Mel frequency scale equation?

Dear members of the list,

I am looking for the reference of first use of the equation

m = C log(1+f/700)

known as mel frequency scale transformation. In Wikipedia says that the
scale was originated by Stevens, Volkman and Newman in 1937 (J. Acoust.
Soc. Am 8(3) 185--190), but the paper only has tabulated data and no
equation. The paper by S.B. Davis & P. Mermelstein (1980), "Comparison of
parametric representations for monosyllabic word recognition in
continuously spoken sentences", IEEE Trans. on ASSP 28, 357-366 is usually
cited in the speech recognition community as origin of MFCCs, but the
equation is absent there as well.




Arturo Camacho, PhD
Computer and Information Science and Engineering
University of Florida

E-mail: acamacho@xxxxxxxxxxxx
Web page: www.cise.ufl.edu/~acamacho