[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Frequency to Mel Formula

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: Frequency to Mel Formula
From: Donald D Greenwood <ddg@xxxxxxxxxxxxx>
Date: Thu, 30 Jul 2009 11:29:08 -0700
Approved-by: ddg@xxxxxxxxxxxxx
Delivery-date: Thu Jul 30 14:34:35 2009
In-reply-to: <20090730031526.4DAD98857@xxxxxxxxxxxxxxxxxxxxxxx>
List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <95C3910C-83F5-46F1-A953-5C8F1D141D5D@xxxxxxx> <20090728204713.3E48F95F1@xxxxxxxxxxxxxxxxxxxxxxx> <20090729115312.D431C677C@xxxxxxxxxxxxxxxxxxxxxxx> <20090729232205.7FD8C9275@xxxxxxxxxxxxxxxxxxxxxxx> <20090730031526.4DAD98857@xxxxxxxxxxxxxxxxxxxxxxx>
Reply-to: Donald D Greenwood <ddg@xxxxxxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Dick,

Your comments and questions are well taken. Stevens discarded the"half-pitch" mel scale of 1937 completely, as I commented. Siegel1964 or 65) did the only published effort I know about to replicatethe 1937 experiment of Stevens, as mentioned by Richard Warren and asdescribed also in Footnote 4 of my 1997 paper, where I presentarguments against half-pitch judgments that you and others have.Stevens replaced the 1937 scale with the 1940 mel scale, that wasbased on experiments intended to divide frequency intervals into four(4 not 2) equal perceptual intervals. That scale appears to have beenmethodologically biased, as I commented. That scale is also the onethat was approximated by Fant and used thereafter, as Pierre says, bythe "speech science and technology community". I agree with Pierrethat they have been ill-advised to use it, but no doubling or halvingwas involved in arriving at the 1940 scale.

In 1940 Stevens, as you have noted, did include a separate experiment(reported in the same paper as the equisection data) asking subjectsto make what he called half-pitch judgments, but Stevens provided 40Hz signals to the subjects to provide them with "zero" pitchapproximations. That seems to have converted those subject judgmentsinto bisection experiments, i.e. 40 Hz on one end, standard tone onthe other, with subject setting the "middle" tone. In any case, whenStevens provided the "zero point" you call for, the resulting pseudohalf-pitch data fit in with his 4 part equisection results, but (as hesays) they were not used to make the 1940 scale, which used 4 partequisection data only.

The use of narrow noise bands signals to try equisection experimentssounds like an experiment that could be tried, whether the outcomewould be useful or not.


Donald Greenwood
On 29 Jul, 2009, at 8:15 PM, Richard F. Lyon wrote:

Diana,
Certainly the circular or helical aspect of pitch is crucial, inmany aspects of pitch perception. But there's also this one-dimensional scale that can be valid in some contexts. I hadn't saidor known anything about this "half-pitch" concept, which wouldcertainly bring in the whole octave equivalence complication. Butis that what was used for the mel-scale tests and such? I didn'tthink so. Rather, the idea was to subdivide intervals intoperceptually equal intervals ("equisection"). Of course, if theintervals are like 2 octaves or such, or the subject is musicallysavvy, that's going to bias the judgements based on the pitchcircularity. But if the signals are something like narrow noisebands, maybe it would be possible to do the task while avoidingthose cues of "consonance" and such?
The "half pitch" idea presumes a well-defined, or well-perceived atleast, zero point, as well as a nonlinear mapping to try to get at.Plus it puts the likely result right where the octave is, at leastfor low frequencies. Did anyone actually use that approach?Richard Warren and Snorre Farner say several studies did so; I'msurprised; it seems like a bad idea. Wouldn't you almost always geta result of half pitch equal to half frequency? Is that theexplanation for why the linear-to-log breakpoint ended up so high?Or did they really do equisection of intervals defined by twononzero tone frequencies?
Stevens says they did both, but the curve he plots show only theequisection results:
http://books.google.com/books?id=r5JOHlXX8bgC&pg=PA166&dq=pitch+curve+equisection&lr=&as_brr=3&ei=VudwStWOPIrykATalqz4Dg

Dick

Prev by Date: Re: Mel scale, in general
Next by Date: Re: Mel scale, in general
Previous by thread: Re: Frequency to Mel Formula
Next by thread: Re: Frequency to Mel Formula
Index(es):
- Date
- Thread