Re: About importance of "phase" in sound recognition ("reinifrosch@xxxxxxxx" )


Subject: Re: About importance of "phase" in sound recognition
From:    "reinifrosch@xxxxxxxx"  <reinifrosch@xxxxxxxx>
Date:    Wed, 6 Oct 2010 08:55:48 +0000
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

------=_Part_1087_5009672.1286355348548 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hello Emad, I just did an experiment on my Yamaha DX11 synthesizer. I programmed it to = generate sine tones (i.e., "pure" tones). White key C1: 440.0 Hz; black key= C#1: 440.0 Hz, too; white key D1: 441.6 Hz. First case: first press key C1, keep it pressed, and then press key C#1 (gi= ving the same frequency of 440.0 Hz). The loudness differs strongly from on= e try to the next, i.e., the loudness depends on the relative phase of the = two sine-tones. That result is plausible: If the phase happens to be zero o= r 2pi or 4pi, etc., then there is constructive interference. If the phase i= s pi or 3pi or 5pi, etc., then the interference is destructive. Second case: first press key C1, keep it pressed, and then press key D1 (44= 1.6 Hz). What one hears does not differ from one try to the next. One hears= 1.6 beats per second. Reinhart. =20 ----Urspr=C3=BCngliche Nachricht---- Von: emad.burke@xxxxxxxx Datum: 05.10.2010 19:03 An: <AUDITORY@xxxxxxxx> Betreff: Re: About importance of "phase" in sound recognition Hi Kevin, thanks for the reply. the phase definition that I'm talking about is more o= f the third definition of yours. I'm exactly talking about what is called = "in-sensitivity to phase". I'm talking about the phase information that is = discarded in the process of MFCC feature extraction and it has been proven = to be succesfull feature set for speech recognition. The "insensitivity to = phase" that implicitly implies that if you change the order (precedence) of= travelling waves in each cochlear channel among each other, it will not af= fect the perception and you can add random phases to different channels wit= hout affecting the perception(?). Now this was on one hand. On the other hand, couple of years ago there was = a publication by a mathematician (pete-cassaza) that kind of reinforced the= argument of phase insensitivety of speech recognition, but this time mathe= matically; very briefly stating that if you have a redundant set of magnitu= de coeeficients, then phase doesnt matter at all, and as they say in the pa= per this mathematically confirms the belief in the speech recognition commu= nity over the years about phase insensitivity, ... And also there are some papers on the opposition side as well. This basical= ly is the source of my confusion. Emad=20 --------------------------------------- Reinhart Frosch, Dr. phil. nat., CH-5200 Brugg. reinifrosch@xxxxxxxx . ------=_Part_1087_5009672.1286355348548 Content-Type: text/html;charset="UTF-8" Content-Transfer-Encoding: quoted-printable <html><head><style type=3D'text/css'> <!-- div.bwmail { background-color:#ffffff; font-family: Trebuchet MS,Arial,Helv= etica; font-size: 12px; margin:0; padding:0;} div.bwmail p { margin:0; padding:0; } div.bwmail table { font-family: Trebuchet MS,Arial,Helvetica; font-size: 12= px; } div.bwmail li { margin:0; padding:0; } --> </style> </head><body><div class=3D'bwmail'><P>Hello Emad,</P> <P>I just did an experiment on my Yamaha DX11 synthesizer. I programmed it = to generate sine tones (i.e., "pure" tones). White key C1: 440.0 Hz; black = key C#1: 440.0 Hz, too; white key D1: 441.6 Hz.</P> <P>First case: first press key C1, keep it pressed, and then press key C#1 = (giving the same frequency of 440.0 Hz). The loudness&nbsp;differs strongly= from one try to the next, i.e., the loudness depends on the relative phase= of the two sine-tones. That result is plausible: If the phase happens to b= e zero or&nbsp;2pi or 4pi, etc., then there is constructive interference. I= f the phase is pi or 3pi or 5pi, etc., then the interference is destructive= .</P> <P>Second case: first press key C1, keep it pressed, and then press key D1 = (441.6 Hz). What one hears does not differ from one try to the next. One he= ars 1.6 beats per second.</P> <P>Reinhart.</P> <P>&nbsp;<BR>----Urspr=C3=BCngliche Nachricht----<BR>Von: emad.burke@xxxxxxxx= COM<BR>Datum: 05.10.2010 19:03<BR>An: &lt;AUDITORY@xxxxxxxx&gt;<BR>B= etreff: Re: About importance of "phase" in sound recognition<BR><BR>Hi Kevi= n,<BR><BR>thanks for the reply. the phase definition that I'm talking about= is more of the third definition of yours. I'm exactly talking about what i= s called&nbsp; "in-sensitivity to phase". I'm talking about the phase infor= mation that is discarded in the process of MFCC feature extraction and it h= as been proven to be succesfull feature set for speech recognition. The "in= sensitivity to phase" that implicitly implies that if you change the order = (precedence) of travelling waves in each cochlear channel among each other,= it will not affect the perception and you can add random phases to differe= nt channels without affecting the perception(?).<BR><BR>Now this was on one= hand. On the other hand, couple of years ago there was a publication by a = mathematician (pete-cassaza) that kind of reinforced the argument of phase = insensitivety of speech recognition, but this time mathematically; very bri= efly stating that if you have a redundant set of magnitude coeeficients, th= en phase doesnt matter at all, and as they say in the paper this mathematic= ally confirms the belief in the speech recognition community over the years= about phase insensitivity, ...<BR><BR>And also there are some papers on th= e opposition side as well. This basically is the source of my confusion.<BR= ><BR>Emad <BR>---------------------------------------</P> <P>Reinhart Frosch,<BR>Dr. phil. nat.,<BR>CH-5200 Brugg.<BR>reinifrosch@xxxxxxxx= ewin.ch .<BR><BR></P></div></body></html> ------=_Part_1087_5009672.1286355348548--


This message came from the mail archive
/home/empire6/dpwe/public_html/postings/2010/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University