Re: New book -- Communication acoustics: An introduction to Speech, Audio and Psychoacoustics (Pulkki Ville )


Subject: Re: New book -- Communication acoustics: An introduction to Speech, Audio and Psychoacoustics
From:    Pulkki Ville  <ville.pulkki@xxxxxxxx>
Date:    Tue, 25 Nov 2014 06:24:37 +0000
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--_000_D099E7F4A2E4VillePulkkiaaltofi_ Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable Articulation has of course many meanings. By knowing your background, I bel= ieve you mean =93articulation=94 as measure of speech quality over an acous= tic channel, "articulation=94. There is one chapter (17.6.1) about subjecti= ve methods and measures, which discusses 11 different methods, and discusse= s briefly also articulation. The next chapter (17.6.2) is about objective m= ethods and measures, which mentions 8 different approaches, and discusses v= ery briefly articulation index. The book does not talk too much about them,= instead it talks for few pages about STI and ALcons, and also about method= s based on auditory models, such as PESQ, POLQA, TOSQA, HASQI. Speech quali= ty takes about 7 pages, sound quality in total about 31 pages, which means = that the level of details is limited. I pasted below some excerpts from the book covering articulation and artic= ulation index. The process of producing speech, also called articulation, i= s discussed elsewhere in the book. -Ville =85=85=85. Different subjective and objective methods have been developed to measure t= he quality of speech, indicating the articulation, intelligibility, and qua= lity of the reproduction of timbre (Quackenbush et al., 1988). We will list= some relevant techniques and later present some of them in greater detail. 17.6.1 Subjective Methods and Measures * Articulation. The term articulation here means the overall functionin= g of the speech trans- mission channel, not just the functioning of the spe= ech organs, as discussed in Section 5.1.3. A measure for the quantity is ob= tained from a listening test, where the task of the subjects is to listen t= o nonsense phoneme sequences composed as a catenation of consonants (C) and= vowels (V), such as /CV/ or /CVC/, and to report the sequences perceived. = The percentage of correct answers gives the articulation score. The articul= ation index is the articulation score modified to obtain additivity, just a= s the values of loudness are additive but the values of loudness level are = not (Fletcher, 1995). * Intelligibility and intelligibility score. The articulation test, but= this time conducted with real words or sentences measures the intelligibil= ity of the communication channel. The percentage of correct answers is the = intelligibility score. * Rhyme test. The test uses rhyming words or one-syllable words where c= hanging the first phoneme changes the meaning of the word, such as pay/may/= day/say/way. The percentage of correct answers measured gives this measure = of speech quality. Different variations of this test exist, differing in th= e application and realization. * [continues] =85=85=85=85=85=85.. 17.6.2 Objective Methods and Measures =95 Articulation index (AI). This was developed to measure speech intelligi= bility over a trans- mission channel that is assumed to be nearly linear, b= ut, with disturbance caused by additive noise. The method assumes that the = loss of articulation can be estimated by summing the AI values over 20 freq= uency bands, following roughly the Bark scale. =95 Percentage articulation loss of consonants (%ALcons) (Peutz, 1971). Thi= s is a simple and relatively often used estimate of speech intelligibility = in a room, auditorium, or other large space. The %ALcons value is computed = from the basic acoustic parameters of the space. The method is described in= Section 17.9.3. =95 Speech transmission index (STI). The index is based on the modulation t= ransfer function (MTF), and it can be used to estimate relatively reliably = the effect of reverberation and additive noise of a transmission channel on= speech intelligibility. The method is described in Sections 17.7.1 and 17.= 7.2, and STIPA, a simplified version of STI, is discussed in Section 17.7.4= . [continues] =85=85=85=85. From: James Johnston <audioskeptic@xxxxxxxx<mailto:audioskeptic@xxxxxxxx>= > Reply-To: James Johnston <audioskeptic@xxxxxxxx<mailto:audioskeptic@xxxxxxxx= COM>> Date: Tuesday 25 November 2014 07:56 To: "AUDITORY@xxxxxxxx<mailto:AUDITORY@xxxxxxxx>" <AUDITORY@xxxxxxxx= ISTS.MCGILL.CA<mailto:AUDITORY@xxxxxxxx>> Subject: Re: New book -- Communication acoustics: An introduction to Speech= , Audio and Psychoacoustics Hmm. What kind of treatment do you give articulation and articulation impai= rments? On Mon, Nov 24, 2014 at 3:08 AM, Pulkki Ville <ville.pulkki@xxxxxxxx<mailto= :ville.pulkki@xxxxxxxx>> wrote: Dear Colleague, [sorry if you get this message multiple times] this is to inform that the book "Communication acoustics: An introduction t= o Speech, Audio and Psychoacoustics=94 by Ville Pulkki and Matti Karjalain= en is available as a hardcover and as an ebook from Wiley. The book makes an introduction to the fields which concern some kind of co= mmunication channel having the human as listener in the end, the fields tog= ether are called as =93communication acoustics=94. The main fields discussed in the book are: Physics of sound / Signal processing / Human voice and speech / Music instr= uments and synthesis / Psychoacoustic testing / Psychoacoustic quantities /= Spatial hearing / Auditory modeling / Sound reproduction / Time-frequency-= domain audio processing / Speech technologies / Sound quality / Technical a= udiology The book has 456 pages and 250 figures. Its table of contents can be viewed= here: http://eu.wiley.com/WileyCDA/WileyTitle/productCd-1118866541,subjectCd-PH51= .html The book is primarily meant for Msc-level teaching in technical universitie= s, but it also serves as a reference book for professionals in the field. In my university, the book is partially read on the first course in the mas= ter program on acoustics and audio technology, after which more detailed co= urses take place. If you are thinking that the book could be used in your teaching, you can r= equest a copy for evaluation from Wiley. I am planning to publish also the = slides that I use in my classes. All the best, Ville Pulkki Tenure track assistant professor Department of Signal Processing and Acoustics School of Electrical Engineering Aalto University -- James D. (jj) Johnston Independent Audio and Electroacoustics Consultant --_000_D099E7F4A2E4VillePulkkiaaltofi_ Content-Type: text/html; charset="Windows-1252" Content-ID: <8546FC260DCC3C439772AE1F5C1A0AA4@xxxxxxxx> Content-Transfer-Encoding: quoted-printable <html> <head> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3DWindows-1= 252"> <title>untitled</title> </head> <body style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-lin= e-break: after-white-space;"> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> Articulation has of course many meanings. By knowing your background, I bel= ieve you mean =93articulation=94 as measure of speech quality over an acous= tic channel, &quot;articulation=94. There is one chapter (17.6.1) about sub= jective methods and measures, which discusses 11 different methods, and discusses briefly also articulation. The next ch= apter (17.6.2) is about objective methods and measures, which mentions 8 di= fferent approaches, and discusses very briefly articulation index. The book= does not talk too much about them, instead it talks for few pages about STI and ALcons, and also about method= s based on auditory models, such as PESQ, POLQA, TOSQA, HASQI. Speech quali= ty takes about 7 pages, sound quality in total about 31 pages, which means = that the level of details is limited.&nbsp;</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> I pasted below some excerpts from the book covering &nbsp;articulation and = articulation index. The process of producing speech, also called articulati= on, is discussed elsewhere in the book.&nbsp;</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> -Ville</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> =85=85=85.</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <div class=3D"page" title=3D"Page 383"> <div class=3D"layoutArea"> <div class=3D"column"> <p><span style=3D"font-size: 10.000000pt; font-family: 'TimesLTStd'">Differ= ent subjective and objective methods have been developed to measure the qua= lity of speech, indicating the articulation, intelligibility, and quality o= f the reproduction of timbre (Quackenbush </span><span style=3D"font-size: 10.000000pt; font-family: 'TimesLTStd'; fo= nt-style: italic">et al</span><span style=3D"font-size: 10.000000pt; font-f= amily: 'TimesLTStd'">., 1988). We will list some relevant techniques and la= ter present some of them in greater detail. </span></p> <p><span style=3D"font-size: 12.000000pt; font-family: 'TimesLTStd'; font-s= tyle: italic">17.6.1 Subjective Methods and Measures </span></p> <ul> <li style=3D"font-size: 10.000000pt; font-family: 'TimesLTStd'"> <p><span style=3D"font-size: 10pt; font-style: italic;">Articulation</span>= <span style=3D"font-size: 10pt;">. The term articulation here means the ove= rall functioning of the speech trans- mission channel, not just the functio= ning of the speech organs, as discussed in Section 5.1.3. A measure for the quantity is obtained from a listening = test, where the task of the subjects is to listen to nonsense phoneme seque= nces composed as a catenation of consonants (C) and vowels (V), such as /CV= / or /CVC/, and to report the sequences perceived. The percentage of correct answers gives the </span><span style= =3D"font-size: 10pt; font-style: italic;">articulation score</span><span st= yle=3D"font-size: 10pt;">. The </span><span style=3D"font-size: 10pt; font-style: italic;">articulation in= dex </span> <span style=3D"font-size: 10pt;">is the articulation score modified to obta= in additivity, just as the values of loudness are additive but the values o= f loudness level are not (Fletcher, 1995).</span></p> </li></ul> <div> <div class=3D"page" title=3D"Page 383"> <div class=3D"layoutArea"> <div class=3D"column"> <ul> <li style=3D"font-size: 10.000000pt; font-family: 'TimesLTStd'"> <p><span style=3D"font-size: 10pt; font-style: italic;">Intelligibility </s= pan><span style=3D"font-size: 10pt;">and </span><span style=3D"font-size: 10pt; font-style: italic;">intelligibility= score</span><span style=3D"font-size: 10pt;">. The articulation test, but = this time conducted with real words or sentences measures the intelligibili= ty of the communication channel. The percentage of correct answers is the intelligibility score. </span></p> </li><li style=3D"font-size: 10.000000pt; font-family: 'TimesLTStd'"> <p><span style=3D"font-size: 10pt; font-style: italic;">Rhyme test</span><s= pan style=3D"font-size: 10pt;">. The test uses rhyming words or one-syllabl= e words where changing the first phoneme changes the meaning of the word, s= uch as pay/may/day/say/way. The percentage of correct answers measured gives this measure of speech quality. Differen= t variations of this test exist, differing in the application and realizati= on.&nbsp;</span></p> </li><li><span style=3D"color: rgb(0, 0, 0); font-family: TimesLTStd; font-= size: 13px; font-style: normal; font-weight: normal; text-decoration: none;= ">[continues]</span> </li></ul> </div> </div> </div> </div> <ul> </ul> </div> </div> </div> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> =85=85=85=85=85=85..</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <div>17.6.2 Objective Methods and Measures</div> <div>=95 Articulation index (AI). This was developed to measure speech inte= lligibility over a trans- mission channel that is assumed to be nearly line= ar, but, with disturbance caused by additive noise. The method assumes that= the loss of articulation can be estimated by summing the AI values over 20 frequency bands, following roughly the Ba= rk scale.</div> <div>=95 Percentage articulation loss of consonants (%ALcons) (Peutz, 1971)= . This is a simple and relatively often used estimate of speech intelligibi= lity in a room, auditorium, or other large space. The %ALcons value is comp= uted from the basic acoustic parameters of the space. The method is described in Section 17.9.3.</div> </div> <div><font face=3D"Calibri,sans-serif">=95 Speech transmission index (STI).= The index is based on the modulation transfer function (MTF), and it can b= e used to estimate relatively reliably the effect of reverberation and addi= tive noise of a transmission channel on speech intelligibility. The method is described in Sections 17.7.1 and = 17.7.2, and STIPA, a simplified version of STI, is discussed in Section 17.= 7.4.</font></div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> [continues]</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> =85=85=85=85.</div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <div style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; font-s= ize: 14px;"> <br> </div> <span id=3D"OLK_SRC_BODY_SECTION" style=3D"color: rgb(0, 0, 0); font-family= : Calibri, sans-serif; font-size: 14px;"> <div style=3D"font-family:Calibri; font-size:11pt; text-align:left; color:b= lack; BORDER-BOTTOM: medium none; BORDER-LEFT: medium none; PADDING-BOTTOM:= 0in; PADDING-LEFT: 0in; PADDING-RIGHT: 0in; BORDER-TOP: #b5c4df 1pt solid;= BORDER-RIGHT: medium none; PADDING-TOP: 3pt"> <span style=3D"font-weight:bold">From: </span>James Johnston &lt;<a href=3D= "mailto:audioskeptic@xxxxxxxx">audioskeptic@xxxxxxxx</a>&gt;<br> <span style=3D"font-weight:bold">Reply-To: </span>James Johnston &lt;<a hre= f=3D"mailto:audioskeptic@xxxxxxxx">audioskeptic@xxxxxxxx</a>&gt;<br> <span style=3D"font-weight:bold">Date: </span>Tuesday 25 November 2014 07:5= 6<br> <span style=3D"font-weight:bold">To: </span>&quot;<a href=3D"mailto:AUDITOR= Y@xxxxxxxx">AUDITORY@xxxxxxxx</a>&quot; &lt;<a href=3D"mailto= :AUDITORY@xxxxxxxx">AUDITORY@xxxxxxxx</a>&gt;<br> <span style=3D"font-weight:bold">Subject: </span>Re: New book -- Communicat= ion acoustics: An introduction to Speech, Audio and Psychoacoustics<br> </div> <div><br> </div> <blockquote id=3D"MAC_OUTLOOK_ATTRIBUTION_BLOCKQUOTE" style=3D"BORDER-LEFT:= #b5c4df 5 solid; PADDING:0 0 0 5; MARGIN:0 0 0 5;"> <div> <div> <div dir=3D"ltr">Hmm. What kind of treatment do you give articulation and a= rticulation impairments?<br> </div> <div class=3D"gmail_extra"><br> <div class=3D"gmail_quote">On Mon, Nov 24, 2014 at 3:08 AM, Pulkki Ville <s= pan dir=3D"ltr"> &lt;<a href=3D"mailto:ville.pulkki@xxxxxxxx" target=3D"_blank">ville.pulkki= @xxxxxxxx</a>&gt;</span> wrote:<br> <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p= x #ccc solid;padding-left:1ex"> <div style=3D"word-wrap:break-word;color:rgb(0,0,0);font-size:14px;font-fam= ily:Calibri,sans-serif"> <div><br> </div> <div> <div>Dear Colleague,&nbsp;</div> <div>[sorry if you get this message multiple times]</div> <div><br> </div> <div><font face=3D"Calibri,sans-serif">this is to inform that the book&nbsp= ;</font><font face=3D"Calibri,sans-serif"><i>&quot;Communication acoustics:= An introduction to Speech, Audio and Psychoacoustics=94&nbsp;</i>&nbsp;by = Ville Pulkki and Matti Karjalainen is available as a hardcover and as an ebook from Wiley.</font></div> <div><br> </div> <div>The book makes an introduction to the fields which concern some kind o= f &nbsp;communication channel having the human as listener in the end, the = fields together are called as =93communication acoustics=94.&nbsp;</div> <div><br> </div> <div>The main fields &nbsp;discussed in the book are:&nbsp;</div> <div>Physics of sound / Signal processing / Human voice and speech / Music = instruments and synthesis / Psychoacoustic testing / Psychoacoustic quantit= ies / Spatial hearing / Auditory modeling / Sound reproduction / Time-frequ= ency-domain audio processing / Speech technologies / Sound quality / Technical audiology</div> <div><br> </div> <div>The book has 456 pages and 250 figures. Its table of contents can be v= iewed here:&nbsp;</div> <div><a href=3D"http://eu.wiley.com/WileyCDA/WileyTitle/productCd-111886654= 1,subjectCd-PH51.html" target=3D"_blank">http://eu.wiley.com/WileyCDA/Wiley= Title/productCd-1118866541,subjectCd-PH51.html</a></div> <div><br> </div> <div>The book is primarily meant for Msc-level teaching in technical univer= sities, but it also serves as a reference book for professionals in the fie= ld.&nbsp;</div> <div>In my university, the book is partially read on the first course in th= e master program on acoustics and audio technology, after which more detail= ed courses take place.&nbsp;</div> <div><br> </div> <div>If you are thinking that the book could be used in your teaching, you = can request a copy for evaluation from Wiley. I am planning to publish also= the slides that I use in my classes.&nbsp;</div> <div><br> </div> <div>All the best,&nbsp;</div> <span class=3D"HOEnZb"><font color=3D"#888888"> <div>Ville Pulkki</div> </font></span></div> <span class=3D"HOEnZb"><font color=3D"#888888"> <div>Tenure track assistant professor&nbsp;</div> <div>Department of Signal Processing and Acoustics</div> <div>School of Electrical Engineering</div> <div>Aalto University</div> <div><br> </div> <span> <blockquote style=3D"BORDER-LEFT:#b5c4df 5 solid;PADDING:0 0 0 5;MARGIN:0 0= 0 5"> <div> <div style=3D"word-wrap:break-word;color:rgb(0,0,0);font-size:14px;font-fam= ily:Calibri,sans-serif"> </div> </div> </blockquote> </span></font></span></div> </blockquote> </div> <br> <br clear=3D"all"> <br> -- <br> <div class=3D"gmail_signature"> <div>James D. (jj) Johnston</div> <div>Independent Audio and Electroacoustics Consultant</div> </div> </div> </div> </div> </blockquote> </span> </body> </html> --_000_D099E7F4A2E4VillePulkkiaaltofi_--


This message came from the mail archive
http://www.auditory.org/postings/2014/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University