Re: dynamic range and sample bit depth ("Richard F. Lyon" )


Subject: Re: dynamic range and sample bit depth
From:    "Richard F. Lyon"  <dicklyon@xxxxxxxx>
Date:    Wed, 3 Dec 2014 23:10:19 -0800
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

--001a11c2bd9e2ed16805095ea7f1 Content-Type: text/plain; charset=UTF-8 Dan, CD audio is 16 bits per sample (linear, uncompressed). So that's a pretty good quality reference point. For parts of a song that are 48 dB down from the peak, there are only 8 bits being used. But those parts are pretty quiet, and the quantization noise is still about 96 dB down from the very loudest signals, which puts the noise somewhere near the absolute hearing threshold, and probably well below the masked threshold, if you have good normal hearing and the loudest sound you make is not painful. Still, a few more bits would be nice. 24 is certainly overkill. It's not clear to me what sort of dithering you have in mind or how it's going to help here. Dick On Wed, Dec 3, 2014 at 10:05 AM, Dan Goodman <d.goodman@xxxxxxxx> wrote: > Dear auditory list, > > I have been worried about an issue to do with sampling bit depth and > dynamic range for a while, and I have not yet been able to find a > definitive answer. Hopefully some of you may be able to shed some light on > this. > > Essentially, the question revolves around the fact that with a digital > signal, when it is attenuated you lose information. For example, for a > signal with 16 bits per sample, if you attenuate by 20*log10(2^8)=48 dB > then the output will only be using 8 bits per sample. Having listened to 8 > bit sounds, they are clearly of very poor quality. So, although it is often > written that the 'dynamic range' of 16 bit sound is 96 dB > (=20*log10(2^16)), at even 48 dB of attenuation the quality becomes > terribly poor. > > So some questions: > > 1. How many bits per sample do we need for a high quality encoding of a > sound without any attenuation? > > 2. How much dynamic range can we therefore get from standard audio systems > using 16 or 24 bits per sample? > > 3. Are we routinely using more than this dynamic range in our experiments > (and in musical recordings) and is this a problem for the results of, for > example, studies mixing normal and hearing impaired listeners? > > 4. Is there anything we can do about this? > > Some more details: > > The clearest thing I have managed to find on this subject so far is a > paper by Bob Stuart of Meridian Audio (https://www.meridian-audio. > com/meridian-uploads/ara/coding2.pdf) that concludes that if you have 19 > bits per sample at a 52 kHz sample rate, and you use dithering, and your > audio system doesn't do any further processing on the sound, then at 90 dB > attenuation from the maximum level you shouldn't hear any noise from the > encoding (based on the extremes of measured hearing thresholds). This > suggests that using 20 bit audio you can probably get 96 dB of high quality > dynamic range (see below for why I mention 20 bit audio). > > This doesn't take into account that many (most?) researchers are probably > not dithering their signals. As far as I can tell, Matlab's wavplay and > audioplayer functions do not use dithering, for example. So how much > dynamic range are we getting without introducing noise if we don't use > dithering? And are any of the commonly used packages for playing sounds > doing this dithering? > > Note: I mentioned 20 bit audio because I have read that 24 bit DACs only > really use at most 22 bits of the signal, and due to thermal noise give > about 20 bits noise free. I worked with one system in the past that I was > told allowed you to select which 22 bits were used (although this was > hardware specific and had to be coded in at a very low level, not using > standard audio APIs). > > I am very far from an expert on any of this, but what it seems like to me > is that we need to be using 24 bit audio and (very importantly) dithering, > and if so we can probably get 96 dB of high quality dynamic range. It is > possible that in some experimental setups, especially if we're testing > normal hearing and hearing impaired listeners across a wide range of sounds > on the same system, that we might be exceeding this. If so, is there > anything we can do? > > Finally, any thoughts on the relevance for music / commercial audio? I > guess it is much less of an issue there since the problems only seem to > arise if you really push the limits of dynamic range. > > Thanks in advance, > Dan Goodman > --001a11c2bd9e2ed16805095ea7f1 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable <div dir=3D"ltr"><div><div><div><div>Dan,<br><br></div>CD audio is 16 bits = per sample (linear, uncompressed).=C2=A0 So that&#39;s a pretty good qualit= y reference point.<br><br></div>For parts of a song that are 48 dB down fro= m the peak, there are only 8 bits being used.=C2=A0 But those parts are pre= tty quiet, and the quantization noise is still about 96 dB down from the ve= ry loudest signals, which puts the noise somewhere near the absolute hearin= g threshold, and probably well below the masked threshold, if you have good= normal hearing and the loudest sound you make is not painful.<br><br></div= >Still, a few more bits would be nice.=C2=A0 24 is certainly overkill.<br><= br></div><div>It&#39;s not clear to me what sort of dithering you have in m= ind or how it&#39;s going to help here.<br></div><div><br></div>Dick<br><br= ></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Wed, De= c 3, 2014 at 10:05 AM, Dan Goodman <span dir=3D"ltr">&lt;<a href=3D"mailto:= d.goodman@xxxxxxxx" target=3D"_blank">d.goodman@xxxxxxxx</a>&gt= ;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 = .8ex;border-left:1px #ccc solid;padding-left:1ex">Dear auditory list,<br> <br> I have been worried about an issue to do with sampling bit depth and dynami= c range for a while, and I have not yet been able to find a definitive answ= er. Hopefully some of you may be able to shed some light on this.<br> <br> Essentially, the question revolves around the fact that with a digital sign= al, when it is attenuated you lose information. For example, for a signal w= ith 16 bits per sample, if you attenuate by 20*log10(2^8)=3D48 dB then the = output will only be using 8 bits per sample. Having listened to 8 bit sound= s, they are clearly of very poor quality. So, although it is often written = that the &#39;dynamic range&#39; of 16 bit sound is 96 dB (=3D20*log10(2^16= )), at even 48 dB of attenuation the quality becomes terribly poor.<br> <br> So some questions:<br> <br> 1. How many bits per sample do we need for a high quality encoding of a sou= nd without any attenuation?<br> <br> 2. How much dynamic range can we therefore get from standard audio systems = using 16 or 24 bits per sample?<br> <br> 3. Are we routinely using more than this dynamic range in our experiments (= and in musical recordings) and is this a problem for the results of, for ex= ample, studies mixing normal and hearing impaired listeners?<br> <br> 4. Is there anything we can do about this?<br> <br> Some more details:<br> <br> The clearest thing I have managed to find on this subject so far is a paper= by Bob Stuart of Meridian Audio (<a href=3D"https://www.meridian-audio.com= /meridian-uploads/ara/coding2.pdf" target=3D"_blank">https://www.meridian-a= udio.<u></u>com/meridian-uploads/ara/<u></u>coding2.pdf</a>) that concludes= that if you have 19 bits per sample at a 52 kHz sample rate, and you use d= ithering, and your audio system doesn&#39;t do any further processing on th= e sound, then at 90 dB attenuation from the maximum level you shouldn&#39;t= hear any noise from the encoding (based on the extremes of measured hearin= g thresholds). This suggests that using 20 bit audio you can probably get 9= 6 dB of high quality dynamic range (see below for why I mention 20 bit audi= o).<br> <br> This doesn&#39;t take into account that many (most?) researchers are probab= ly not dithering their signals. As far as I can tell, Matlab&#39;s wavplay = and audioplayer functions do not use dithering, for example. So how much dy= namic range are we getting without introducing noise if we don&#39;t use di= thering? And are any of the commonly used packages for playing sounds doing= this dithering?<br> <br> Note: I mentioned 20 bit audio because I have read that 24 bit DACs only re= ally use at most 22 bits of the signal, and due to thermal noise give about= 20 bits noise free. I worked with one system in the past that I was told a= llowed you to select which 22 bits were used (although this was hardware sp= ecific and had to be coded in at a very low level, not using standard audio= APIs).<br> <br> I am very far from an expert on any of this, but what it seems like to me i= s that we need to be using 24 bit audio and (very importantly) dithering, a= nd if so we can probably get 96 dB of high quality dynamic range. It is pos= sible that in some experimental setups, especially if we&#39;re testing nor= mal hearing and hearing impaired listeners across a wide range of sounds on= the same system, that we might be exceeding this. If so, is there anything= we can do?<br> <br> Finally, any thoughts on the relevance for music / commercial audio? I gues= s it is much less of an issue there since the problems only seem to arise i= f you really push the limits of dynamic range.<br> <br> Thanks in advance,<br> Dan Goodman<br> </blockquote></div><br></div> --001a11c2bd9e2ed16805095ea7f1--


This message came from the mail archive
http://www.auditory.org/postings/2014/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University