Re: request for dataset (Ross Maddox)


Subject: Re: request for dataset
From:    Ross Maddox  <rkmaddox@xxxxxxxx>
Date:    Thu, 16 Oct 2014 11:19:11 -0700
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

Hi Shabih,

If the only labeling you need is between the categorical values you list above, then you could combine several datasets that have only one of those types into a meta-dataset that may suit your needs.

Here is a list of many speech datasets (with transcriptions): https://wiki.inria.fr/rosp/Datasets#Speech_datasets

You could add other types of sounds by downloading them from archive.org. They have many hours of live music, tv recordings (mostly videos from which you could extract the audio), political speeches, poem readings, etc. that may potentially be combined.

Hope that helps.

Best,
Ross

--
Ross Maddox, Ph.D.
Postdoctoral Fellow
Institute for Learning & Brain Sciences
University of Washington
phone: 206-685-4662
http://faculty.washington.edu/rkmaddox/

On Wed, Oct 15, 2014 at 8:04 AM, Syed Shabih Hasan <hasanshabih@xxxxxxxx> wrote:
> Dear All
>
> I am working on creating a classifier that can identify live speech,
> music, media sounds (tv, radio etc). Can someone, please, point me to
> publicly available datasets of audio that are also annotated with the
> proper labels?
>
> Best Regards
> Shabih
>
> --
> *Syed Shabih Hasan*
> Graduate Student in CS
> University of Iowa
> http://shabih.hasan.net
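
The suggestion above, combining several single-class collections into one labeled meta-dataset, could be sketched roughly as follows. This is only an illustration under assumptions not stated in the message: each source collection is assumed to have been downloaded into its own local directory, the label names, the extension list, and the function names `build_manifest` and `write_manifest` are all hypothetical, and no audio decoding or resampling is attempted.

```python
import csv
from pathlib import Path

# Assumed set of audio file extensions; adjust to the collections actually used.
AUDIO_EXTENSIONS = {".wav", ".flac", ".mp3", ".ogg"}


def build_manifest(class_dirs):
    """Return (path, label) rows for every audio file under each directory.

    class_dirs maps a class label (e.g. "speech") to the directory that
    holds one single-class collection. Every file found under that
    directory inherits the label of the collection it came from.
    """
    rows = []
    for label, directory in class_dirs.items():
        for path in sorted(Path(directory).rglob("*")):
            if path.is_file() and path.suffix.lower() in AUDIO_EXTENSIONS:
                rows.append((str(path), label))
    return rows


def write_manifest(rows, out_path):
    """Write the rows to a two-column CSV file with a header line."""
    with open(out_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["path", "label"])
        writer.writerows(rows)
```

Calling `build_manifest({"speech": "data/speech", "music": "data/music", "media": "data/tv_radio"})` with hypothetical directory names and passing the result to `write_manifest` would give a single labeled file list for training a classifier. Recordings from different sources usually differ in sample rate, level, and channel conditions, so some normalization would likely be needed to keep the classifier from learning the source rather than the sound class.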


This message came from the mail archive
http://www.auditory.org/postings/2014/
maintained by:
Dan Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University