[AUDITORY] Announcing SOUNDATA: A Python library for reproducible use of audio datasets (Justin Salamon )


Subject: [AUDITORY] Announcing SOUNDATA: A Python library for reproducible use of audio datasets
From:    Justin Salamon  <000000b4a42fd03d-dmarc-request@xxxxxxxx>
Date:    Wed, 3 Nov 2021 00:26:38 +0000

--_000_BYAPR02MB53333ED80753C5198BD5FEC5AB8C9BYAPR02MB5333namp_ Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable *** apologies for any cross-postings *** Dear colleagues, We=92re excited to announce the release of soundata, a python library for r= eproducible use of audio datasets. Soundata can be installed via: pip install soundata The source code lives here: https://github.com/soundata/soundata We=92re launching with 14 popular environmental sound datasets<https://soun= data.readthedocs.io/en/latest/source/quick_reference.html>, with plans to c= ontinue expanding with additional datasets spanning a range of audio domain= s including speech and bioacoustics. For music datasets see mirdata<https:/= /github.com/mir-dataset-loaders/mirdata>, which was the inspiration for sou= ndata. Soundata makes it easy to: * Download datasets to a common location and format * Validate that a downloaded dataset is complete and perfectly matches = a canonical version * Load audio and annotation files into a common format * Parse clip-level metadata for detailed evaluations We hope soundata will help the community to: * Ensure results are reproducible by working against exactly the same d= ata * Save time by avoiding manual downloads and having to write custom dat= aset parsers * Automate large-scale download, training, and evaluation pipelines * Increase the visibility of new datasets by adding them to soundata Soundata is a cross-organizational collaboration spanning researchers from = MARL@xxxxxxxx<https://steinhardt.nyu.edu/marl>, Adobe Research<https://research.= adobe.com/research/audio/>, MTG@xxxxxxxx<https://www.upf.edu/web/mtg>, and GPA@xxxxxxxx= delaR<https://iie.fing.edu.uy/investigacion/grupos/gpa/en/audio-processing-= group/>. You can learn more about the library on our docs page: https://soundata.rea= dthedocs.io/ A bit more about the motivation for soundata can be found in our (work in p= rogress) paper: "Soundata: A Python library for reproducible use of audio datasets" Magdalena Fuentes, Justin Salamon, Pablo Zinemanas, Mart=EDn Rocamora, Gen= =EDs Plaja, Ir=E1n R. Rom=E1n, Marius Miron, Xavier Serra, Juan Pablo Bello [arXiv<https://arxiv.org/abs/2109.12690>] We *welcome and encourage* contributions from the community, especially dat= a loaders for datasets not included yet in soundata. Cheers, Justin & Magdalena on behalf of the soundata team -- Justin Salamon | Adobe Research | www.justinsalamon.com --_000_BYAPR02MB53333ED80753C5198BD5FEC5AB8C9BYAPR02MB5333namp_ Content-Type: text/html; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable <html> <head> <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3DWindows-1= 252"> <style type=3D"text/css" style=3D"display:none;"> P {margin-top:0;margin-bo= ttom:0;} </style> </head> <body dir=3D"ltr"> <div style=3D"font-family: Arial, Helvetica, sans-serif; font-size: 10pt; c= olor: rgb(0, 0, 0);"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >*** apologies for any cross-postings ***</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Dear colleagues,</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >We=92re excited to announce the release of<span style=3D"margin:0px">&nbsp= ;</span></span><span style=3D"margin:0px;font-weight:700;font-size:11pt;fon= t-family:Arial">soundata</span><span style=3D"margin:0px;font-weight:400;fo= nt-size:11pt;font-family:Arial">, a python library for reproducible use of audio datasets.</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Soundata can be installed via:<span style=3D"margin:0px">&nbsp;</span></sp= an><span style=3D"margin:0px;font-weight:700;font-size:11pt;font-family:&qu= ot;Courier New&quot;">pip install soundata</span></p> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >The source code lives here:<span style=3D"margin:0px">&nbsp;</span></span>= <a href=3D"https://github.com/soundata/soundata" style=3D"margin:0px"><span= style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial;color= :rgb(17, 85, 204);text-decoration:underline;text-decoration-skip-ink:none">= https://github.com/soundata/soundata</span></a></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >We=92re launching with<span style=3D"margin:0px">&nbsp;</span></span><a hr= ef=3D"https://soundata.readthedocs.io/en/latest/source/quick_reference.html= " style=3D"margin:0px"><span style=3D"margin:0px;font-weight:400;font-size:= 11pt;font-family:Arial;color:rgb(17, 85, 204);text-decoration:underline;tex= t-decoration-skip-ink:none">14 popular environmental sound datasets</span></a><span style=3D"margin:0px;f= ont-weight:400;font-size:11pt;font-family:Arial">, with plans to continue e= xpanding with additional datasets spanning a range of audio domains includi= ng speech and bioacoustics. For music datasets see<span style=3D"margin:0px">&nbsp;</span></span><a href=3D"http= s://github.com/mir-dataset-loaders/mirdata" style=3D"margin:0px"><span styl= e=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial;color:rgb(= 17, 85, 204);text-decoration:underline;text-decoration-skip-ink:none">mirda= ta</span></a><span style=3D"margin:0px;font-weight:400;font-size:11pt;font-= family:Arial">, which was the inspiration for soundata.</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Soundata makes it easy to:</span></p> <ul style=3D"background-color:rgb(255, 255, 255);font-weight:bold;margin-to= p:0px;margin-bottom:0px;padding-inline-start:48px"> <li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Download datasets to a common location and forma= t</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Validate that a downloaded dataset is complete a= nd perfectly matches a canonical version</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Load audio and annotation files into a common fo= rmat</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Parse clip-level metadata for detailed evaluatio= ns</span></p> </li></ul> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >We hope soundata will help the community to:</span></p> <ul style=3D"background-color:rgb(255, 255, 255);font-weight:bold;margin-to= p:0px;margin-bottom:0px;padding-inline-start:48px"> <li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Ensure results are reproducible by working again= st<span style=3D"margin:0px">&nbsp;</span></span><span style=3D"margin:0px;= font-style:italic">exactly</span><span style=3D"margin:0px">&nbsp;the same data</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Save time by avoiding manual downloads and havin= g to write custom dataset parsers</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Automate large-scale download, training, and eva= luation pipelines</span></p> </li><li dir=3D"ltr" style=3D"font-size:11pt;font-family:Arial;font-weight:= 400"> <p dir=3D"ltr" style=3D"line-height:1.38;margin-top:0pt;margin-bottom:0pt">= <span style=3D"margin:0px">Increase the visibility of new datasets by addin= g them to soundata</span></p> </li></ul> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Soundata is a cross-organizational collaboration spanning researchers from= <span style=3D"margin:0px">&nbsp;</span></span><a href=3D"https://steinhard= t.nyu.edu/marl" style=3D"margin:0px"><span style=3D"margin:0px;font-weight:= 400;font-size:11pt;font-family:Arial;color:rgb(17, 85, 204);text-decoration= :underline;text-decoration-skip-ink:none">MARL@xxxxxxxx</span></a><span style=3D= "margin:0px;font-weight:400;font-size:11pt;font-family:Arial">,<span style= =3D"margin:0px">&nbsp;</span></span><a href=3D"https://research.adobe.com/r= esearch/audio/" style=3D"margin:0px"><span style=3D"margin:0px;font-weight:= 400;font-size:11pt;font-family:Arial;color:rgb(17, 85, 204);text-decoration= :underline;text-decoration-skip-ink:none">Adobe Research</span></a><span style=3D"margin:0px;font-weight:400;font-size:11p= t;font-family:Arial">,<span style=3D"margin:0px">&nbsp;</span></span><a hre= f=3D"https://www.upf.edu/web/mtg" style=3D"margin:0px"><span style=3D"margi= n:0px;font-weight:400;font-size:11pt;font-family:Arial;color:rgb(17, 85, 20= 4);text-decoration:underline;text-decoration-skip-ink:none">MTG@xxxxxxxx</span><= /a><span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Ari= al">, and<span style=3D"margin:0px">&nbsp;</span></span><a href=3D"https://iie.f= ing.edu.uy/investigacion/grupos/gpa/en/audio-processing-group/" style=3D"ma= rgin:0px"><span style=3D"margin:0px;font-weight:400;font-size:11pt;font-fam= ily:Arial;color:rgb(17, 85, 204);text-decoration:underline;text-decoration-= skip-ink:none">GPA@xxxxxxxx</span></a><span style=3D"margin:0px;font-weight:4= 00;font-size:11pt;font-family:Arial">.&nbsp;</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >You can learn more about the library on our docs page:<span style=3D"margi= n:0px">&nbsp;</span></span><a href=3D"https://soundata.readthedocs.io/" sty= le=3D"margin:0px"><span style=3D"margin:0px;font-weight:400;font-size:11pt;= font-family:Arial;color:rgb(17, 85, 204);text-decoration:underline;text-dec= oration-skip-ink:none">https://soundata.readthedocs.io/</span></a></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >A bit more about the motivation for soundata can be found in our (work in = progress) paper:</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);line-height:1.3= 8;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-size:11pt;font-family:Arial">&quot;Soundata:= A Python library for reproducible use of audio datasets&quot;</span></p> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Magdalena Fuentes, Justin Salamon, Pablo Zinemanas, Mart=EDn Rocamora, Gen= =EDs Plaja, Ir=E1n R. Rom=E1n, Marius Miron, Xavier Serra, Juan Pablo Bello= </span></p> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >[</span><a href=3D"https://arxiv.org/abs/2109.12690" style=3D"margin:0px">= <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial;= color:rgb(17, 85, 204);text-decoration:underline;text-decoration-skip-ink:n= one">arXiv</span></a><span style=3D"margin:0px;font-weight:400;font-size:11= pt;font-family:Arial">]</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >We *welcome and encourage* contributions from the community, especially da= ta loaders for datasets not included yet in soundata.</span></p> <br style=3D"background-color:rgb(255, 255, 255)"> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Cheers,</span></p> <p dir=3D"ltr" style=3D"background-color:rgb(255, 255, 255);font-weight:bol= d;line-height:1.38;margin-top:0pt;margin-bottom:0pt"> <span style=3D"margin:0px;font-weight:400;font-size:11pt;font-family:Arial"= >Justin &amp; Magdalena on behalf of the soundata team</span></p> <br> </div> <div> <div style=3D"font-family: Arial, Helvetica, sans-serif; font-size: 10pt; c= olor: rgb(0, 0, 0);"> <br> </div> <div id=3D"Signature"> <div> <div style=3D"font-family:Arial,Helvetica,sans-serif; font-size:10pt; color= :rgb(0,0,0)"> --</div> <div style=3D"font-family:Arial,Helvetica,sans-serif; font-size:10pt; color= :rgb(0,0,0)"> Justin Salamon | Adobe Research | www.justinsalamon.com</div> </div> </div> </div> </body> </html> --_000_BYAPR02MB53333ED80753C5198BD5FEC5AB8C9BYAPR02MB5333namp_--


This message came from the mail archive
src/postings/2021/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University