[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: MSD: announcing the Last.fm dataset

To: AUDITORY@xxxxxxxxxxxxxxx
Subject: Re: MSD: announcing the *Last.fm dataset*
From: Yi Yu <yi.yu.yy@xxxxxxxxx>
Date: Fri, 21 Oct 2011 14:16:57 +0800
Approved-by: yi.yu.yy@xxxxxxxxx
Comments: To: Thierry Bertin-Mahieux <tb2332@xxxxxxxxxxxx>
Delivery-date: Fri Oct 21 02:23:55 2011
In-reply-to: <25014_1319127673_4EA04A79_25014_42_1_20111020121241.05dc07nu4gw8gk8s@xxxxxxxxxxxxxxxxxxxxxxx>
List-archive: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
List-help: <http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO AUDITORY>
List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
References: <25014_1319127673_4EA04A79_25014_42_1_20111020121241.05dc07nu4gw8gk8s@xxxxxxxxxxxxxxxxxxxxxxx>
Reply-to: Yi Yu <yi.yu.yy@xxxxxxxxx>
Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Hi Thierry

The attached are two papers addressing how to improve content-based scalable searchability over a large audio database.

PS: I am at NUS in Singapore. If you have any question, please donot hesitate to contact me.

Congratulations on your job.

Thanks

Yi Yu

On Fri, Oct 21, 2011 at 12:12 AM, Thierry Bertin-Mahieux <tb2332@xxxxxxxxxxxx> wrote:

The Million Song Dataset (MSD) team is proud to partner with Last.fm to announce a new complementary dataset: the Last.fm dataset. It contains song-level tags and song-to-song similarity. And it's big (i.e. BIG)! A few numbers:
http://labrosa.ee.columbia.edu/millionsong/lastfm

* 943,347 matched tracks MSD <-> Last.fm
* 505,216 tracks with at least one tag
* 584,897 tracks with at least one similar track
* 522,366 unique tags
* 8,598,630 (track - tag) pairs
* 56,506,688 (track - similar track) pairs

We thank Last.fm (http://www.last.fm/) for making this data available, it is the largest addition to the MSD so far. We are convinced that its impact on music information retrieval will be considerable.

As always, we appreciate any feedback! For instance, my favorite tag so far is "Acid Smurfs". A few additional notes on the MSD:
- we are working on some additional data regarding collaborative filtering, more on this at ISMIR
- we turned the CAL500 and CAL10K datasets into MSD format (http://bit.ly/oyBCwQ)
- please consider attending our tutorial at ISMIR (http://bit.ly/pSwlEA)

Happy swimming in data!
Thierry Bertin-Mahieux
Million Song Dataset team
http://labrosa.ee.columbia.edu/millionsong/

Attachment: MM09.pdf
Description: Adobe PDF document

Attachment: MM10.pdf
Description: Adobe PDF document

Prev by Date: SysMus 2012 - Appel de communications / Call for paper
Next by Date: using apex software
Previous by thread: MSD: announcing the *Last.fm dataset*
Next by thread: SysMus 2012 - Appel de communications / Call for paper
Index(es):
- Date
- Thread