[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [AUDITORY] Feedback on features for music similarity
- To: AUDITORY@xxxxxxxxxxxxxxx
- Subject: Re: [AUDITORY] Feedback on features for music similarity
- From: Paul Arzelier <paul.arzelier@xxxxxxx>
- Date: Wed, 31 Jul 2024 20:19:33 +0200
- Approved-by: paul.arzelier@xxxxxxx
- Arc-authentication-results: i=1; mx.google.com;       dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=D5gJKYie;       spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx;       dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=free.fr
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816;        h=list-archive:list-owner:list-subscribe:list-unsubscribe:list-help         :precedence:in-reply-to:to:subject:from:sender:reply-to:date         :message-id:content-transfer-encoding:content-language:references         :user-agent:mime-version:approved-by:dkim-signature;        bh=CWXFns+qFQTFwA79T1B6WQM9M2ZjERJ0dWq0pg86KQk=;        fh=5/42mu9FVmfuMp6n0xGXVcDar2H3ENcHt8Uv11Om8gY=;        b=wsOogE//NtFZ7WO3qnGVEhu1J4aK3DlnBZZjFBRzupzbS1tvqjeJ+ifGw4oszHxtfV         J2JbAVWXpFfOP6bD7/wia8m6ZEsCH3SCuGusdGudW6neEQoRJRerR8GWO7yfVOWcbwpw         gsTcHoFqw7beIrXDIWkSe1HJR48Vp4GmqFgA5drI86yB6DAv2//TBwd7BsiZ9aEIAVC/         Of4ifA03gDteDsL1+w18S1QqvjfRd6AbilinZ0MOLX76XAhn5id8C+hmRLcAiZq3LluN         M7gY7Yp5P2n9CXRCgYWeqpSsWxmRgzc3MejJH8z7AIwA0b6pD/hh3EjI+xdHAVt2P6C4         1KLQ==;        dara=google.com
- Arc-seal: i=1; a=rsa-sha256; t=1722485808; cv=none;        d=google.com; s=arc-20160816;        b=ILbtDeZvN/FLWRsSE5Ut9/zjdJxCo3lAjJbR4tpwalSbtXTYuUvYrYZB/J0+z2P4nR         c3GMWAnZ4OoM3lr16lvbk6CT9TQjrOAFP/ffbaOfC0ji6LCrWcPv3vNLFLwh4zNKPDeb         Bg8QPVcwzJCrIfWEw9j5q/DNno/QuFp7un79gYIc4sPUUNvjYiovtPBnGL/1ffi+8U/F         nYfi+5CFDOkf0Nhgo9fuG/w1jorueodWJRnBVcXeh3QIB2MLvQq/rFHBg6V4qYVm7XnQ         jcLieNI1J14zj2tjvdOKXKaPzt8A0NE6BIY7HwxumWU9xbylNFm6DwTRYaOXFVjzosz3         E4MA==
- Authentication-results: mx.google.com;       dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=D5gJKYie;       spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx;       dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=free.fr
- Delivered-to: dan.ellis@xxxxxxxxx
- Dkim-signature: v=1; a=rsa-sha256; d=LISTS.MCGILL.CA; s=SELECTOR1; c=relaxed/relaxed; bh=CWXFns+qFQTFwA79T1B6WQM9M2ZjERJ0dWq0pg86KQk=; i=@LISTS.MCGILL.CA; h=Approved-By:MIME-Version:User-Agent:References:Content-Language:Content-Type:Content-Transfer-Encoding:Message-ID:Date:Reply-To:Sender:From:Subject:To:In-Reply-To:List-Help:List-Unsubscribe:List-Subscribe:List-Owner:List-Archive; b=D5gJKYie6eJubJgGue3CHTz4s9R63nKicEGcKuag2jsxduUQv8wrsHemBEkKw7wzIt21CQ4r8m3qTbz795nkpd3F9uOzbAhTmSW7jO2nMX5FwCvc7KwGZRI6kZBLe1kfl8jtwx98CAaRycLQLXj0VI6LDkRH6i8EiS5XNCtNC2dvaLS26jKqR0/Akv1Kjf6QxNyTcCreY2TEsT8nePIBsfXh38yqT4apc9MR9zSSLJL3ivllPmTRMabHsjvIZdzOAZUUbIkoaQlcGXwbHxA0duGPUeYp56piobfe7/VpYcL6lQG8oeEw/u0wuXnjSQhIYI3ayu8QUg4DPiAPhdDI5A==
- In-reply-to: <19C7F226-DF0C-4779-9F5B-AF324476CB5B@ccrma.stanford.edu>
- List-archive: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>
- List-help: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>,           <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>
- List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>
- List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>
- List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>
- References: <ee531ad8-9554-4612-99a7-5cb3ba3fd452@free.fr>            <19C7F226-DF0C-4779-9F5B-AF324476CB5B@ccrma.stanford.edu>
- Reply-to: Paul Arzelier <paul.arzelier@xxxxxxx>
- Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>
- User-agent: Mozilla Thunderbird
Hi!
Thanks for your answer, and appreciate the kind words! You definitely 
make some good points:
1. I did NOT think about windowing the songs in "meaningful" moments, 
but the beginning and the end make a lot of sense since we wanna make it 
smooth. I'll explore and see what that yield.
3. I'm also thinking of letting users choose themselves, and see what 
they like more.
5. It's super cool, I didn't know they exposed that! I'll definitely use 
it to check that it matches at least e.g. the key the song is in.
@Pierre - do you have a hunch of what they might say? One problem is 
that musicians are not really concerned with the harsh reality of 
digital signal processing :D
Best Regards,
Paul
Le 30/07/2024 à 17:54, Hem, Charles a écrit :
Hi Paul,
Very cool project!  I'm a PhD student at Harvard studying cochlear 
implants on the human perception side of things, so I'm no audio 
specialist myself.  Trying to study music perception for cochlear 
implanted recipients brings me back to featurization of music 
frequently, though, so I have a couple non-expert thoughts for bullets 
1, 3, and 5:
1:  Maybe it's adding too much complexity, but I wonder if 
splitting your feature analysis into more windows could be helpful if 
you're most concerned about seamless transitions between songs.  For 
example, "start of song" (first 20 seconds, or something near that), 
"whole song", and "end of song" (last 20 seconds) could be used to 
give some extra weight to making sure that the "end of song" from the 
first song matches the "start of song" of the following song. Mainly 
for songs that are highly variable or have a drastically different intro.
3:  I haven't read the chroma paper you cited, but gut reaction feels 
like these features shouldn't outweigh tempo and loudness so much.  I 
think your four class division could be a good idea.
5:  Spotify song features are pretty easy to get using this: 
https://developer.spotify.com/documentation/web-api/reference/get-audio-features. 
The features are much more abstract than your features, but in my 
experience, Spotify's smart playlists do a pretty good job.  I'm 
guessing these features are partially used in that, so could be a nice 
reference to compare your results to.
Most of this probably isn't super helpful, but I'll keep an eye out 
for any updates about the project!  It could be a cool research tool 
to quantify people's music preferences one day, which would be an 
interesting use for me.
Best,
Charles Hem