The RAVDESS is a validated multimodal database of emotional speech and song. The database is gender-balanced, consisting of 24 professional actors vocalizing lexically matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions, and song contains calm, happy, sad, angry, and fearful emotions. Each expression is produced at two levels of emotional intensity, with an additional neutral expression. All conditions are available in face-and-voice, face-only, and voice-only formats. Each of the 7,356 recordings was rated 10 times on emotional validity, intensity, and genuineness. Ratings were provided by 247 individuals who were characteristic of untrained research participants from North America.
Livingstone, S. R., & Russo, F. A. (2018). The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English. PLoS ONE, 13(5), e0196391.
The database has been widely used in psychological, neural, and computational studies of emotion. An acoustic analysis of the RAVDESS was recently reported here:
Major, D. P., & Chatterjee, M. (2026). Acoustic analyses of the RAVDESS corpus of emotional stimuli. JASA Express Letters, 6(2).
Best,
Frank
________________________________
Frank A. Russo, PhD
Professor (Full), Psychology, Faculty of Arts, Toronto Metropolitan University
Professor (Status), Faculties of Music and Medicine, University of Toronto
Adjunct Scientist, KITE, Toronto Rehabilitation Institute, University Health Network
Scientific Director, SMART Lab
Scientific Director, SingWell Project
Chief Science Officer, LUCID Therapeutics
Tel: 01-416-979-5000, x. 552647 (office), x. 554989 (lab)
russo@xxxxxxxxxxxx
On Feb 26, 2026, at 04:13, Jochem Rieger <jochem.rieger@xxxxxxxxxxxxxxxx> wrote:
GAUDIE is an extensively validated German naturalistic auditory speech database with positive, neutral, and negative speech sequences, available for non-profit academic research purposes. It comprises 37 audio speech sequences with a total duration of 92 minutes. The database is described in Lingelbach et al. (2024):
Lingelbach, K., Vukelić, M. & Rieger, J.W. GAUDIE: Development, validation, and exploration of a naturalistic German AUDItory Emotional database. Behav Res 56, 2049–2063 (2024). https://doi.org/10.3758/s13428-023-02135-z
Best,
Jochem
On 25.02.26 15:46, Morgan, Shae wrote:
If the stimuli can include speech rather than only nonverbal emotional sounds, there are many databases available:
Toronto Emotional Speech Set (TESS; Dupuis & Pichora-Fuller)
Morgan Emotional Speech Set (MESS; Morgan, 2019)
Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS; Livingstone & Russo, 2018)
They all differ in which variables they code for. RAVDESS has song and speech, limited sentence variability (two sentence frames), but many talkers and emotional expressions at high and low intensity. MESS has more semantic variability and target words (making it suitable for word and emotion recognition studies, masking, intelligibility, etc.), but fewer talkers (3 female, 3 male) recorded at a single intensity for 4 emotion groups. TESS has few talkers but includes age as a factor and a good number of emotional expressions.
Table 1 in Morgan & LaPaugh (2025) has a summary of commonly used emotional speech databases with citations for each - you can read up on the methods to see which database would suit your needs!
Morgan, S. D., & LaPaugh, B. (2025). Methodological Stimulus Considerations for Auditory Emotion Recognition Test Design. Journal of Speech, Language, and Hearing Research, 68(3), 1209-1224.
Cheers!
Associate Professor
Program Director, Audiology
Best wishes,
Sophie
Prof Sophie Scott CBE
Director, Institute of Cognitive Neuroscience, UCL
17 Queen Square, London WC1N 3AZ
020 7679 1144 (office)
07881853586 (mobile)
Hello all,
I am in need of a set of freely available emotion vocalisations for a study I am doing on detecting emotions in the human voice.
I am aware of the Montreal Affective Voices. I am also aware of the International Affective Digitized Sounds (IADS-E) database.
Is there any reason to choose one over the other or anybody have any other suggestions?
Thanks in advance,
Maria
Professor Maria Uther, C.Psychol., CSci., AFBPsS
Honorary Professor, University of Wolverhampton
--
Jochem Rieger
Prof. Dr. rer. nat. habil.
Applied Neurocognitive Psychology
DFG Center for Open and Reproducible Neuroscience Tools
COST Action INDoS CA24161
DFG RTG 2783 Neuromodulation of motor and cognitive function
Carl-von-Ossietzky University Oldenburg
Phone: +49 (0)441 798 4533
Web: https://uol.de/en/applied-neurocognitive-psychology
github: https://github.com/ANCPLabOldenburg
https://www.indos-costaction.eu/