Re: Features for robust speaker identification (DeLiang Wang )


Subject: Re: Features for robust speaker identification
From:    DeLiang Wang  <dwang@xxxxxxxx>
Date:    Wed, 17 Sep 2014 21:21:46 -0400
List-Archive:<http://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

One feature we proposed and found to be rather effective for robust speaker identification is GFCC (gammatone frequency cepstral coefficient). Its description and analysis are given below: - Shao Y. and Wang D.L. (2008): "Robust speaker identification using auditory features and computational auditory scene analysis." ICASSP-08, pp. 1589-1592. - Zhao X., Shao Y., and Wang D.L. (2012): "CASA-based robust speaker identification," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, pp. 1608-1616. - Zhao X. and Wang D.L. (2013): "Analyzing noise robustness of MFCC and GFCC features in speaker identification," ICASSP-13, pp. 7204-7208. You can also find the Matlab code for GFCC extraction on my lab's website. Cheers, DeLiang On 9/16/2014 12:23 PM, Celestino Alvarez wrote: > Dear list, > > I was planning to build a speaker identification application, and I > was wondering what are the best features for a robust identification. > > Any advise on the right papers to read, would help. > > Best, > > Tino -- ------------------------------------------------------------ DeLiang Wang, Professor Co-Editor-in-Chief, Neural Networks Department of Computer Science and Engineering The Ohio State University 2015 Neil Ave. Columbus, OH 43210-1277, U.S.A. Phone: 614-292-6827 (OFFICE); 614-292-7402 (LAB) http://www.cse.ohio-state.edu/~dwang "Happiness = Reality - Expectation"


This message came from the mail archive
http://www.auditory.org/postings/2014/
maintained by:
DAn Ellis <dpwe@ee.columbia.edu>
Electrical Engineering Dept., Columbia University