:::

詳目顯示

回上一頁
題名:建立語者韻母音色模型並應用於非限定語詞式之語者驗證
書刊名:前瞻科技與管理
作者:呂嘉穀耿良才蒲長恩蕭志濱
作者(外文):Leu, Jia-guuGeeng, Liang-tsairPu, Chang-enShiau, Jyh-bin
出版日期:2014
卷期:4:2
頁次:頁71-98
主題關鍵詞:非限定語詞語者模型語者驗證頻譜正規化韻母識別Text-independentSpeaker modelSpeaker verificationSpectrum normalizationVowel detection
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:15
以非限定語詞方式進行語者驗證,較以限定語詞方式增加了許多彈性。但是,因為用來比對的語句,其中的語詞並不相同,故難以做直接的對應。然而,儘管語句不同,其中之字音卻是由相同的音素集所建構而成。音素包括聲母與韻母,一般而言,單韻母的數目較聲母為少,且在語句中韻母出現的時間長度及音量都較聲母為大,適合用來做為比對音色的基礎。本文所提出的方法是先為5個國語中的基本單韻母建立頻譜模型。其次,再用這些模型來分析一個語者的語句,偵測出其中出現的單韻母音,並用以建立起該語者的一個韻母音色模型。在驗證2組語句之語者是否為同一人時,我們先分別自兩組語句建立起兩個語者的音色模型,再藉著比對此二模型之相似程度來進行驗證。
In text-independent speaker verification, we compare two sets of sentences with different text content for timbre similarity to determine if they came from the same speaker. Since the sentences are different, we may not have many matching words to compare. However, the sentences are constructed from the same set of phonemes of the language used, including vowels and consonants. Generally speaking, simple vowels are fewer in number, but are the more prominent parts of a sentence in terms of duration and loudness, very suitable to be used for timbre comparison. In this paper, we first built spectral models for 5 simple vowels in Mandarin Chinese. Then we applied the models to analyze two given sets of speech sentences, detecting the various simple vowels in the sentences, and used the detected vowels to build a timbre model for each speaker. After that, we are able to compare the two speaker models to determine if the two speakers are indeed the same person.
期刊論文
1.Kinnunen, T.、Li, H.(2010)。An Overview of Text-Independent Speaker Recognition: From Features to Supervectors。Speech Communication,52,12-40。  new window
2.Benzeguiba, M.、De Mori, R.、Deroo, O.、Dupon, S.、Erbes, T.、Fissore, L.、Wellekens, C.(2007)。Automatic Speech Recognition and Speech Variability: A Review。Speech Communication,49(10),763-786。  new window
3.Campbell, J. P.、Shen, W.、Campbell, W. M.、Schwartz, R.、Bonastre, J. F.、Matrouf, D.(2009)。Forensic Speaker Recognition-A need for caution。IEEE Signal Processing Magazine,26(2),95-103。  new window
4.Faundez-Zanuy, M.、Monte-Moreno, E.(2005)。State-of-the-art in Speaker Recognition。IEEE A&E System Magazine,20(5),7-12。  new window
5.Jessen, M.、Becker, T.(2010)。Long-Term Formant Distribution as a Forensic-Phonetic Feature。Journal of Acoustical Society of America,28(4),2378。  new window
6.Juang, B. H.、Rabiner, L. R.、Wilpon, J. G.(1987)。On the Use of Bandpass Liftering in Speech Recognition。IEEE Trans. On Acoustics, Speech, and Signal Processing,ASSP-35(7),947-954。  new window
7.Klerin, W.、Plomp, R.、Pols, L. C. W.(1970)。Vowel Spectra, Vowel Spaces, and Vowel Identification。The Journal of the Acoustical Society of America,48(2),999-1009。  new window
8.Markel, J. D.、Davis, B.(1979)。Text-Independent Speaker Recognition from a Large Linguistically Unconstrained Time-Spaced Database。IEEE Trans. Acoustics, Speech, and Signal Processing,ASSP-27(1),74-82。  new window
9.Morrison, G. S.(2009)。Forensic Voice Comparison and the Paradigm Shift。Science and Justice,49,298-308。  new window
10.Nolan, F.、Grigoras, C.(2005)。A Case for Formant Analysis in Forensic Speaker Identification。International Journal of Speech Language and the Law,12(2),143-173。  new window
11.Paliwal, K. K.(1982)。On the Performance of the Quefrency-Weighted Cepstral Coefficients in Vowel Recognition。Speech Communication,1(2),151-154。  new window
12.Wakita, H.(1977)。Normalization of Vowels by Vocal-Tract Length and Its Application to Vowel Identification。IEEE Trans. On Acoustics, Speech, and Signal Processing,ASSP-25(2),183-192。  new window
13.Zahorian, S. A.、Nossair, Z. B.(1999)。A Partitioned Neural Network Approach for Vowel Classification Using Smoothed Time/Frequency Features。IEEE Trans. On Speech and Audio Processing,7(4),414-425。  new window
14.Zhang, C.、Tan, T.(2008)。Voice Disguise and Automatic Speaker Recognition。Forensic Science Informational,175,118-122。  new window
15.Bimbot, F. J.、Bonastre, F.、Fredouille, C.、Gravier, G.、Magrin-Chagnolleau, I.、Meignier, S.、Merlin, T.、Ortega-Garcia, J.、Petrovska-Delacretaz, D.、Reynolds, D. A.(2004)。A tutorial on text-independent speaker verification。EURASIP Journal of Applied Signal Processing,4,430-451。  new window
會議論文
1.Han, W.、Chan, C.、Choy, C.、Pun, K.(2006)。An efficient MFCC extraction method in speech recognition。2006 IEEE International Symposium on Circuits and Systems,(會議日期: 2006/05/21-25/24)。Island of Kos。145-148。  new window
2.Azmi, S., M. Y.、Siraj, F.、Yaacob, S.、Paulraj, M. P.、Nazri, A.(2010)。Improved Malay Vowel Feature Extraction Method Based on First and Second Formants。The 2nd International Conference on Computational Intelligence, Modeling and Simulation,339-344。  new window
3.Kimber, D.、Bush, M. A.、Tajchman, G. N.(1990)。Speaker-Independent Vowel Classification Using hidden Markov Models and LVQ2。1990 IEEE ICASSP Conference。Albuquerque, New Mexico。497-500。  new window
4.Pan, F.、Zhao, Q.、Yan, Y.(2008)。Mandarin Vowel Pronunciation Quality Evaluation by a Novel Formant Classification Method and Its Combination with Traditional Algorithms。2008 IEEE ICASSP Conference。Las Vegas, Nevada。5061-5064。  new window
5.Paulraj, M. P.、Yaacob, S.、Azmi, S. M. Y.(2008)。Vowel Classification Based on Frequency Response of Vocal Tmct。2008 IEEE International Conference on Computer and Communication Engineering。Kuala Lumpur, Malaysia。1125-1130。  new window
6.Pellegrino, F.、Andre-Obercht, R.(1997)。From Vocalic Detection to Automatic Emergence of Vowel Systems。1997 IEEE ICASSP Conference。Munich, Germany。1651-1654。  new window
7.Schmid, P.、Barnard, E.(1997)。Explicit, N-best Formant Features for Vowel Classification。1997 IEEE ICASSP Conference。Munich, Germany。991-994。  new window
8.Ye, J.、Kobayashi, T.、Higuchi, T.(2010)。Vowel Recognition Based on FLAC Acoustic Features and Subspace Classifier。2010 IEEE ICASSP Conference。Dallas, TX。530-533。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE