Data Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra

:::


Title: Data Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra
Journal: International Journal of Computational Linguistics & Chinese Language Processing
Authors: Liang, Min-siong; Lyu, Ren-yuan; Chiang, Yuang-chin
Publication date: 2008
Volume (issue): 13(2)
Pages: 233-253
Keywords: Automatic phonetic transcription; Phone recognition; Grapheme-to-phoneme; G2P; Pronunciation variation; Chinese text; Taiwanese; Min-Nan; Dialect; Buddhist Sutra

We propose a new approach for performing phonetic transcription of text that utilizes automatic speech recognition (ASR) to help traditional grapheme-to-phoneme (G2P) techniques. This approach was applied to transcribe Chinese text into Taiwanese phonetic symbols. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multiple pronunciations of text, we are able to reduce the error rate of phonetic transcription. Using a pronunciation lexicon with multiple pronunciations for each item, a transcription error rate of 12.74% was achieved. Further improvement can be achieved by adapting the pronunciation lexicon with pronunciation variation (PV) rules derived manually from corrected transcription in a speech corpus. The PV rules can be categorized into two kinds: knowledge-based and data-driven rules. By incorporating the PV rules, an error rate of 10.56% could be achieved. Although this technique was developed for Taiwanese speech, it could easily be adapted to other Chinese spoken languages or dialects.
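The sausage searching net mentioned in the abstract can be pictured as a sequence of slots, one per character of the text, where each slot holds that character's candidate pronunciations. The following is a minimal sketch of that idea only, not the authors' implementation. The function names, the toy lexicon (toneless romanizations), the single PV rule, and the per-slot scores are all illustrative assumptions. A real recognizer would score whole paths through the net against the audio with acoustic models; the sketch simplifies this to an independent choice in each slot.

```python
from typing import Callable, Dict, List

Lexicon = Dict[str, List[str]]   # character -> candidate pronunciations
Sausage = List[List[str]]        # one slot of alternatives per character


def build_sausage(text: str, lexicon: Lexicon) -> Sausage:
    """Create one slot per character, holding every pronunciation in the lexicon."""
    return [list(lexicon[ch]) for ch in text]


def apply_pv_rules(sausage: Sausage, rules: Dict[str, List[str]]) -> Sausage:
    """Extend each slot with pronunciation variants, skipping duplicates."""
    expanded = []
    for slot in sausage:
        new_slot = list(slot)
        for pron in slot:
            for variant in rules.get(pron, []):
                if variant not in new_slot:
                    new_slot.append(variant)
        expanded.append(new_slot)
    return expanded


def decode(sausage: Sausage, score: Callable[[int, str], float]) -> List[str]:
    """Pick the best-scoring candidate per slot (stand-in for ASR decoding)."""
    return [max(slot, key=lambda p: score(i, p)) for i, slot in enumerate(sausage)]


# Toy lexicon (assumption): two readings per character, tones omitted.
toy_lexicon = {"大": ["tua", "tai"], "人": ["lang", "jin"]}
# Hypothetical PV rule: "jin" may also be realized as "lin".
toy_rules = {"jin": ["lin"]}
# Hypothetical per-slot scores standing in for acoustic likelihoods.
toy_scores = {(0, "tai"): 0.9, (1, "lin"): 0.8}

net = apply_pv_rules(build_sausage("大人", toy_lexicon), toy_rules)
best = decode(net, lambda i, p: toy_scores.get((i, p), 0.0))
print(net)
print(best)
```

In this sketch, adding a PV rule only widens the slots that contain the affected pronunciation, which mirrors the abstract's point that the lexicon is adapted while the text itself stays fixed.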


Journal articles
1. Lyu, Ren-yuan; Liang, Min-siong; Chiang, Yuang-chin (2004). Toward Constructing a Multilingual Speech Corpus for Taiwanese (Min-nan), Hakka, and Mandarin. International Journal of Computational Linguistics & Chinese Language Processing, 9(2), 1-12.
2. Lamel, L. F.; Gauvain, J. L.; Adda, G. (2002). Lightly Supervised and Unsupervised Acoustic Model Training. Computer Speech and Language, 16(1), 115-229.
3. Cremelie, N.; Martens, J.-P. (1999). In Search of Better Pronunciation Models for Speech Recognition. Speech Communication, 29, 115-136.
4. Hain, T. (2005). Implicit modeling of pronunciation variation in automatic speech recognition. Speech Communication, 46, 171-188.
5. Nanjo, H.; Kawahara, T. (2004). Language Model and Speaking Rate Adaptation for Spontaneous Presentation Speech Recognition. IEEE Transactions on Speech and Audio Processing, 12, 391-400.
6. Saraclar, M.; Khudanpur, S. (2004). Pronunciation change in conversational speech and its implications for automatic speech recognition. Computer Speech and Language, 18, 375-395.

Conference papers
1. Liang, M. S.; Yang, J. C.; Chiang, Y. C.; Lyu, R. Y. (2004). A Taiwanese Text-to-Speech System with Applications to Language Learning.

Research reports
1. (2003). U.S. Department of State's Bureau of International Information Programs, IIP report.

Books
1. Cover, Thomas M.; Thomas, Joy A. (1991). Elements of Information Theory. John Wiley & Sons, Inc.
2. Sik, D. G. (2004). The Four Basic Sutra in Taiwanese. HsinChu, Taiwan.
3. Sik, D. G. (2004). Earth Treasure Sutra in Taiwanese. HsinChu, Taiwan.
4. Young, S.; Evermann, G.; Gales, M.; Hain, T.; Kershaw, D.; Liu, X.; Moore, G.; Odell, J.; Ollason, D.; Povey, D.; Valtchev, V.; Woodland, P. (2008). The HTK Book 3.2.

Other
1. Chen, C. H. (2006). Sutra on the original Vows of Bodhisattva Earth Treasure in English.
2. Evermann, G.; Chan, H. Y.; Gales, M. J. F.; Hain, T.; Liu, X.; Mrva, D.; Wang, L.; Woodland, P. C. (2004). Development of the 2003 CU-HTK Conversational Telephone Speech Transcription System. Montreal, Canada.
3. Haeb-Umbach, R.; Beyerlein, P.; Thelen, E. (1995). Automatic Transcription of Unknown Words in a Speech Recognition system. Detroit.
4. Kanokphara, S.; Tesprasit, V.; Thongprasirt, R. (2003). Pronunciation Variation Speech Recognition without Dictionary Modification on Sparse Database. Hong Kong.
5. Kim, D. Y.; Chan, H. Y.; Evermann, G.; Gales, M. J. F.; Mrva, D.; Sim, K. C.; Woodland, P. C. (2005). Development of the CU-HTK 2004 Broadcast News Transcription Systems. Philadelphia, USA.
6. Liang, M. S.; Lyu, D. C.; Chiang, Y. C.; Lyu, R. Y. (2004). Construct a Multi-Lingual Speech Corpus in Taiwan with Extracting Phonetically Balanced Articles. Jeju Island, Korea.
7. Nouza, J.; Nejedlova, D.; Zdansky, J.; Kolorenc, J. (2004). Very Large Vocabulary Speech Recognition System for Automatic Transcription of Czech Broadcast Programs. Jeju Island, Korea.
8. Raux, A. (2004). Automated Lexical Adaptation and Speaker Clustering based on Pronunciation Habits for Non-Native Speech Recognition. Jeju Island, Korea.
9. Sarada, G. L.; Hemalatha, N.; Nagarajan, T.; Murthy, H. A. (2004). Automatic Transcription of Continuous Speech using Unsupervised and Incremental Training. Jeju Island, Korea.
10. Siohan, O.; Ramabhadran, B.; Zweig, G. (2004). Speech Recognition Error Analysis on the English MALACH Corpus. Jeju Island, Korea.
11. Soltau, H.; Kingsbury, B.; Mangu, L.; Povey, D.; Saon, G.; Zweig, G. (2005). The IBM 2004 Conversational Telephony System for Rich Transcription. Philadelphia, USA.
12. Tripitaka, S. S. (2005). Sutra on the original Vows of Bodhisattva Earth Treasure in Chinese.
13. Tsai, M. Y.; Chou, F. C.; Lee, L. S. (2002). Improved pronunciation modeling by inverse word frequency and pronunciation entropy.
14. Wu, J.; Gupta, V. (1999). Application of Simultaneous Decoding Algorithm to Automatic Transcription of Known and Unknown Words. Phoenix, USA.


:::


臺灣人文及社會科學引文索引資料庫系統
