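Improving Neural Network Architectures and Optimization Methods for Discriminatively Trained Acoustic Models (結合鑑別式訓練聲學模型之類神經網路架構及優化方法的改進)__Taiwan Citation Index - Humanities and Social Sciences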

Title:	Improving Neural Network Architectures and Optimization Methods for Discriminatively Trained Acoustic Models (結合鑑別式訓練聲學模型之類神經網路架構及優化方法的改進)
Journal:	International Journal of Computational Linguistics & Chinese Language Processing
Authors:	趙偉成／張修瑞／羅天宏／陳柏琳
Authors (romanized):	Chao, Wei-cheng／Chang, Hsiu-jui／Lo, Tien-hong／Chen, Berlin
Publication date:	2018
Volume/Issue:	23:2
Pages:	35-46
Subject keywords:	Mandarin large vocabulary continuous speech recognition；Acoustic model；Discriminative training；Matrix factorization；Backstitch
Citation counts:	Cited by: journal articles (0), doctoral dissertations (0), books (0), book chapters (0); excluding self-citations: 0; co-citations: 8; views: 0

Journal articles
1.	Rabiner, Lawrence R.(1989)。A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition。Proceedings of the IEEE，77(2)，257-286。
2.	Wang, Hsin-min、Chen, Berlin、Kuo, Jen-wei、Cheng, Shih-sian(2005)。MATBN: A Mandarin Chinese Broadcast News Corpus。International Journal of Computational Linguistics & Chinese Language Processing，10(2)，219-235。
3.	Hinton, Geoffrey E.、Deng, Li、Yu, Dong、Dahl, George E.、Mohamed, Abdel-rahman、Jaitly, Navdeep、Kingsbury, Brian(2012)。Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups。IEEE Signal Processing Magazine，29(6)，82-97。
4.	Waibel, A.、Hanazawa, T.、Hinton, G.、Shikano, K.、Lang, K. J.(1989)。Phoneme recognition using time-delay neural networks。IEEE Transactions on Acoustics, Speech, and Signal Processing，37(3)，328-339。
5.	Gales, M.、Young, S.(2008)。The application of hidden Markov models in speech recognition。Foundations and Trends® in Signal Processing，1(3)，195-304。
6.	Pascanu, R.、Mikolov, T.、Bengio, Y.(2013)。On the difficulty of training recurrent neural networks。Proceedings of ICML 2013，28，1310-1318。

Conference papers
1.	Stolcke, A.(2002)。SRILM--An Extensible Language Modeling Toolkit。International Conference on Spoken Language Processing，901-904。
2.	Povey, D.、Ghoshal, A.、Boulianne, G.、Burget, L.、Glembek, O.、Goel, N.、Hannemann, M.、Motlicek, P.、Qian, Y.、Schwarz, P.、Silovsky, J.、Stemmer, G.、Vesely, K.(2011)。The Kaldi speech recognition toolkit。The IEEE 2011 Workshop on Automatic Speech Recognition and Understanding。
3.	Graves, Alex、Mohamed, Abdel-Rahman、Hinton, Geoffrey E.(2013)。Speech recognition with deep recurrent neural networks。The 2013 IEEE International Conference on Acoustics, Speech and Signal Processing。IEEE。
4.	Povey, D.、Peddinti, V.、Galvez, D.、Ghahremani, P.、Manohar, V.、Na, X.、Wang, Y.、Khudanpur, S.(2016)。Purely sequence-trained neural networks for ASR based on lattice-free MMI。17th Annual Conference of the International Speech Communication Association，2751-2755。
5.	Bahl, L.、Brown, P.、de Souza, P.、Mercer, R.(1986)。Maximum mutual information estimation of hidden Markov model parameters for speech recognition。IEEE International Conference on Acoustics, Speech, and Signal Processing，49-52。
6.	Graves, A.、Fernández, S.、Gomez, F.、Schmidhuber, J.(2006)。Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks。ICML 2006，369-376。
7.	Ba, J.、Caruana, R.(2014)。Do deep nets really need to be deep?。NIPS 2014，2654-2662。
8.	Povey, D.、Cheng, G.、Wang, Y.、Li, K.、Xu, H.、Yarmohammadi, M.、Khudanpur, S.(2018)。Semi-orthogonal low-rank matrix factorization for deep neural networks。19th Annual Conference of the International Speech Communication Association，3743-3747。
9.	Hadian, H.、Sameti, H.、Povey, D.、Khudanpur, S.(2018)。End-to-end speech recognition using lattice-free MMI。19th Annual Conference of the International Speech Communication Association，12-16。
10.	Povey, D.、Hadian, H.、Ghahremani, P.、Li, K.、Khudanpur, S.(2018)。A time-restricted self-attention layer for ASR。IEEE International Conference on Acoustics, Speech and Signal Processing，5874-5878。
11.	Veselý, K.、Ghoshal, A.、Burget, L.、Povey, D.(2013)。Sequence-discriminative training of deep neural networks。Interspeech 2013，2345-2349。
12.	Wang, Y.、Peddinti, V.、Xu, H.、Zhang, X.、Povey, D.、Khudanpur, S.(2017)。Backstitch: counteracting finite-sample bias via negative steps。Interspeech 2017。
13.	He, Kaiming、Zhang, Xiangyu、Ren, Shaoqing、Sun, Jian(2016)。Deep residual learning for image recognition。2016 IEEE Conference on Computer Vision and Pattern Recognition。IEEE。770-778。

Standalone papers
1.	Sak, H.，Senior, A.，Beaufays, F.(2014)。Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition，https://arxiv.org/abs/1402.1128，(arXiv:1402.1128)。
2.	Simonyan, K.，Zisserman, A.(2014)。Very deep convolutional networks for large-scale image recognition，https://arxiv.org/abs/1409.1556，(arXiv:1409.1556)。

Other
1.	Povey, D.，Zhang, X.，Khudanpur, S.(2014)。Parallel training of DNNs with natural gradient and parameter averaging，https://arxiv.org/abs/1410.7455。
2.	Ruder, S.(2016)。An overview of gradient descent optimization algorithms，https://arxiv.org/abs/1609.04747。

Related journal articles

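1.	Spoken document summarization based on end-to-end modeling techniques (基於端對端模型化技術之語音文件摘要)
2.	Application of feature-granularity-based training strategies to Mandarin spoken question answering systems (基於特徵粒度之訓練策略於中文口語問答系統之應用)
3.	A comparison of contemporary unsupervised methods for extractive language summarization (當代非監督式方法之比較於節錄式語言摘要)
4.	A study of multi-task learning for neural network acoustic model training in meeting speech recognition (融合多任務學習類神經網路聲學模型訓練於會議語音辨識之研究)
5.	Extractive spoken document summarization using representation learning techniques (節錄式語音文件摘要使用表示法學習技術)
6.	A study on using concept information for Mandarin large vocabulary continuous speech recognition (使用概念資訊於中文大詞彙連續語音辨識之研究)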
7.	Improved Minimum Phone Error Based Discriminative Training of Acoustic Models for Mandarin Large Vocabulary Continuous Speech Recognition
8.	An Empirical Study of Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition
9.	MATBN: A Mandarin Chinese Broadcast News Corpus
