:::

詳目顯示

回上一頁
題名:結合鑑別式訓練聲學模型之類神經網路架構及優化方法的改進
書刊名:International Journal of Computational Linguistics & Chinese Language Processing
作者:趙偉成張修瑞羅天宏陳柏琳
作者(外文):Chao, Wei-chengChang, Hsiu-juiLo, Tien-hongChen, Berlin
出版日期:2018
卷期:23:2
頁次:頁35-46
主題關鍵詞:中文大詞彙連續語音辨識聲學模型鑑別式訓練矩陣分解來回針法Mandarin large vocabulary continuous speech recognitionAcoustic modelDiscriminative trainingMatrix factorizationBackstitch
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:8
  • 點閱點閱:0
期刊論文
1.Rabiner, Lawrence R.(1989)。A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition。Proceedings of the IEEE,77(2),257-286。  new window
2.Wang, Hsin-min、Chen, Berlin、Kuo, Jen-wei、Cheng, Shih-sian(20050600)。MATBN: A Mandarin Chinese Broadcast News Corpus。International Journal of Computational Linguistics & Chinese Language Processing,10(2),219-235。new window  new window
3.Hinton, Geoffrey E.、Li, Deng、Dong, Yu、Dahl, George E.、Mohamed, Abdel-rahman、Jaitly, Navdeep、Kingsbury, Brian(2012)。Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups。IEEE Signal processing magazine,29(6),82-97。  new window
4.Waibel, A.、Hanazawa, T.、Hinton, G.、Shikano, K.、Lang, K. J.(1989)。Phoneme recognition using time-delay neural networks。IEEE Transactions on Acoustics, Speech, and Signal Processings,37(3),328-339。  new window
5.Gales, M.、Yang, S.(2008)。The application of hidden markov models in speech recognition。Foundations and Trends® in Signal Processing,1(3),195-304。  new window
6.Pascanu, R.、Mikolov, T.、Bengio, Y.(2013)。On the difficulty of training recurrent neural networks。Proceedings of ICML 2013,28,1310-1318。  new window
會議論文
1.Stolcke, A.(2002)。SRILM--An Extensible Language Modeling Toolkit。International Conference on Spoken Language Processing,901-904。  new window
2.Povey, D.、Ghoshal, A.、Boulianne, G.、Burget, L.、Glembek, O.、Goel, N.、Hannemann, M.、Motlicek, P.、Qian, Y.、Schwarz, P.、Silovsky, J.、Stemmer, G.、Vesely, K.(2011)。The Kaldi speech recognition toolkit。The IEEE 2011 Workshop on Automatic Speech Recognition and Understanding。  new window
3.Graves, Alex、Mohamed, Abdel-Rahman、Hinton, Geoffrey E.(2013)。Speech recognition with deep recurrent neural networks。The 2013 IEEE International Conference on Acoustics, Speech and Signal Processing。IEEE。  new window
4.Povey, D.、Peddinti, V.、Galvez, D.、Ghahrmani, P.、Manohar, V.、Na, X.、Wang, Y.、Khudanpur, S.(2016)。Purely sequence-trained neural networks for ASR based on lattice-free MMI。17th Annual Conference of the International Speech Communication Association,2751-2755。  new window
5.Bahl, L.、Brown, P.、de Souza, P.、Mercer, R.(1986)。Maximum mutual information estimation of hidden markov model parameters for speech recognition。IEEE International Conference on Acoustics, Speech, and Signal Processing,49-52。  new window
6.Graves, A.、Fernández, S.、Gomez, F.、Schmidhuber, J.(2006)。Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks。ICML 2006,369-376。  new window
7.Ba, J.、Rich, C.(2014)。Do deep nets really need to be deep?。NIPS 2014,2654-2662。  new window
8.Povey, D.、Cheng, G.、Wang, Y.、Li, K.、Xu, H.、Yarmohamadi, M.、Khudanpur, S.(2018)。Semi-orthogonal low-rank matrix factorization for deep neural networks。19th Annual Conference of the International Speech Communication Association,3743-3747。  new window
9.Hadian, H.、Sameti, H.、Povey, D.、Khudanpur, S.(2018)。End-to-end speech recognition using lattice-free MMI。19th Annual Conference of the International Speech Communication Association,12-16。  new window
10.Povey, D.、Hadian, H.、Ghahremani, P.、Li, K.、Khudanpur, S.(2018)。A time-restricted self-attention layer for ASR。IEEE International Conference on Acoustics, Speech and Signal Processing,5874-5878。  new window
11.Veselý, K.、Ghoshal, A.、Burget, L.、Povey, D.(2013)。Sequence-discriminative training of deep neural networks。Interspeech 2013,2345-2349。  new window
12.Wang, Y.、Peddinti, V.、Xu, H.、Zhang, X.、Povey, D.、Khudanpur, S.(2017)。Backstitch: counteracting finite-sample bias via negative steps。Interspeech 2017。  new window
13.He, Kaiming、Zhang, Xiangyu、Ren, Shaoqing、Sun, Jian(2016)。Deep residual learning for image recognition。2016 IEEE Conference on Computer Vision and Pattern Recognition。IEEE。770-778。  new window
單篇論文
1.Sak, H.,Senior, A.,Beaufays, F.(2014)。Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition,https://arxiv.org/abs/1402.1128,(arXiv:1402.1128)。  new window
2.Simonyan, K.,Zisserman, A.(2014)。Very deep convolutional networks for large-scale image recognition,https://arxiv.org/abs/1409.1556,(1409.1556)。  new window
其他
1.Povey, D.,Zhang, X.,Khudanpur, S.(2014)。Parallel training of DNNs with natural gradient and parameter averaging,https://arxiv.org/abs/1410.7455。  new window
2.Ruder, S.(2016)。An overview of gradient descent optimization algorithms,https://arxiv.org/abs/1609.04747。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top