| 期刊論文1. | Rabiner, Lawrence R.(1989)。A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition。Proceedings of the IEEE,77(2),257-286。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 2. | Wang, Hsin-min、Chen, Berlin、Kuo, Jen-wei、Cheng, Shih-sian(20050600)。MATBN: A Mandarin Chinese Broadcast News Corpus。International Journal of Computational Linguistics & Chinese Language Processing,10(2),219-235。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 3. | Hinton, Geoffrey E.、Li, Deng、Dong, Yu、Dahl, George E.、Mohamed, Abdel-rahman、Jaitly, Navdeep、Kingsbury, Brian(2012)。Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups。IEEE Signal processing magazine,29(6),82-97。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 4. | Waibel, A.、Hanazawa, T.、Hinton, G.、Shikano, K.、Lang, K. J.(1989)。Phoneme recognition using time-delay neural networks。IEEE Transactions on Acoustics, Speech, and Signal Processings,37(3),328-339。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 5. | Gales, M.、Yang, S.(2008)。The application of hidden markov models in speech recognition。Foundations and Trends® in Signal Processing,1(3),195-304。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 6. | Pascanu, R.、Mikolov, T.、Bengio, Y.(2013)。On the difficulty of training recurrent neural networks。Proceedings of ICML 2013,28,1310-1318。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 會議論文1. | Stolcke, A.(2002)。SRILM--An Extensible Language Modeling Toolkit。International Conference on Spoken Language Processing,901-904。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 2. | Povey, D.、Ghoshal, A.、Boulianne, G.、Burget, L.、Glembek, O.、Goel, N.、Hannemann, M.、Motlicek, P.、Qian, Y.、Schwarz, P.、Silovsky, J.、Stemmer, G.、Vesely, K.(2011)。The Kaldi speech recognition toolkit。The IEEE 2011 Workshop on Automatic Speech Recognition and Understanding。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 3. | Graves, Alex、Mohamed, Abdel-Rahman、Hinton, Geoffrey E.(2013)。Speech recognition with deep recurrent neural networks。The 2013 IEEE International Conference on Acoustics, Speech and Signal Processing。IEEE。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 4. | Povey, D.、Peddinti, V.、Galvez, D.、Ghahrmani, P.、Manohar, V.、Na, X.、Wang, Y.、Khudanpur, S.(2016)。Purely sequence-trained neural networks for ASR based on lattice-free MMI。17th Annual Conference of the International Speech Communication Association,2751-2755。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 5. | Bahl, L.、Brown, P.、de Souza, P.、Mercer, R.(1986)。Maximum mutual information estimation of hidden markov model parameters for speech recognition。IEEE International Conference on Acoustics, Speech, and Signal Processing,49-52。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 6. | Graves, A.、Fernández, S.、Gomez, F.、Schmidhuber, J.(2006)。Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks。ICML 2006,369-376。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 7. | Ba, J.、Rich, C.(2014)。Do deep nets really need to be deep?。NIPS 2014,2654-2662。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 8. | Povey, D.、Cheng, G.、Wang, Y.、Li, K.、Xu, H.、Yarmohamadi, M.、Khudanpur, S.(2018)。Semi-orthogonal low-rank matrix factorization for deep neural networks。19th Annual Conference of the International Speech Communication Association,3743-3747。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 9. | Hadian, H.、Sameti, H.、Povey, D.、Khudanpur, S.(2018)。End-to-end speech recognition using lattice-free MMI。19th Annual Conference of the International Speech Communication Association,12-16。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 10. | Povey, D.、Hadian, H.、Ghahremani, P.、Li, K.、Khudanpur, S.(2018)。A time-restricted self-attention layer for ASR。IEEE International Conference on Acoustics, Speech and Signal Processing,5874-5878。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 11. | Veselý, K.、Ghoshal, A.、Burget, L.、Povey, D.(2013)。Sequence-discriminative training of deep neural networks。Interspeech 2013,2345-2349。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 12. | Wang, Y.、Peddinti, V.、Xu, H.、Zhang, X.、Povey, D.、Khudanpur, S.(2017)。Backstitch: counteracting finite-sample bias via negative steps。Interspeech 2017。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 13. | He, Kaiming、Zhang, Xiangyu、Ren, Shaoqing、Sun, Jian(2016)。Deep residual learning for image recognition。2016 IEEE Conference on Computer Vision and Pattern Recognition。IEEE。770-778。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 單篇論文1. | Sak, H.,Senior, A.,Beaufays, F.(2014)。Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition,https://arxiv.org/abs/1402.1128,(arXiv:1402.1128)。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 2. | Simonyan, K.,Zisserman, A.(2014)。Very deep convolutional networks for large-scale image recognition,https://arxiv.org/abs/1409.1556,(1409.1556)。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 其他1. | Povey, D.,Zhang, X.,Khudanpur, S.(2014)。Parallel training of DNNs with natural gradient and parameter averaging,https://arxiv.org/abs/1410.7455。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | 2. | Ruder, S.(2016)。An overview of gradient descent optimization algorithms,https://arxiv.org/abs/1609.04747。 ![](/gs32/thssjcncl/image/nclsfx.gif) ![new window](/gs32/images/newin.png) | |