Journal Articles
| 1. | Gale, W. A. (1995). Good-Turing frequency estimation without tears. Journal of Quantitative Linguistics, 2(3), 217-237. |
| 2. | Luo, Y., Mesgarani, N. (2019). Conv-TasNet: Surpassing ideal time-frequency magnitude masking for speech separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(8), 1256-1266. |
| 3. | Thiemann, J., Ito, N., Vincent, E. (2013). The diverse environments multichannel acoustic noise database (DEMAND): A database of multichannel environmental noise recordings. The Journal of the Acoustical Society of America, 133(5). |

Conference Papers
| 1. | Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Hannemann, M., Motlicek, P., Qian, Y., Schwarz, P., Silovsky, J., Stemmer, G., Vesely, K. (2011). The Kaldi speech recognition toolkit. The IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. |
| 2. | Povey, D., Peddinti, V., Galvez, D., Ghahremani, P., Manohar, V., Na, X., Wang, Y., Khudanpur, S. (2016). Purely sequence-trained neural networks for ASR based on lattice-free MMI. 17th Annual Conference of the International Speech Communication Association, 2751-2755. |
| 3. | Manohar, V., Hadian, H., Povey, D., Khudanpur, S. (2018). Semi-supervised training of acoustic models using lattice-free MMI. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing. |
| 4. | Povey, D., Cheng, G., Wang, Y., Li, K., Xu, H., Yarmohammadi, M., Khudanpur, S. (2018). Semi-orthogonal low-rank matrix factorization for deep neural networks. 19th Annual Conference of the International Speech Communication Association, 3743-3747. |
| 5. | Hadian, H., Sameti, H., Povey, D., Khudanpur, S. (2018). End-to-end speech recognition using lattice-free MMI. 19th Annual Conference of the International Speech Communication Association, 12-16. |
| 6. | Ghahremani, P., Manohar, V., Hadian, H., Povey, D., Khudanpur, S. (2017). Investigation of transfer learning for ASR using LF-MMI trained neural networks. 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 279-286. |
| 7. | Dean, D. B., Sridharan, S., Vogt, R. J., Mason, M. W. (2010). The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. 11th Annual Conference of the International Speech Communication Association, 3110-3113. |
| 8. | Chiu, S.-H., Chen, B. (2021). Innovative BERT-based reranking language models for speech recognition. 2021 IEEE Spoken Language Technology Workshop, 266-271. |
| 9. | Jaitly, N., Hinton, G. E. (2013). Vocal tract length perturbation (VTLP) improves speech recognition. International Conference on Machine Learning. |
| 10. | Kinoshita, K., Ochiai, T., Delcroix, M., Nakatani, T. (2020). Improving noise robust automatic speech recognition with single-channel time-domain enhancement network. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 7009-7013. |
| 11. | Ko, T., Peddinti, V., Povey, D., Khudanpur, S. (2015). Audio augmentation for speech recognition. 16th Annual Conference of the International Speech Communication Association, 3586-3589. |
| 12. | Liao, Y.-F., Chang, Y.-H. S., Wang, S.-Y., Chen, J.-W., Wang, S.-M., Wang, J.-H. (2017). A progress report of the Taiwan Mandarin Radio Speech Corpus Project. 2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment. |
| 13. | Liao, Y.-F., Chang, C.-Y., Tiun, H.-K., Su, H.-L., Khoo, H.-L., Tsay, J. S., Tan, L.-K., Kang, P., Thiann, T.-G., Iunn, U.-G., Yang, J.-H., Liang, C.-N. (2020). Formosa Speech Recognition Challenge 2020 and Taiwanese Across Taiwan Corpus. 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 65-70. |
| 14. | Park, D. S., Chan, W., Zhang, Y., Chiu, C.-C., Zoph, B., Cubuk, E. D., Le, Q. V. (2019). SpecAugment: A simple data augmentation method for automatic speech recognition. The 20th Annual Conference of the International Speech Communication Association, 2613-2617. |
| 15. | Ney, H., Essen, U. (1991). On smoothing techniques for bigram-based natural language modelling. 1991 International Conference on Acoustics, Speech, and Signal Processing, 825-828. |
| 16. | Lo, T.-H., Chen, B. (2019). Semi-supervised training of acoustic models leveraging knowledge transferred from out-of-domain data. 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 1400-1404. |
| 17. | Saki, F., Sehgal, A., Panahi, I., Kehtarnavaz, N. (2016). Smart phone-based real-time classification of noise signals using subband features and random forest classifier. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2204-2208. |
| 18. | Saki, F., Kehtarnavaz, N. (2016). Automatic switching between noise classification and speech enhancement for hearing aid devices. 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 736-739. |
| 19. | Stolcke, A. (2002). SRILM - an extensible language modeling toolkit. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, 901-904. |
| 20. | Xu, H., Li, K., Wang, Y., Wang, J., Kang, S., Chen, X., Povey, D., Khudanpur, S. (2018). Neural network language modeling with letter-based features and importance sampling. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 6109-6113. |

Individual Papers
| 1. | Snyder, D., Chen, G., Povey, D. (2015). MUSAN: A music, speech, and noise corpus (arXiv:1510.08484). |
| 2. | Wang, C., Li, M., Smola, A. J. (2019). Language models with transformers (arXiv:1904.09408). |