:::

詳目顯示

回上一頁
題名:The NTNU Taiwanese ASR System for Formosa Speech Recognition Challenge 2020
書刊名:International Journal of Computational Linguistics & Chinese Language Processing
作者:Chao, Fu-anLo, Tien-hongWeng, Shi-yanChiu, Shih-hsuanSung, Yao-tingChen, Berlin
出版日期:2021
卷期:26:1
頁次:頁1-16
主題關鍵詞:Formosa speech recognition challengeDeep learningTransfer learningSemi-supervised training
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:5
期刊論文
1.Gale, W. A.(1995)。Good-turing frequency estimation without tears。Journal of Quantitative Linguistics,2(3),217-237。  new window
2.Luo, Y.、Mesgarani, N.(2019)。Conv-TasNet: Surpassing ideal time-frequency magnitude masking for speech separation。IEEE/ACM Transactions on Audio, Speech, and Language Processing,27(8),1256-1266。  new window
3.Thiemann, J.、Ito, N.、Vincent, E.(2013)。The diverse environments multichannel acoustic noise database (DEMAND): A database of multichannel environmental noise recordings。The Journal of the Acoustical Society of America,133(5)。  new window
會議論文
1.Povey, D.、Ghoshal, A.、Boulianne, G.、Burget, L.、Glembek, O.、Goel, N.、Hannemann, M.、Motlicek, P.、Qian, Y.、Schwarz, P.、Silovsky, J.、Stemmer, G.、Vesely, K.(2011)。The Kaldi speech recognition toolkit。The IEEE 2011 Workshop on Automatic Speech Recognition and Understanding。  new window
2.Povey, D.、Peddinti, V.、Galvez, D.、Ghahrmani, P.、Manohar, V.、Na, X.、Wang, Y.、Khudanpur, S.(2016)。Purely sequence-trained neural networks for ASR based on lattice-free MMI。17th Annual Conference of the International Speech Communication Association,2751-2755。  new window
3.Manohar, V.、Hadian, H.、Povey, D.、Khudanpur, S.(2018)。Semi-supervised training of acoustic models using lattice-free MMI。2018 IEEE International Conference on Acoustics, Speech and Signal Processing。  new window
4.Povey, D.、Cheng, G.、Wang, Y.、Li, K.、Xu, H.、Yarmohamadi, M.、Khudanpur, S.(2018)。Semi-orthogonal low-rank matrix factorization for deep neural networks。19th Annual Conference of the International Speech Communication Association,3743-3747。  new window
5.Hadian, H.、Sameti, H.、Povey, D.、Khudanpur, S.(2018)。End-to-end speech recognition using lattice-free MMI。19th Annual Conference of the International Speech Communication Association,12-16。  new window
6.Ghahremani, P.、Manohar, V.、Hadian, H.、Povey, D.、Khudanpur, S.(2017)。Investigation of transfer learning for ASR using LF-MMI trained neural networks。2017 IEEE Automatic Speech Recognition and Understanding Workshop,279-286。  new window
7.Dean, D. B.、Sridharan, S.、Vogt, R. J.、Mason, M. W.(2010)。The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms。11th Annual Conference of the International Speech Communication Association,3110-3113。  new window
8.Chiu, S.-H.、Chen, B.(2021)。Innovative BERT-based reranking language models for speech recognition。2021 IEEE Spoken Language Technology Workshop,266-271。  new window
9.Jaitly, N.、Hinton, G. E.(2013)。Vocal tract length perturbation (VTLP) improves speech recognition。International Conference on Machine Learning。  new window
10.Kinoshita, K.、Ochiai, T.、Delcroix, M.、Nakatani, T.(2020)。Improving noise robust automatic speech recognition with single-channel time-domain enhancement network。2020 IEEE International Conference on Acoustics, Speech and Signal Processing,7009-7013。  new window
11.Ko, T.、Peddinti, V.、Povey, D.、Khudanpur, S.(2015)。Audio augmentation for speech recognition。16th Annual Conference of the International Speech Communication Association,3586-3589。  new window
12.Liao, Y.-F.、Chang, Y.-H. S.、Wang, S.-Y.、Chen, J.-W.、Wang, S.-M.、Wang, J.-H.(2017)。A progress report of the Taiwan Mandarin Radio Speech Corpus Project。2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment。  new window
13.Liao, Y.-F.、Chang, C.-Y.、Tiun, H.-K.、Su, H.-L.、Khoo, H.-L.、Tsay, J. S.、Tan, L.-K.、Kang, P.、Thiann, T.-G.、Iunn, U.-G.、Yang, J.-H.、Liang, C.-N.(2020)。Formosa Speech Recognition Challenge 2020 and Taiwanese Across Taiwan Corpus。23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques,65-70。  new window
14.Park, D. S.、Chan, W.、Zhang, Y.、Chiu, C.-C.、Zoph, B.、Cubuk, E. D.、Le, Q. V.(2019)。SpecAugment: A simple data augmentation method for automatic speech recognition。The 20th Annual Conference of the International Speech Communication Association,2613-2617。  new window
15.Ney, H.、Essen, U.(1991)。On smoothing techniques for bigram-based natural language modelling。1991 International Conference on Acoustics, Speech, and Signal Processing,825-828。  new window
16.Lo, T.-H.、Chen, B.(2019)。Semi-supervised training of acoustic models leveraging knowledge transferred from out-of-domain data。2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference,1400-1404。  new window
17.Saki, F.、Sehgal, A.、Panahi, I.、Kehtarnavaz, N.(2016)。Smart phone-based real-time classification of noise signals using subband features and random forest classifier。2016 IEEE International Conference on Acoustics, Speech and Signal Processing,2204-2208。  new window
18.Saki, F.、Kehtarnavaz, N.(2016)。Automatic switching between noise classification and speech enhancement for hearing aid devices。2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society,736-739。  new window
19.Stolcke, A.(2002)。SRILM--an extensible language modeling toolkit。2002 IEEE International Conference on Acoustics, Speech, and Signal Processing,901-904。  new window
20.Xu, H.、Li, K.、Wang, Y.、Wang, J.、Kang, S.、Chen, X.、Povey, D.、Khudanpur, S.(2018)。Neural network language modeling with letter-based features and importance sampling。2018 IEEE International Conference on Acoustics, Speech and Signal Processing,6109-6113。  new window
單篇論文
1.Snyder, D.,Chen, G.,Povey, D.(2015)。MUSAN: A music, speech, and noise corpus(1510.08484)。  new window
2.Wang, C.,Li, M.,Smola, A. J.(2019)。Language models with transformers(1904.09408)。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top