| 期刊論文1. | Watanabe, S.、Hori, T.、Kim, S.、Hershey, J. R.、Hayash, T.(2017)。Hybrid CTC/attention architecture for end-to-end speech recognition。IEEE Journal of Selected Topics in Signal Processing,11(8),1240-1253。 | 2. | Elman, J. L.(1990)。Finding structure in time。Cognitive science,14(2),179-211。 | 3. | Hochreiter, Sepp、Schmidhuber, Jürgen(1997)。Long Short-term Memory。Neural Computation,9(8),1735-1780。 | 會議論文1. | Fiscus, Jonathan G.(1997)。A post-processing system to yield reduced word error rates: recognizer output voting error reduction (ROVER)。1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings。Santa Barbara, California。347-354。 | 2. | Bahl, L.、Brown, P.、de Souza, P.、Mercer, R.(1986)。Maximum mutual information estimation of hidden markov model parameters for speech recognition。IEEE International Conference on Acoustics, Speech, and Signal Processing,49-52。 | 3. | Liao, Y.-F.、Chang, C.-Y.、Tiun, H.-K.、Su, H.-L.、Khoo, H.-L.、Tsay, J. S.、Tan, L.-K.、Kang, P.、Thiann, T.-G.、Iunn, U.-G.、Yang, J.-H.、Liang, C.-N.(2020)。Formosa Speech Recognition Challenge 2020 and Taiwanese Across Taiwan Corpus。23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques,65-70。 | 4. | Dong, L.、Xu, S.、Xu, B.(2018)。Speech-transformer: a no-recurrence sequence-to-sequence model for speech recognition。2018 IEEE International Conference on Acoustics, Speech and Signal,5884-5888。 | 5. | Vaswani, Ashish、Shazeer, Noam、Parmar, Niki、Uszkoreit, Jakob、Jones, Llion、Gomez, Aidan N.、Kaiser, L.、Kaiser, Łukasz、Polosukhin, Illia(2017)。Attention is all you need。31st Annual Conference on Neural Information Processing Systems,5998-6008。 | 單篇論文1. | Cho, K.,van Merrienboer, B.,Gulcehre, C.,Bahdanau, D.,Bougares, F.,Schwenk, H.,Bengio, Y.(2014)。Learning phrase representations using RNN encoder-decoder for statistical machine translation(1406.1078)。 | 2. | Gulati, A.,Qin, J.,Chiu, C.-C.,Parmar, N.,Zhang, Y.,Yu, J.,Han, W.,Wang, S.,Zhang, Z.,Wu, Y.,Pang, R.(2020)。Conformer: Convolution-augmented transformer for speech recognition(2005.08100)。 | 3. | Watanabe, S.,Hori, T.,Karita, S.,Hayashi, T.,Nishitoba, J.,Unno, Y.,Soplin, N. E. Y.,Heymann, J.,Wiesner, M.,Chen, N.,Renduchintala, A.,Ochiai, T.(2018)。Espnet: End-to-end speech processing toolkit(1804.00015)。 | |