資料載入處理中...
臺灣人文及社會科學引文索引資料庫系統
:::
網站導覽
國圖首頁
聯絡我們
操作說明
English
行動版
(18.227.48.82)
登入
字型:
**字體大小變更功能,需開啟瀏覽器的JAVASCRIPT,如您的瀏覽器不支援,
IE6請利用鍵盤按住ALT鍵 + V → X → (G)最大(L)較大(M)中(S)較小(A)小,來選擇適合您的文字大小,
如為IE7以上、Firefoxy或Chrome瀏覽器則可利用鍵盤 Ctrl + (+)放大 (-)縮小來改變字型大小。
來源文獻查詢
引文查詢
瀏覽查詢
作者權威檔
引用/點閱統計
我的研究室
資料庫說明
相關網站
來源文獻查詢
/
簡易查詢
/
查詢結果列表
/
詳目列表
:::
詳目顯示
第 1 筆 / 總合 1 筆
/1
頁
來源文獻資料
外文摘要
引文資料
題名:
Pitch Marking Based on an Adaptable Filter and a Peak-Valley Estimation Method
書刊名:
International Journal of Computational Linguistics & Chinese Language Processing
作者:
Chen,Jau-hung
/
Kao,Yung-an
出版日期:
2001
卷期:
6:2
頁次:
頁31-42
原始連結:
連回原系統網址
相關次數:
被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
排除自我引用:0
共同引用:
3
點閱:10
In a text-to-speech (TTS) conversion system based on the time-domain pitch-synchronous overlap-add (TD-PSOLA) method, accurate estimation of pitch periods and pitch marks is necessary for pitch modification to assure optimal quality of synthetic speech. In general, there are two major tasks in pitch marking: pitch detection and location determination. In this paper, an adaptable filter, which serves as a bandpass filter, is proposed for use in pitch detection to transform voiced speech into a sine-like wave. The pass band of the adaptable filter can be adapted based on the fundamental frequency. Based on the sine-like wave, a peak-valley decision method is proposed to determine the appropriate parts (positive part and negative part) of voiced speech for use in pitch mark estimation. In each pitch period, two possible peaks/valleys are searched, and dynamic programming is performed to obtain pitch marks. Experimental results indicate that our proposed method performs very well if correct pitch information is estimated.
以文找文
期刊論文
1.
Shih,Chilin、Sproat,Richard(19960800)。Issues in Text-to-Speech Conversion for Mandarin。International Journal of Computational Linguistics & Chinese Language Processing,1:1,頁37-86。
2.
Markel, John D.(1972)。The sift algorithm for fundamental frequency estimation。IEEE Transactions on Audio and Electroacoustics,20,367-377。
3.
Iwahashi, N.、Sagisaka, Yoshinori(1995)。Speech segment network approach for optimization of synthesis unit set。Computer Speech and Language,335-352。
4.
陳順孝、Hwang, S. H.、Wang, Y. R.(1998)。An RNN-based prosodic information Synthesizer for Mandarin text-to-speech。IEEE Trans. Speech and Audio Proc.,6(3),226-239。
5.
Rabiner, L. R.、Chen, Ming-Jun、Rosenberg, A. E.、McGonegal, C. A.(1976)。A Comparative performance study of several pitch detection algorithms。IEEE transactions on acoustics, speech, and signal processing,24,399-417。
6.
Rabiner, Lawrence R.(1977)。On the use of autocorrelation analysis for pitch detection。IEEE Transactions on Acoustics, Speech and Signal Processing,25,24-33。
7.
Noll, A. M.(1967)。Cepstrum pitch determination。The Journal of the Acoustical Society of America,47,293-309。
8.
Barnard, E.、Cole, R. A.、Vea, M. P.、Alleva, F. A.(1991)。Pitch detection with a neural-net classifier。IEEE Trans. Signal Proc.,39(2),298-307。
9.
Barner, K. E.(2000)。Colored L-1 filters and their application in speech pitch detection。IEEE Trans. Signal Proc.,48(9),2601-2606。
10.
Kobayashi, M.、Sakamoto, M.、Hashimoto, Y.、Nishimura, Masanari、Suzuki, K.(1998)。Wavelet analysis used in text-to-speech synthesis。IEEE Transactions on Circuists and Systems-II, Analog and Digital Signal Processing,45(8),1125-1129。
會議論文
1.
Hamon, C.、Moulines, E.、Charpentier, F.(1989)。A diphone synthesis based on time-domain prosodic modifications of speech。New York。238-241。
2.
Chou, F. C.、Tseng, C. Y.(1998)。Corpus-based Mandarin speech synthesis with contextual syllabic units based on phonetic properties。New York。893-896。
3.
Charpentier, F. J.、Stella, M. G.(1986)。Diphone synthesis using an overlap-add technique for speech waveforms concatenation。New York。2015-2020。
4.
Huang, H.、Seide, F.(2000)。Pitch tracking and tone features for Mandarin speech recognition。New York。1523-1526。
5.
Moulines, E.、Emerard, F.、Larreur, D.、Milon, J. L. Le Saint、Faucheur, L. Le、Marty, F.、Charpentier, F.、Sorin, C.(1990)。A real-time French text-to-speech system generating high-quality synthetic speech。New York。309-312。
推文
當script無法執行時可按︰
推文
推薦
當script無法執行時可按︰
推薦
引用網址
當script無法執行時可按︰
引用網址
引用嵌入語法
當script無法執行時可按︰
引用嵌入語法
轉寄
當script無法執行時可按︰
轉寄
top
:::
相關期刊
相關論文
相關專書
相關著作
熱門點閱
1.
The Polysemy Problem, an Important Issue in a Chinese to Taiwanese TTS System
2.
The Prediction of Pronunciation of Polyphonic Words in a Chinese to Taiwanese TTS System
3.
A System Framework for Integrated Synthesis of Mandarin, Min-Nan, and Hakka Speech
4.
Issues in Text-to-Speech Conversion for Mandarin
無相關博士論文
無相關書籍
無相關著作
無相關點閱
QR Code