A Study of Named Entity Classification in the People's Daily (《人民日報》) Corpus__Taiwan Citation Index - Humanities and Social Sciences

Title:	A Study of Named Entity Classification in the People's Daily (《人民日報》) Corpus
Journal:	International Journal of Computational Linguistics & Chinese Language Processing
Authors:	夏迎炬／于浩／西野文人
Authors (romanized):	Xia, Ying Ju／Yu, Hao／Nishino, Fumihito
Publication date:	2005
Volume:Issue:	10:4
Pages:	pp. 533-542
Keywords:	Named entity; Classification; Corpus; Natural language processing

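Handling named entities is very important in applications such as information retrieval and information extraction. Building on existing named entity classification schemes, this paper studies the fine-grained classification of named entities in depth from the perspective of information retrieval and extraction. It proposes a multi-level classification of named entities and gives the detailed classes at each level. To test the practical effect of this scheme, we carried out a preliminary annotation of the People's Daily corpus and ran a series of comparative experiments on it using commonly used named entity recognition algorithms based on statistical models. The results show that a fine-grained classification designed for machine processing can effectively improve the performance of the recognition system and ultimately benefits information retrieval and extraction.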


Named entity recognition is an important part of information retrieval and information extraction, and so is the classification of the recognized entities. This paper investigates the sub-classification of named entities from the point of view of information retrieval and information extraction. It also presents a multi-level classification and gives detailed information about each sub-class. We manually annotated the People's Daily corpus (1998) and conducted a series of experiments using a statistical model of named entity recognition. The experimental results show that the sub-classes presented in this paper can enhance the recognition system's performance and aid information retrieval and information extraction.
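To make the idea of a multi-level classification concrete, here is a minimal sketch. It is not the authors' scheme: the abstract does not list the actual classes, so the dotted labels (`LOC.city`, `ORG.company`, and so on), the sample annotations, and the helper names `collapse` and `label_counts` are all hypothetical. The sketch only shows how fine-grained labels stored as paths can be truncated to a coarser level, so that one annotation pass can serve experiments at several granularities.

```python
from collections import Counter


def collapse(label: str, level: int) -> str:
    """Truncate a dotted hierarchical label to its first `level` components."""
    if level < 1:
        raise ValueError("level must be >= 1")
    return ".".join(label.split(".")[:level])


def label_counts(labels, level: int) -> Counter:
    """Count labels after collapsing each one to the requested level."""
    return Counter(collapse(label, level) for label in labels)


# Hypothetical annotations: (entity text, fine-grained label).
# These classes are invented for illustration only.
annotations = [
    ("北京", "LOC.city"),
    ("人民日報", "ORG.media"),
    ("富士通", "ORG.company"),
    ("長江", "LOC.river"),
]

fine = label_counts((label for _, label in annotations), level=2)
coarse = label_counts((label for _, label in annotations), level=1)
```

Storing the full path in each label means the coarse view is derived rather than annotated separately, which keeps the two levels consistent by construction.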


Journal articles
1.	段慧明, 松井久仁于, 徐國偉, 胡國昕, 俞士汶 (2002). Construction and use of a large-scale annotated Chinese corpus [大規模漢語標注語料庫的製作與使用]. 語言文字應用, 2, 72-77.

Theses and dissertations
1.	Borthwick, A. (1999). A Maximum Entropy Approach to Named Entity Recognition. New York.

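Books
1.	俞士汶, 朱學鋒, 王惠, 張芸芸 (1998). A Detailed Explanation of the Grammatical Information Dictionary of Contemporary Chinese [現代漢語語法信息詞典詳解]. Beijing.
2.	黃昌寧, 李涓子 (2002). Corpus Linguistics [語料庫語言學]. Beijing: The Commercial Press.
3.	Aberdeen, J., Day, D., Hirschman, L., Robinson, P., Vilain, M. (1995). MITRE: Description of the Alembic System Used for MUC-6. Proceedings of the Sixth Message Understanding Conference (MUC-6). San Francisco.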

Other
1.	Sekine, S. (1998). A decision tree method for finding and classifying names in Japanese texts. Montreal, Canada.
2.	Sun, J., Gao, J., Zhang, L., Zhou, M., Huang, C. (2002). Chinese Named Entity Identification Using Class-based Language Model.
3.	馮志偉 (2001). The history and current state of corpus research in China [中國語料庫研究的歷史和現狀].
4.	俞士汶, 朱學鋒, 段慧明 (2000). Processing specification for a large-scale annotated corpus of contemporary Chinese [大規模現代漢語標注語料庫的加工規範].
5.	Bikel, D. M., Miller, S., Schwartz, R., Weischedel, R. (1997). Nymble: a high-performance learning name-finder. Washington, D.C.
6.	Yi'an, Wu, Zhao, J., Xu, B. (2003). Chinese Named Entity Recognition Combining Statistical Model with Human Knowledge. Japan.
