:::

詳目顯示

回上一頁
題名:電子文獻主題之自動辨識
書刊名:中國圖書館學會會報
作者:陳光華 引用關係
作者(外文):Cheng, Kuang-hua
出版日期:1997
卷期:59
頁次:頁43-58
主題關鍵詞:資訊檢索電子文獻主題辨識Information retrievalElectronic documentTopic identification
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(2) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:1
  • 共同引用共同引用:38
  • 點閱點閱:22
     網際網路上的電子文件數量極為龐大, 如何快速有效的進行電子文件主題標引的 工作逐漸成為一項重要的研究課題。目前有關的研究著重於名詞的行為,期望藉由文獻中名 詞的頻率或是其他統計值,求得文獻的主題分類。雖然文獻的主題是由名詞組成,但是本文 認為決定那些名詞成為主題的因素卻不只是名詞。因為文獻的組織是具有結構性的,是事件 驅動( Event-Driven )的,而事件則是由名詞與動詞共同完成的,名詞與動詞在決定文獻 的過程中具有重要地位。本論文考慮文獻的一般行為,提出四項因素:(1) 詞彙的重要性, (2) 詞彙的重複性, (3) 詞彙的共現性, (4) 詞彙的距離,建構一個數學模型並進行讀者 與模型的比較實驗。實驗結果顯示該模型的自動主題辨識與人工主題辨識具有相當的效能。
     The volume of electronic decuments in the Internet grows very quickly. How to effectively assign topics to documents becomes an important issue. In the present time, the researches based on this line focus on the behavior of nounts in documents. Although topics are composed of nounts, the constituents that determine which nouns are topics are not only nouns. We think that texts are well-organized and are event-driven. Therefore, nouns and verbs together contribute the process of topic identification. This paper considers four factors: (1) word importance, (2) word frequency, (3) word co-occurrence, and (4) word distance and constructs a mathematical model. The preliminary experiments show that the performance of the proposed model is equivalent to that of human being.
期刊論文
1.Grosz, B. J.、Sidner, C. L.(1986)。Attention, Intentions, and the Structure of Discourse。Computational Linguistics,12(3),175-204。  new window
2.Spärck-Jones, Karen(1972)。A statistical interpretation of term specificity and its application in retrieval。Journal of Documentation,28(1),11-21。  new window
3.Church, Kenneth Ward、Hanks, Patrick(1990)。Word Association Norms, Mutual Information, and Lexicography。Computational Linguistics,16(1),22-29。  new window
4.Salton, G.、Yang, C. S.(1973)。On the Specification of Term Values in Automatic Indexing。The Journal of Documentation,29(4),351-372。  new window
5.Salton, G.、Yang, C. S.、Yu, C. T.(1975)。A Theory of Term Importance in Automatic Text Analysis。Journal of the American Society for Information Science,26(1),33-44。  new window
6.Youmans, G.(1991)。A New Tool for Discourse Analysis: The Vocabulary-Management Profile。Language,67,763-789。  new window
會議論文
1.陳光華(1995)。Topic Identification in Discourse。The 7th Conference of the European Chapter of Association for Computational Linguistics。San Francisco, CA:Morgan Kaufmann Publishers。267-271。  new window
研究報告
1.中央研究院詞庫小組(1995)。中央研究院平衡語料庫的內容與說明。Taipei, R.O.C.。  延伸查詢new window
圖書
1.國立中央圖書館(1993)。中文圖書標題表。中文圖書標題表。臺北。  延伸查詢new window
2.中國機讀編目格式修訂小組(1997)。中國機讀編目格式。台北:國家圖書館。  延伸查詢new window
3.陳雪華(19960000)。圖書館與網路資源。臺北:文華。new window  延伸查詢new window
4.Witten, I. H.、Moffat, A.、Bell, Timothy(1995)。Compression and Full-Text Indexing for Digital Libraries。Digital Libraries: Current Issues。Berlin,。  new window
5.Library of Congress(1997)。Library of Congress subject headings。Library of Congress subject headings。Washington。  new window
6.Carolyn, Ann Reid、McKinnon, Emma Jean(1992)。MeSH for searchers。MeSH for searchers。Chicago。  new window
7.Belkin, Nicholas J.(1994)。Tutorial for Information Retrieval: Information Retrieval as Interaction。Tutorial for Information Retrieval: Information Retrieval as Interaction。  new window
8.Rocchio, J. J.(1971)。Relevance Feedback in Information Retrieval。The SMART System-Experiments in Automatic Document Processing。New Jersey。  new window
9.Ide, E.(1971)。New Experiments in Relevance Feedback。The SMART System-Experiments in Automatic Document Processing。New Jersey。  new window
10.Kamp, H.(1981)。A Theory of Turth and Semanitc Representation。Formal Methods in the Study of Language〈1〉。Amsterdam。  new window
11.Hodge, Gail(1992)。Automated Support to Indexing。Automated Support to Indexing。Philadelphia。  new window
單篇論文
1.Weibel, Stuart L.,Godby, Jean,Eric, Miller(1995)。OCLC/NCSA Metadata Workshop Report,http://www.oclc.org/oclc/research/conferences/Metadata/dublin_core_report.html。  new window
其他
1.FGDC(1994)。Content standards for digital geospatial metadata -- FGDC。  new window
2.Salton, G.(1975)。A Theory of Indexing,Philadelphia, PA。  new window
3.Hearst, M.,Plaunt, C.(1993)。Subtopic Structuring for Full-Length Document Access,New York。  new window
4.Reynar, J.(1994)。An Automatic Method of Finding Topic Boundaries。  new window
5.陳光華,陳信希(1994)。Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and Its Automatic Evaluation。  new window
6.陳信希,Lee, J. C.(1996)。Identification and Classification of Proper Nouns in Chinese Texts。  new window
7.陳信希,邊國維(1997)。Proper Name Extraction from Web Pages for Finding People in Internet,Taipei。  new window
8.Consortium for the Interchange of Museum Information (CIMI)。  new window
圖書論文
1.Frawley, W. J.、Piatetsky-Shapiro, G.(1991)。Knowledge Discovery in Databases。Knowledge Discovery in Databases。Menlo Park。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
:::
無相關著作
 
無相關點閱
 
QR Code
QRCODE