Automatic Identification of Topics in Electronic Documents__Taiwan Citation Index - Humanities and Social Sciences (TCI-HSS)

:::


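Title:	Automatic Identification of Topics in Electronic Documents (電子文獻主題之自動辨識)
Journal:	Bulletin of the Library Association of China (中國圖書館學會會報)
Author:	陳光華
Author (romanized):	Chen, Kuang-hua
Publication date:	1997
Volume/issue:	59
Pages:	43-58
Subject keywords:	Information retrieval; Electronic document; Topic identification
Citation counts:	Times cited: journal articles (2), doctoral dissertations (0), books (0), book chapters (0); excluding self-citations: 1; co-citations: 38; views: 23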

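Abstract:	The number of electronic documents on the Internet is enormous, and indexing their topics quickly and effectively has become an important research problem. Current research focuses on the behavior of nouns, seeking to derive a document's topic classification from noun frequencies or other statistics. Although a document's topics are composed of nouns, this paper argues that the factors determining which nouns become topics are not limited to nouns. Documents are structurally organized and event-driven, and events are carried out by nouns and verbs together, so both nouns and verbs play an important role in determining a document's topics. Considering the general behavior of documents, this paper proposes four factors, (1) word importance, (2) word repetition, (3) word co-occurrence, and (4) word distance, builds a mathematical model from them, and runs an experiment comparing human readers with the model. The results show that the model's automatic topic identification performs comparably to manual topic identification.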


English abstract:	The volume of electronic documents on the Internet is growing very quickly, and how to assign topics to documents effectively has become an important issue. At present, research along this line focuses on the behavior of nouns in documents. Although topics are composed of nouns, the constituents that determine which nouns are topics are not only nouns. We argue that texts are well organized and event-driven; therefore, nouns and verbs together contribute to the process of topic identification. This paper considers four factors, (1) word importance, (2) word frequency, (3) word co-occurrence, and (4) word distance, and constructs a mathematical model from them. Preliminary experiments show that the performance of the proposed model is comparable to that of human readers.
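
The four factors named in the abstract can be made concrete with a toy scorer. This record does not reproduce the paper's mathematical model, so the following is only a minimal sketch under stated assumptions: importance is approximated by an IDF-style weight, co-occurrence is counted between a noun and any verb in the same sentence, and distance is the token offset between them. The function name `score_topics`, the `(word, pos)` input format, and the way the factors are combined are all hypothetical.

```python
import math
from collections import Counter


def score_topics(sentences, doc_freq, n_docs):
    """Rank candidate topic nouns in one document (illustrative only).

    sentences: list of sentences, each a list of (word, pos) pairs,
               where pos is "N" for nouns and "V" for verbs (assumed tag set).
    doc_freq:  mapping word -> number of corpus documents containing it.
    n_docs:    total number of documents in the corpus.
    """
    # Factors 1 and 2 (assumed form): IDF-style importance times
    # the noun's frequency inside this document.
    freq = Counter(w for sent in sentences for w, pos in sent if pos == "N")
    scores = {}
    for noun, f in freq.items():
        idf = math.log((1 + n_docs) / (1 + doc_freq.get(noun, 0)))
        scores[noun] = f * idf

    # Factors 3 and 4 (assumed form): every verb sharing a sentence with
    # the noun adds a bonus that shrinks with the token distance.
    for sent in sentences:
        for i, (noun, pos_i) in enumerate(sent):
            if pos_i != "N":
                continue
            for j, (_, pos_j) in enumerate(sent):
                if pos_j == "V":
                    scores[noun] += 1.0 / abs(i - j)

    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `score_topics(sentences, doc_freq, n_docs)` returns `(noun, score)` pairs in descending order. Adding the noun-verb bonus to the frequency term is a design choice made here for brevity; a model following the cited work of Church and Hanks (1990) might instead weight each pair by mutual information.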


Journal articles
1.	Grosz, B. J.; Sidner, C. L. (1986). Attention, Intentions, and the Structure of Discourse. Computational Linguistics, 12(3), 175-204.
2.	Spärck-Jones, Karen (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 11-21.
3.	Church, Kenneth Ward; Hanks, Patrick (1990). Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics, 16(1), 22-29.
4.	Salton, G.; Yang, C. S. (1973). On the Specification of Term Values in Automatic Indexing. Journal of Documentation, 29(4), 351-372.
5.	Salton, G.; Yang, C. S.; Yu, C. T. (1975). A Theory of Term Importance in Automatic Text Analysis. Journal of the American Society for Information Science, 26(1), 33-44.
6.	Youmans, G. (1991). A New Tool for Discourse Analysis: The Vocabulary-Management Profile. Language, 67, 763-789.

Conference papers
1.	陳光華 (1995). Topic Identification in Discourse. The 7th Conference of the European Chapter of the Association for Computational Linguistics. San Francisco, CA: Morgan Kaufmann Publishers. 267-271.

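Research reports
1.	中央研究院詞庫小組 [CKIP Group, Academia Sinica] (1995). 中央研究院平衡語料庫的內容與說明 [Contents and Description of the Academia Sinica Balanced Corpus]. Taipei, R.O.C.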

Books
1.	國立中央圖書館 [National Central Library] (1993). 中文圖書標題表 [Subject Headings for Chinese Books]. Taipei.
2.	中國機讀編目格式修訂小組 [Chinese MARC Format Revision Group] (1997). 中國機讀編目格式 [Chinese MARC Format]. Taipei: 國家圖書館 [National Central Library].
3.	陳雪華 (1996). 圖書館與網路資源 [Libraries and Internet Resources]. Taipei: 文華.
4.	Witten, I. H.; Moffat, A.; Bell, Timothy (1995). Compression and Full-Text Indexing for Digital Libraries. Digital Libraries: Current Issues. Berlin.
5.	Library of Congress (1997). Library of Congress subject headings. Washington.
6.	Carolyn, Ann Reid; McKinnon, Emma Jean (1992). MeSH for searchers. Chicago.
7.	Belkin, Nicholas J. (1994). Tutorial for Information Retrieval: Information Retrieval as Interaction.
8.	Rocchio, J. J. (1971). Relevance Feedback in Information Retrieval. The SMART System-Experiments in Automatic Document Processing. New Jersey.
9.	Ide, E. (1971). New Experiments in Relevance Feedback. The SMART System-Experiments in Automatic Document Processing. New Jersey.
10.	Kamp, H. (1981). A Theory of Truth and Semantic Representation. Formal Methods in the Study of Language〈1〉. Amsterdam.
11.	Hodge, Gail (1992). Automated Support to Indexing. Philadelphia.

Individual papers
1.	Weibel, Stuart L.; Godby, Jean; Miller, Eric (1995). OCLC/NCSA Metadata Workshop Report, http://www.oclc.org/oclc/research/conferences/Metadata/dublin_core_report.html.

Other
1.	FGDC (1994). Content standards for digital geospatial metadata -- FGDC.
2.	Salton, G. (1975). A Theory of Indexing, Philadelphia, PA.
3.	Hearst, M.; Plaunt, C. (1993). Subtopic Structuring for Full-Length Document Access, New York.
4.	Reynar, J. (1994). An Automatic Method of Finding Topic Boundaries.
5.	陳光華; 陳信希 (1994). Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and Its Automatic Evaluation.
6.	陳信希; Lee, J. C. (1996). Identification and Classification of Proper Nouns in Chinese Texts.
7.	陳信希; 邊國維 (1997). Proper Name Extraction from Web Pages for Finding People in Internet, Taipei.
8.	Consortium for the Interchange of Museum Information (CIMI).

Book chapters
1.	Frawley, W. J.; Piatetsky-Shapiro, G. (1991). Knowledge Discovery in Databases. Knowledge Discovery in Databases. Menlo Park.


:::
