:::

詳目顯示

回上一頁
題名:控制詞彙之自動索引
書刊名:中國圖書館學會會報
作者:陳光華 引用關係伍建廷
作者(外文):Chen, Kuang-hwaWu, Chien-ting
出版日期:1998
卷期:61
頁次:頁81-102
主題關鍵詞:自動索引控制詞彙主題分析Automatic indexingControlled vocabularySubject analysis
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:12
  • 點閱點閱:31
     本論文於詞彙頻率統計的基礎下,利用大量經人工控制詞彙索引的文件,配合控 制詞彙所提供的語意訊息,設計一個自動索引模型。索引模型使用新的詞彙顯著性計算公式 TF × OSDF × CSIDF 修正傳統以 TF × IDF 無法將主題專指性詞彙從主題相近的文件集 合中分離出來的問題。 實驗針對 100 個 MeSH 標題,利用總數 60,400 篇文件的摘要與題 名進行訓練與測試,結果顯示索引模型的表現相當優良。摘要部分的索引精確率與索引回現 率可同時到達 90% 以上,題名部分則在索引精確率 90% 的要求下,維持索引回現率於 70% 。透過索引模型產生大量的控制詞彙建議名單,將可以減輕索引一致性的問題,提高文件的 控制詞彙索引數量,改善傳統控制詞彙索引因為產量過少,導致檢索時精確率雖高,但回現 率卻不如自然語言索引的現象。
     Based on statistics of word frequency and supported by semantic information of controlled vacabularies, a new model for automatically controlled-vocabulary indexing is proposed in this paper. In the proposed model, a new formula of term significance, TF × OSDF × CSIDF, amends the flaw of TF × IDF, in which subject-specific words with high benefit to subject identification cannot be distinquished from other words in the document collection of the same or close subject. Involving 100 MeSH subject headings and 60,400 abstracts and titles, results of the experiment achieve high performance, whereas indexing precision and recall exceed 90% concurrently in abstract part. In tile part, the indexing precision reaches 90% and indexing recall remains 70%. By consulting a big number of candidates of controlled vocabularies generated by the model, the problem of indexer's consistency could be alleviated. Besides, much time and cost saved will directly prompt quality and quantity of controlled-vocabulary index terms, and finally improve retrieval performance indirectly.
期刊論文
1.陳昭珍(19920600)。主題索引問題初探。美國資訊科學學會臺北學生分會會訊,5,14-35。  延伸查詢new window
2.Spärck-Jones, Karen(1972)。A statistical interpretation of term specificity and its application in retrieval。Journal of Documentation,28(1),11-21。  new window
3.Fagan, Joel L.(1989)。The Effectiveness of a Nonsyntactic Approach to Automatic Phrase Indexing for Document Retrieval。Journal of the American Society for Information Science,40(2),115-132。  new window
4.陳昭珍(1992)。主題理論之探討(上)。書農,9,21。  延伸查詢new window
5.陳佳君(1995)。從知識結構探討主題分析。書府,16,30-48。  延伸查詢new window
6.Blair, D. C.、Maron, M. E.(1990)。Full Text Information Retrieval: Further Analysis and Clarification。Information Processing and Management,26(3),437-447。  new window
7.Leung, Chi-Hong、Kan, Wing-Kay(1997)。A Statistical Learning Approach to Automatic Indexing of Controlled Index Terms。Journal of the American Society for Information Science,48(1),55-56。  new window
8.Ginsberg, A.(1993)。A Unified Approach to Automatic Indexing and Information Retrieval。IEEE Expert,8,46-56。  new window
9.Schuegraf, E. J.、Bommel, F. van(1993)。An Automatic Document Indexing System Based on Cooperating Expert Systems: Design and development。Canadian Journal of Information and Library Science,18(2),32-50。  new window
10.Vleduts-Stokolov, Natasha(1987)。Concept Recognition in an Automatic Text-processing System for the Life Sciences。Journal of the American Society for Information Science,38(4),267-287。  new window
11.Vleduts-Stokolov, Natasha(1982)。On Automatic Support to Indexing a Life Science Eata Base。Information Processing and Management,18(6),313-321。  new window
12.Humphrey, Susanne M.、Miller, Nancy E.(1987)。Knowledge-based Indexing of the Medical Literature: the Indexing Aid Project。Journal of the American Society for Information Science,38(3),184-196。  new window
13.Trubkin, Loene(1979)。Auto-indexing of the 1971-77 ABI-INFORM Database。Database,2(2),56-61。  new window
14.Aoki, T.、Tanaka, T.、Nishii, T.、Tsukada, M.、Nakamura, O.、Minami, T.(1988)。Automatic Index Extraction System Using Layout Structure。Research Reports of Kogakuin University,64,297-302。  new window
15.Dillon, Martin、Grar, Ann S.(1983)。FASIT: a fully automatic based indexing system。Journal of the American Society for Information Science,34(2),99-108。  new window
16.Janas, J. M.(1977)。Automatic Recognition of the Part-of-speech foe English Texts。Information Processing and Management,13,205-213。  new window
17.Sager, Naomi(1975)。Sublanguae Grammars in Science Information Processing。Journal of the American Society for Information Science,26(1),10-16。  new window
18.陳光華(1997)。電子文獻主題之自動辨識。中華民國圖書館學會會報,59,43-58。new window  延伸查詢new window
19.Cohen, Jonathan D.(1995)。Highlights: Language-and Domain-independent Automatic Indexing Terms for Abstracting。Journal of the American Society for Information Science,46(3),162-174。  new window
20.Jones, Leslie P.、Edward, W.、Gassie, Jr.、Radhakrishnan, Sridhar(1990)。INDEX: the Statistical basis for an Automatic Conceptual Phrase-index System。Journal of the American Society for Information Science,41(2),87-97。  new window
21.Jones, Kevin P.(1976)。Toward a Theory of indexing [Documentation notes]。The Journal of Documentation,32(2),118-125。  new window
會議論文
1.陳光華(1998)。新資訊時代的啟發性資訊服務。臺北:桃園。195-208。  延伸查詢new window
2.Salton, Gerard(1988)。Syntactic Approaches to Automatic Book Indexing。Buffalo。120-138。  new window
3.Faraj, N.、Godin, R.、Missaoui, R.、David, S.、Plante, P.(1944)。Evaluation of the Contribution of Syntactic Composite Terms to Automatic Indexing340-360。  new window
4.Kang, K. H.、Lee, Y. C.、Jang, W. H.、Park, Y. S.(1994)。An Implementation of an Automatic Keyword Extraction System。Beijing。708-711。  new window
5.陳光華(1995)。Topic Indentification in Discourse。Dublin, Ireland。267-271。  new window
6.Chang, Jyun-Sheng、Tseng, Tsung-yih、Cheng, Ying、Chen, Huey-Chyun、Cheng, Shun-der、Ker, Sur-Jin、Liu, John S.(1992)。A Corpus-based Statistical Approach to Automatic Book Indexing。Trento, Italy。147-151。  new window
圖書
1.Salton, G.(1989)。Automatic Text processing: the Transformation, Analysis and Retrieval of Information by Computer。Automatic Text processing: the Transformation, Analysis and Retrieval of Information by Computer。Reading, M. A.。  new window
2.鄭恒雄(1982)。中文參考資料。中文參考資料。臺北。  延伸查詢new window
3.馮志偉(19970801)。現代術語學引論。大陸語文出版社。  延伸查詢new window
4.黃慕萱(19960000)。資訊檢索中「相關」概念之研究。臺北:臺灣學生。new window  延伸查詢new window
5.Salton, Gerald、McGill, Michael J.(1983)。Introduction to modern information retrieval。McGraw-Hill。  new window
6.胡述兆(1985)。圖書館學與資訊科學大辭典。圖書館學與資訊科學大辭典。臺北。  延伸查詢new window
7.(1998)。國立臺灣大學醫學院圖書分館CDP OVID Web 版資料庫使用手冊。國立臺灣大學醫學院圖書分館CDP OVID Web 版資料庫使用手冊。臺北。  延伸查詢new window
其他
1.Medical Subject Headings。  new window
2.MEDILINE。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top