:::

詳目顯示

回上一頁
題名:自動化研究主題探勘方法及其在計算語言學之應用
書刊名:教育資料與圖書館學
作者:林頌堅 引用關係
作者(外文):Lin, Sung-chien
出版日期:2005
卷期:42:4
頁次:頁523-544
主題關鍵詞:研究主題探勘文本處理文字資訊探勘資訊視覺化計算語言學Topic explorationText processingText miningInformation visualizationComputational linguistics
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:4
  • 點閱點閱:24
由於科學研究的規模日益龐大而且研究的工作也愈來愈複雜,研究人員與科技管理人員需要一套能夠有效地探勘研究主題的方法。過去我們針對這個問題提出一系列 文本處理與文字資訊探勘的技術,其中主要為關鍵語詞抽取技術以及資訊視覺化技術。關鍵語詞抽取技術以研究領域中的論文文字資料做為輸入,自動化抽取關鍵語 詞來代表領域中的重要主題。資訊視覺化技術則將這些語詞和它們之間的關係呈現在二維的圖形,提供使用者可以透過產生的圖形了解該領域的重要主題和它們的發 展情形。其餘還包括了語詞共現估計、主題相關程度計算以及論文映射等技術。本論文將這些技術整合起來並應用到國內的計算語言學領域,確認這個領域研究與發 展的重點。結果發現計算語言學早期著重於各種語言知識的計算理論化,以因應機器翻譯的需求;中期和近期則有語音處理和資訊檢索等更多的應用出現,而應用的 技術則傾向採用具有強健與容易實作等特性的統計導向方法。
Because the size of modern scientific research is larger than before and the task of research becomes even more complex, researchers and managers urgently need an effective method to explore important topics in research domains. In the past, we had proposed a series of technologies based on text processing and text mining to deal with such a problem. Using text information in papers of the examined domain as input, a technology for term extraction was proposed to select key terms in the text information to represent important topics in the domain. Another proposed technology for information visualization was used to present the terms and their relationships in two-dimensional graphs with a technology of information visualization. Users can easily browse the topics of the domain as well as their development through the generated graphs for decision making of research and management. In addition, the technologies include several techniques of estimating term co-occurrences, calculating degrees of relevance between topics, and mapping paper information to the topic graph. In this paper, an automatic method for topic exploration was proposed with the integration of the developed technologies and it was applied to the studies of computational linguistics in Taiwan to depict foci of research and development in the domain. The result shows that for the development of technologies of machine translation, the earlier studies in the domain emphasized the computational theorization of several linguistic knowledge, but in its mid and later periods, there were more applications emerging, such as speech processing and information retrieval, and a lot of statistical approaches were adopted as the technologies for their robustness and easy implementation.
期刊論文
1.林頌堅(20021200)。基於高頻詞語的圖書資訊學研究領域分析之初步探討。中國圖書館學會會報,69,138-154。new window  延伸查詢new window
2.Deerwester, S.、Dumais, S.T.、Furnas, G.W.、Landauer, T.K.、Harshman, R.A.(1990)。Indexing by latent semantic analysis。Journal of the American Society for Information Science,41(6),391-407。  new window
3.White, Howard D.、McCain, Katherine W.(1998)。Visualizing a Discipline: An Author Co-Citation Analysis of Information Science, 1972-1995。Journal of the American Society for Information Science,49(4),327-355。  new window
4.Sproat, R.、Shih, C.、Gale, W.、Chang, N.(1996)。A Stochastic Finite-State Word-Segmentation Algorithm for Chinese。Computational Linguistics,22(3),377-404。  new window
5.Börner, Katy、Chen, Chaomei、Boyack, Kevin W.(2003)。Visualizing knowledge domains。Annual Review of Information Science and Technology,37(1),179-255。  new window
6.Persson, Olle(1994)。The Intellectual Base and Research Fronts of JASIS 1986-1990。Journal of the American Society for Information Science,45(1),31-38。  new window
7.Flexer, A.(2001)。On the Use of Self-organizing Maps for Clustering and Visualization。Intelligent data analysis,5(5),373-384。  new window
8.Kageura, Kyo、Umino, B.(1996)。Methods of automatic term recognition: A review。Terminology,3(2),259-289。  new window
9.林頌堅(2002)。基於詞語抽取的圖書與資訊學刊研究主題分析。圖書與資訊學刊,47,15-35。new window  延伸查詢new window
10.Huang, Chu-Reng(2000)。From quantitative to qualitative studies: Developments in Chinese computational and corpus linguistics。漢學研究,18,473-509。  new window
11.Ritter, H.、Kohonen, T.(1989)。Self-organizing semantic maps。Biological Cybernetics,61,241-254。  new window
12.Wu, Z.、Tseng, G.(1994)。Chinese Text Segmentation for Text Retrieval, Achievements and Problems。Journal of the American Society for Information Science,44(9),532。  new window
會議論文
1.林頌堅(2003)。基於自然語言處理技術的研究主題抽取與分析。第十五屆中華民國計算語言學研討會。臺北市:中華民國計算語言學學會。231-256。  延伸查詢new window
2.Lin, X.、Soergel, D.、Marchionini, G.(1991)。A self-organizing semantic map for information retrieval。The 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval。New York:ACM。262-269。  new window
3.Chien, Lee Feng(1997)。PAT-Tree-Based keyword extraction for chinese information retrieval。The 20th ACM SIGIR Conference on Research and Development in Information Retrieval,50-58。  new window
4.陳克健、中央研究院資訊科學研究所中文詞知識庫小組、Liu, Shin-Huan(1992)。Word Identification for Mandarin Chinese Sentences。The 1992 International Conference on Computational Linguistics。Nantes。101-107。  延伸查詢new window
5.Landauer, Thomas K.、Laham, D.、Derr, M.(2004)。From paragraph to graph: Latent Semantic Analysis for information visualization5214-5219。  new window
6.Merkl, D.(1997)。Exploration of text collections with hierarchical feature maps186-195。  new window
7.林頌堅(2004)。以自組織映射圖進行計算語言學領域視覺化之研究。Taipei。69-77。  延伸查詢new window
8.王士元(1988)。電腦在語言學裡的運用257-287。  延伸查詢new window
研究報告
1.Rundensteiner, E. A.、Ward, M. O.、Huang, S.(2003)。Exploration of Dimensionality Reduction for Text Visualization。  new window
學位論文
1.Tabah, Albert N.(1996)。Information Epidemics and the Growth of Physics,Canada。  new window
圖書
1.Kohonen, T.(1989)。Self-organization and Associative Memory。Self-organization and Associative Memory。Berlin, Germany:Springer-Verlag。  new window
2.戴安娜.克蘭、劉珺珺(1988)。無形學院--知識在科學共同體的擴散。北京:華夏出版社。  延伸查詢new window
3.Rogers, E. M.(1983)。Diffusion of Innovations。New York:Macmillan Publishing Company, Inc.。  new window
4.Garfield, E.(1979)。Citation Indexing: It’s Theory and Application in Science, Technology and Humanities。New York:John Wiley。  new window
5.de Solla Price, Derek J.(1963)。Little Science, Big Science。Columbia University Press。  new window
6.Salton, Gerald、McGill, Michael J.(1983)。Introduction to modern information retrieval。McGraw-Hill。  new window
7.Card, Stuart K.、Mackinlay, Jock D.、Shneiderman, Ben(1999)。Information visualization。Readings in information visualization-using vision to think。San Francisco。  new window
8.Dale, Robert(2000)。Handbook of Natural Language Processing。Handbook of Natural Language Processing。  new window
9.Hausser, Roland(2001)。Foundations of Computational Linguistics: Human-computer Communication in Natural Language。Foundations of Computational Linguistics: Human-computer Communication in Natural Language。New York。  new window
10.Samuelsson, C.(2003)。Statistical methods。The Oxford Handbook of Computational Linguistics。USA。  new window
11.Meadows, A. J.(1998)。Communicating Research。Communicating Research。San Diego。  new window
12.行政院國家科學委員會科學技術資料中心(2003)。臺灣學術研發能量之總體表現。臺灣學術研發能量之總體表現。臺北。  延伸查詢new window
其他
1.Lenders, Winfried(2001)。Past and Future Goals of Computational Linguistics。  new window
2.Kohonen, Teuvo(1996)。Very Large Two-Level SOM for the Browsing of Newsgroups。  new window
3.Lin, Xia(1992)。Visualization for the document space。  new window
4.Ma, Qing(2002)。Self-organizing Chinese and Japanese Semantic Maps,Taipei, Taiwan。  new window
5.Wermter, Stefan,Hung, Chihli(2002)。Selforganizing classification on the Reuters News Corpus,Taipei, Taiwan。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE