:::

詳目顯示

回上一頁
題名:文件內容之分析--語料庫為本的模型
書刊名:圖書館學刊
作者:陳光華 引用關係陳信希
作者(外文):Chen, Kuang-huaChen, Hsin-hsi
出版日期:1996
卷期:11
頁次:頁95-112
主題關鍵詞:言談分析資訊檢索自然語言處理Discourse analysisInformation retrievalNatural language processing
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(1) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:18
     一般資訊檢索的研究著重於檢索模型的建構、查詢的回饋機制、檢索行為的探討、檢索系統的執行效能。 本文則把研究的重心回歸資訊或文件本身,希望對資訊的內容有一個初步的瞭解。 本文根據三個因素:1)詞彙的重複,2)詞彙的重要性,3)共容語意,提出一個基於真實語料的文件內容分析的模型。這樣的模可型重於文章中名詞╱動詞與名詞╱名詞之間的配對關係。 本文也說明如何使用文件分析模型進行文件切分與文件主題辦識的研究,同時討論相關實驗的結果。
     An important step to understand text is to build the discourse structure through cohesion and coherence. However, to build the discourse structure in turn depends on the full understanding of texts, so that many efforts on this line are not automatic and not successful. A corpus-based model based on 1) repetition of words, 2) importance of words, and 3) collocational semantics for texts is proposed in theis paper. It focuses on association norms of noun-noun relations and noun-verb relations defined on discourse level and sentencelevel, respectively. According to this model, a text partition algorithm is proposed to determine the boundaries of discourse structures and a topic identification algorithm is also presented. The results of a series of experiments show that the proposed model is promising.
期刊論文
1.Grosz, B. J.、Sidner, C. L.(1986)。Attention, Intentions, and the Structure of Discourse。Computational Linguistics,12(3),175-204。  new window
2.Morris, J.、Hirst, G.(1991)。Lexical cohesion computed by thesaural relations as an indicator of the structure of text。Computational Linguistics,17(1),21-48。  new window
3.Youmans, G.(1991)。A New Tool for Discourse Analysis: The Vocabulary-Management Profile。Language,67,763。  new window
會議論文
1.Salton, G.(1986)。On the Use of Term Association in Automatic Information Retrieval。Bonn。380。  new window
2.Passonneau, R.、Litman, D.(1993)。Intention-Based Segmentation: Human Reliability and Correlation with Linguistic Cues。Columbus。148。  new window
3.Hearst, M.(1994)。Multi-Paragraph Segmentation of Expository Text。Las Cruces。9。  new window
4.Hearst, M.、Plaunt, C.(1993)。Subtopic Structuring for Full-Length Document Access。Pittsburgh。59。  new window
5.陳光華、陳信希(1994)。A Part-of Speech-Based Alignment Algorithm。New York。166。  new window
學位論文
1.Smadja, F.(1991)。Extracting Collocations from Text. An Application: Language Generation,New York。  new window
圖書
1.Johansson, S.(1986)。The Tagged LOB Corpus: Users’ Manual。Bergen。  new window
2.Brown, Gillian、Yule, George(1983)。Discourse Analysis。Cambridge University Press。  new window
3.Hearst, M.(1993)。TextTiling: A Qualititative Approach to Discourse Segmentation。TextTiling: A Qualititative Approach to Discourse Segmentation。Berkeley。  new window
4.Jelinek, F.(1985)。Markov Source Modeling of Text Generation。The Impact Processing Techniques in Communication。Nijhoff, Dordrecht。  new window
圖書論文
1.Kamp, Hans(1981)。A theory of truth and semantic representation。Formal Methods in the Study of Language。Amsterdam:Mathematisch Centrum。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE