:::

詳目顯示

回上一頁
題名:運用詞彙權重技術於自動文件摘要之研究
書刊名:資訊管理學報
作者:黃仁鵬 引用關係張貞瑩
作者(外文):Huang, Jen-pengChang, Chen-ying
出版日期:2014
卷期:21:4
頁次:頁391-415
主題關鍵詞:自動文件摘要文字探勘網際網路探勘資訊檢索TF-IDF 演算法Automatic text summarizationText miningWeb miningTF-IDF
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(2) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:2
  • 共同引用共同引用:5
  • 點閱點閱:83
目前各個搜尋引擎所產生的網頁摘要,大多無法提供使用者充足的摘要內容 判斷資訊,更可能造成使用者的誤導。本研究希望搜尋引擎將查詢結果回傳給使 用者時,不只是給予一些片斷不全的訊息,取而代之的是一個比較有幫助的摘要, 使用者可以藉由此自動摘要,了解全文的概要,然後決定是否需要讀取網頁之全 文。本研究運用權重技術針對網頁的內容進行文字探勘,藉由中研院所開發的中 文斷詞系統(CKIP)進行斷詞,利用TF-ISF 與相似度權重技術分別進行摘要實作, 並透過其聯集與交集分別產生「概略摘要」與「精準摘要」,藉以提升自動摘要的 品質。由實驗結果可證實本研究所提出之系統方法可以有效的提升文件自動摘要 的正確性。
Purpose-The objective of text document summarization is to extract essential sentences that cover most of the concepts of a document so that users are able to comprehend the ideas of the documents which try to address by simply reading through the corresponding summary. This study aims to develop an automatic text summarization technique to product the summary of the web pages by extracting the sentences which cover most of the concepts of the web pages. Design/methodology/approach-The research framework was developed from CKIP (Chinese Knowledge Information Processing) system and automatic text summarization techniques. Two studies were designed to elicit and evaluate the accuracy and applicability of the five automatic text summarization techniques with 10 samples from 184 web articles. Findings-Our results show that TF-ISF (Term Frequency-Inverse Sentence Frequency) is better than the others in the evaluation of “F-measure”. Further, “Rough Summary” and “Accurate Summary” respectively is the best performance in the evaluation of “RECALL” and “PRECISION”.Research limitations/implications-This paper focuses on Chinese web articles. Hence, future research is recommended to develop an automatic text summarization system based on Ontology-based architecture. Practical implications - This paper provides several automatic text summarization techniques to product the summary of the web pages by extracting the sentences which cover most of the concepts of the web pages. The experimental results indicate that the proposed approach outperform a significant improvement on the accuracy of automatic text summarization. Originality/value-This paper is the first that applies the union and intersection of “Rough Summary” and “Accurate Summary” to improve the quality of automatic text summarization.
期刊論文
1.李俊宏、張興亞(20070900)。一個以Ontology為基礎的Web-Mining技術應用於供應鏈競爭分析之研究。電子商務學報,9(3),435-460。new window  延伸查詢new window
2.李麗華、李富民、詹尚驥、周裕健(20090600)。以學術部落格為主之個人化推薦系統。資訊科技國際期刊,3(1),56-75。new window  延伸查詢new window
3.柯淑津(20030800)。從詞網出發的中文複合名詞的語意表達。International Journal of Computational Linguistics & Chinese Language Processing,8(2),93-107。new window  延伸查詢new window
4.鄒明城、韓慧林、邱景星(20100700)。網頁地理資訊檢索與探勘--以民宿主題為例。資訊管理學報,17(3),19-44。new window  延伸查詢new window
5.Das, D.、Martins, A.F.(2007)。A survey on automatic text summarization。Literature Survey for the Language and Statistics II course at CMU,4,192-195。  new window
6.Gupta, V.、Lehal, G. S.(2010)。A survey of text summarization extractive techniques。Journal of Emerging Technologies in Web Intelligence,2(3),258-268。  new window
7.Losiewicz, P.、Oard, D. W.、Kostoff, R. N.(2000)。Textual data mining to support science and technology management。Journal of Intelligent Information Systems,15(2),99-119。  new window
8.Luhn, H. P.(1958)。The Automatic Creation of Literature Abstracts。IBM Journal of Research and Development,2(2),159-165。  new window
9.Baxendale, P. B.(1958)。Machine Made Index for Technical Literature: An experiment。IBM Journal of Research & Development,2(4),354-361。  new window
10.魏玲玉、曾守正(20060700)。以文件倉儲概念實現動態群聚與多重文件摘要之研究--以中文電子新聞為例。資訊管理學報,13(3),153-176。new window  延伸查詢new window
11.Salton, G.、Singhal, A.、Mitra, M.、Buckley, C.(1997)。Automatic Text Structuring and Summarization。Information Processing & Management,33(2),193-207。  new window
會議論文
1.陳姿妤、魏世杰(20070526)。運用重複具排除技術於中文文件自動摘要之研究。第十八屆國際資訊管理學術研討會。臺北。  延伸查詢new window
2.黃純敏、吳郁瑩(19991022)。網路中文文件自動摘要。網際網路研討會,國立中山大學承辦 。高雄。  延伸查詢new window
3.黃純敏、楊存一、邱立豐(2002)。英文網路文件自動摘要之研究。第十三屆國際資訊管理學術研討會,(會議日期: 2002/05/20-05/23)。台北。  延伸查詢new window
4.黃純敏、黃世源、盧韋秀(2011)。自動摘要方法於新聞解讀之比較。2011商管與資訊研討會,(會議日期: 2011/04/28-04-29)。新北市三峽。  延伸查詢new window
5.Abdel Fattah, M.、Ren, F.(2008)。Probabilistic neural network based text summarization。The International Conference on Natural Language Processing and Knowledge Engineering (IEEE 2008),(會議日期: 2008/10/19-10/22)。Beijing, China。1-6。  new window
6.Dalal, M. K.、Zaveri, M. A.(2011)。Heuristics based automatic text summarization of unstructured text。The International Conference & Workshop on Emerging Trends in Technology,(會議日期: 2011/02/25-02/26)。  new window
7.Harris, A.、Oussalah, M.(2008)。Automatic document summarizer。The 7th IEEE International Conference on Cybernetic Intelligent Systems (CIS 2008),(會議日期: 2008/09/09-09/10)。London, UK。1-6。  new window
8.Ji, X.(2008)。Research on the Automatic Summarization Model based on Genetic Algorithm and Mathematical Regression。The International Symposium on Electronic Commerce and Security (ISECS 2008),(會議日期: 2008/08/03-08/05)。Guangzhou, China。488-491。  new window
9.Ren, F.、Li, S.、Kita, K.(2001)。Automatic abstracting important sentences of web articles。IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC 2001),(會議日期: 2001/10/07-10/10)。Tucson, Arizona。1705-1710。  new window
10.Wei, C. P.、Chen, L. C.、Chen, H. Y.、Yang, C. S.(2013)。Mining Suppliers from Online News Documents。The Pacific Asia Conference on Information Systems (PACIS 2013),(會議日期: 2013/06/18-06/22)。Jeju Island, Korea。  new window
圖書
1.Mani, I.、Maybury, M. T.(1999)。Advances in automatic text summarization。Cambridge, MA:The MIT Press。  new window
2.Salton, G.(1989)。Automatic text processing。Addison-Wesley Publishing Company。  new window
3.Sullivan, Dan(2001)。Document Warehousing and Text Mining: Techniques for Improving Business Operations, Marketing, and Sales。John Wiley & Sons, Inc.。  new window
4.Salton, Gerald、McGill, Michael J.(1983)。Introduction to modern information retrieval。McGraw-Hill。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
:::
無相關書籍
 
無相關著作
 
QR Code
QRCODE