:::

詳目顯示

回上一頁
題名:部落格本文自動萃取機制
書刊名:電子商務研究
作者:洪智力 引用關係林政輝
作者(外文):Hung, ChihliLin, Cheng-hui
出版日期:2010
卷期:8:4
頁次:頁457-472
主題關鍵詞:部落格文章資訊擷取文字探勘文件物件模型Blog textInformation extractionText miningDocument object model
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:63
在部落格快速發展的時代,部落格上的資訊越來越多且具有參考價值,部落格文字內容探勘已成為網頁探勘研究的重要分支。要能自動化讀取部落格的文字內容,必須正確的找出描述本文的網頁標籤。本研究提出「網頁標籤文字相對比例法」,找出最有可能的本文標籤,此技術運用文件物件模型(DOM; document object model)的概念並透過網頁爬行器自動萃取部落格本文。經過實驗說明,本研究所提供的部落格本文自動萃取機制,能正確的過濾雜訊,找出本文標籤。
In the era of blog, more and more useful information is shared on blogs. Mining text on blogs has become one of important and novel research directions in the filed of web mining. For an automatic blog text mining system, it is necessary to locate the tags which describe the main concepts of blog text effectively and efficiently. This research uses the technique of relative proportion of text and tag in order to find the most possible tag for main blog text. More particularly, we use the concept of DOM (document object model) through the java crawler to analyze the relationship between text and tag. According to our experiments, our automatic blog text extraction mechanism is able to extract the main text of blog effectively and efficiently.
期刊論文
1.Kumar, R.、Novak, J.、Raghavan, P.、Tomkins, A.(2004)。Structure and evolution of blogspace。Communications of the ACM,47(12),35-39。  new window
2.Chen, Y.、Tsai, F. S.、Chan, K. L.(2008)。Machine learning techniques for business blog search and mining。Expert Systems with Applications,35(3),581-590。  new window
3.Rosencrance, L.(20040126)。Blogs bubble into business。Computerworld,38(4),23-24。  new window
4.Grossman, L.(2004)。Meet joe blog。Time,163(24),65-68。  new window
5.Geng, H.、Gao, Q.、Pan, J.(2007)。Extracting content for news web pages based on DOM。IJCSNS International Journal of Computer Science and Network Security,7(2),124-129。  new window
6.Wang, K. T.、Huang, Y. -M.、Jeng, Y. -L.、Wang, T. -I(2008)。A blog-based dynamic learning map。Computers & Education,51(1),262-278。  new window
7.Huang, T. -C.、Cheng, S.-C.、Huang, Y.-M.(2009)。A blog article recommendation generating mechanism using an SBACPSO algorithm。Expert Systems with Applications,36(7),10388-10396。  new window
8.Quan, C.、Ren, F.(2010)。A blog emotion corpus for emotional expression analysis in Chinese。Computer Speech and Language,24(4),726-749。  new window
9.Cao, D.、Liao, X.、Xu, H.、Bai, S.(2008)。Blog post and comment extraction using information quantity of web format。Lecture Notes in Computer Science,4993,298-309。  new window
會議論文
1.Lin, S. -H.、Ho, J. -M.(2002)。Discovering informative content blocks from web documents588-593。  new window
2.Hammer, J.、Garcia-Molina, H.、Cho, J.、Aranha, R.、Crespo, A.(1997)。Extracting semistructured information from the web8-25。  new window
3.Wu, F.、Hoffman, R.、Weld, D.S.(2008)。Information extraction from Wikipedia: moving down the long tail731-739。  new window
學位論文
1.侯嘉昌(2009)。知識工作者藉由部落格進行知識分享對壓力紓解之影響研究(碩士論文)。大同大學。  延伸查詢new window
2.吳志宏(民92)。以隱性回饋為基礎的自動化推薦機制。  延伸查詢new window
3.黃高彬(民97)。部落格之精華文章自動收錄系統。  延伸查詢new window
其他
1.創市際市場研究顧問公司(2007),http://briian.com/?p=3059。  延伸查詢new window
2.Henning, J.(2004)。The blogging iceberg - of 4.12 million hosted weblogs,http://pages.citebite.com/h1u2f1h4l1lbx。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE