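Title: 臺灣歷史人物文本檢索與探勘系統之建置 (Development of a Text Retrieval and Mining System for Taiwanese Historical People)
Journal: 圖資與檔案學刊
Authors: 謝順宏 (Sie, Shun-hong); 柯皓仁 (Ke, Hao-ren); 張素玢 (Chang, Su-bing)
Publication date: 2018
Volume and issue: 10:1 (cumulative issue 92)
Pages: 67-87
Subject keywords: Taiwan Biographical Database (TBDB); text retrieval; text mining; social network analysis (SNA); named entity recognition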
Persons are one of the key entity types in historical research, so a thorough understanding of biographies helps in studying historical events. Many biographies now exist as digital documents, yet combing through and collating a large body of them by hand remains time-consuming and labor-intensive, which makes information technology a natural aid for historians. Although Taiwan has built many databases and has a wealth of biographical and other usable sources, few tools have been developed for examining and analyzing historical-person data. The authors therefore formed a research team and, using the biographies volume of the Newly Compiled Changhua County Gazetteer (《新修彰化縣志‧人物志》) as the text source, developed tools for database search, full-text retrieval, text mining, and social network analysis to support research in history and the humanities. The long-term goal is to build the Taiwan Biographical Database (TBDB). This article describes the characteristics of the persons currently recorded in TBDB, outlines the system architecture, and reports preliminary results. It also proposes an algorithm for recognizing named entities in 《新修彰化縣志‧人物志》, illustrated with the recognition of poetry society names, which achieves a recall of 96% and a precision of 65%. Finally, it discusses the lessons learned while building TBDB and directions for future development.
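To make the reported recall and precision easier to interpret, the following minimal Python sketch shows how the two measures are computed for an entity recognizer and what F1 score the reported 96% recall and 65% precision imply. The article does not report an F1 score or publish its evaluation data; the function names and the toy entity sets below are hypothetical and serve only as an illustration.

```python
def precision_recall_f1(predicted, gold):
    """Return (precision, recall, F1) for two sets of entity strings."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def f1_from_rates(precision, recall):
    """Harmonic mean of a precision rate and a recall rate."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data, not taken from the article: the recognizer finds
# every true poetry society but also emits one false positive.
gold = {"society A", "society B", "society C"}
predicted = {"society A", "society B", "society C", "not a society"}

p, r, f1 = precision_recall_f1(predicted, gold)
print(f"toy example: precision={p:.2f} recall={r:.2f} F1={f1:.2f}")

# F1 implied by the figures reported in the abstract (derived, not reported).
reported_f1 = f1_from_rates(0.65, 0.96)
print(f"implied F1 for the reported figures: {reported_f1:.3f}")
```

A recall of 96% with a precision of 65% means the method misses few poetry societies but roughly one in three of its candidates is wrong, so manual review of the output would still be needed; the implied F1 is about 0.78.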