融合多粒度信息的文本向量表示模型__臺灣人文及社會科學引文索引資料庫

:::

詳目顯示

第 1 筆 / 總合 1 筆

/1頁

來源文獻資料
摘要
外文摘要

題名：	融合多粒度信息的文本向量表示模型
書刊名：	數據分析與知識發現
作者：	聶維民／陳永洲／馬靜
出版日期：	2019
卷期：	2019(9)
頁次：	45-52
主題關鍵詞：	文本分類；詞向量；卷積神經網絡；主題模型；Text classification；Word vector；Convolutional neural network；Topic model
原始連結：	連回原系統網址
相關次數：	被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0) 排除自我引用:0 共同引用:0 點閱:2

【目的】更加全面地提取文本語義特征,提高文本向量對文本語義的表示能力。【方法】通過卷積神經網絡提取詞粒度、主題粒度和字粒度文本特征向量,通過"融合門"機制將三種特征向量融合得到最終的文本向量,并進行文本分類實驗。【結果】該模型在搜狗語料庫文本分類實驗上的準確率為92.56%,查準率為92.33%,查全率為92.07%,F1值為92.20%,較基準模型Text-CNN分別提高2.40%,2.05%,1.77%,1.91%。【局限】詞序關系范圍較小,語料庫規模較小。【結論】該模型可以更加全面地提取文本語義特征,得到的文本向量對文本語義表示能力更強。

以文找文

[Objective] This paper proposed a model to extract semantic features from texts more comprehensively and to improve the representation of semantics by text vectors. [Methods] We obtained the word-granularity, topic-granularity and character-granularity feature vectors with the help of convolutional neural networks. Then, the three feature vectors were combined by the "merging gate" mechanism to generate the final text vectors. Finally, we examined the model with text classification experiment. [Results] The accuracy(92.56%), the precision(92.33%), the recall(92.07%) and the F-score(92.20%), were 2.40%, 2.05%, 1.77% and 1.91% higher than the results of Text-CNN. [Limitations] The Long-distance dependency features need to be included and the corpus size needs to be expanded. [Conclusions] The proposed model could better represent the text semantics.

以文找文

推文
推薦
引用網址
引用嵌入語法
轉寄

top

:::

相關期刊
相關論文
相關專書
相關著作
熱門點閱

1.	基於深度融合特徵的政務微博轉發規模預測模型
2.	基於詞向量語義擴展的網絡文本特徵選擇方法研究
3.	一種融合LDA與CNN的社交媒體中熱點輿情識別方法
4.	融合主題模型和卷積神經網絡的APP推薦研究
5.	基於類別特徵擴展的短文本分類方法研究

無相關博士論文

無相關書籍

無相關著作

無相關點閱

QR Code

臺灣人文及社會科學引文索引資料庫系統

詳目顯示

臺灣人文及社會科學引文索引資料庫