幾個快速挖掘關聯規則的資料探勘方法__臺灣人文及社會科學引文索引資料庫

:::

詳目顯示

第 1 筆 / 總合 1 筆

/1頁

來源文獻資料
摘要
外文摘要
引文資料

題名：	幾個快速挖掘關聯規則的資料探勘方法
書刊名：	電子商務學報
作者：	陳彥良／趙書榮／陳禹辰
作者(外文)：	Chen, Yen-liang／Zhao, Shu-rong／Chen, Yu-chen
出版日期：	2003
卷期：	5:2
頁次：	頁1-10
主題關鍵詞：	資料挖掘；關聯規則；交易資料庫；Data mining；Association rule；Transaction database
原始連結：	連回原系統網址
相關次數：	被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0) 排除自我引用:0 共同引用:0 點閱:74

關聯規則的挖掘，是目前最重要的資料挖掘問題之一，它的目的是要從銷售的交易資料庫中，發現商品項目間的關聯。在過去已經有相當多挖掘關聯規則演算法被提出來，當中FP-tree演算法可說是最主要的演算法之一，並以高執行效率著稱。它的主要概念是不產生candidate itemsets，而將資料庫壓縮在FP-tree的結構中以避免多次的高成本資料庫掃瞄。在本文中，我們針對原本的FP tree演算法，更進一步改進其所用的資料結構以提高挖掘效率。在本文中共建立了三種資料結構: (一) FP-tree_tail演算法，也就是在head table中增加一個tail欄位，(二) FP-treel hash演算法，乃是以hash function計算出每個node所在位置方式建立FP-tree， (三) FP-treel hash+tail演算法，為結合 (一)、(二) 之優點，所完成之演算法。作者將以上三個演算法與傳統FP_tree演算法一起比較以找出各演算法之優缺點。經由各種實驗數據發現，傳統FP_tree演算法所需花費之時間，為三個改良FP-tree演算法的數十倍。

以文找文

Mining association rules is one of the most important problems in data mining. Its aim is to discover the associations between items in a large database of sales transactions. In the past, a large number of algorithms for mining association rules have been proposed, and the FP-tree algorithm is one of the most famous ones, known for its efficiency. Unlike the traditional approach that requires many phases of candidate itemsets generation and database scan, the FP-tree algorithm compresses and stores the entire database into a sophisticated tree structure, called FP-tree, by which all the associations can be found by two database scans. In this paper, we attempt to further improve the standard structure of the FP-tree such that the mining performance can be improved. To this end, three variants of the improved FP-tree algorithm are proposed. The first variant is called FP-tree+tail, which adds a tail pointer into the head table of the original FP-tree structure. The second is named as FP-tree hash, which adds a hash table into every node of the FP-tree. Finally, we call the last FP-treel1ash+tail, which is a combination of the first two improvements. Finally, a performance evaluation is done to compare their performances. The result indicates that the three proposed algorithms are about 20-50 times faster than the original FP-tree algorithm.

以文找文

期刊論文
1.	Agarwal, R.、Aggarwal, C.、Prasad, V. V. V.(2000)。A tree projection algorithm for generation of frequent item sets。Journal of Parallel and Distributed Computing。
2.	Park, J. S.、Chen, M. S.、Yu, P. S.(1995)。An effective hash-based algorithm for mining association rules。Association for computing machinery special interest group on management of data，24(2)，175-186。
3.	Pasquier, N.、Bastide, Y.、Taouil, R.、Lakhal, L.(1999)。Efficient Mining of Association Rules using Closed Itemset Lattices。Information Systems，24(1)，25-46。
4.	Pei, J.、Han, J.(2000)。Mining Frequent Patterns by Pattern-Growth: Methodology and Implications。ACM SIGKDD Explorations，2(2)。

會議論文
1.	Brin, S.、Motwani, R.、Ullman, J. D.、Tsur, S.(1997)。Dynamic Itemset Counting and Implication Rules for Market Basket Data。The 1997 ACM SIGMOD international conference on Management of data，255-264。
2.	Pei, J.、Han, J.、Mortazavi-Asl, B.、Pinto, H.、Chen, Q.、Dayal, U.、Hsu, M. C.(2001)。PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth。17th International Conference on Data Engineering，215-224。
3.	Agrawal, R.、Srikant, R.(1994)。Fast algorithms for mining association rules in large database。The 20th International Conference on Very Large Data Bases。Morgan Kaufmann Publishers Inc.。478-499。
4.	Han, J.、Pei, J.、Yin, Y.(2000)。Mining frequent patterns without candidate generation。The 2000 ACM SIGMOD International Conference on Management of Data。ACM。1-12。
5.	Bayardo, R. J., Jr.(1998)。Efficiently Mining Long Patterns from Databases。沒有紀錄。85-93。
6.	Saluja, S.、Mannila, H.、Gunopulos, G.(1997)。Discovering All Most Specific Sentences by Randomized Algorithms。沒有紀錄。215-229。
7.	Yen, S. J.、Chen, A. L. P.(1996)。An Efficient Approach to Discovery Knowledge from Large Database。沒有紀錄。8-18。
8.	Savasere, E. O.、Navathe, S.(1995)。An Efficient Algorithm for Mining Association Rules in Large Databases。沒有紀錄。432-444。
9.	Han, J.、Pei, J.(2000)。Can We Push More Constraints into Frequent Pattern Mining?。The ACM SIGKDD International Conference on Knowledge Discovery and Data Mining，350-354。

研究報告
1.	Zaki, M. J.、Li, W.、Ogihara, M.、Parthasarathy, S.(1996)。Evaluation of Sampling for Data Mining of Association Rules。U. Rochester。

推文
推薦
引用網址
引用嵌入語法
轉寄

top

:::

相關期刊
相關論文
相關專書
相關著作
熱門點閱

1.	運用資料挖掘技術探索顧客購買圖書特性
2.	運用資料挖掘技術探討成人健檢資料
3.	頻繁集的自然分桶的Hash生成方法
4.	以螞蟻理論為基礎的資料挖掘方法之研究
5.	應用資料挖掘於交通事故資料分析
6.	應用資料挖掘技術於「學生缺曠模式分析」之研究--以醒吾技術學院觀光科為例
7.	在少樣商品或短交易長度情況下挖掘關聯規則
8.	在包裹資料庫中挖掘數量關聯規則

無相關博士論文

無相關書籍

無相關著作

1.	運用服務導向架構發展中小企業之雲端碳足跡運算系統
2.	電子化行銷導向之量表發展與驗證
3.	從社會認同理論與情緒之觀點探討線上品牌社群之對立品牌忠誠：以智慧型手機論壇為例
4.	擴增實境影像自拍系統對旅遊行為之影響
5.	開發慢性腎臟病(CKD)在飲食照護的諮詢系統
6.	基於決策樹與二元語言模型的網路用語轉譯系統
7.	應用特徵分析探索有向網絡之拓撲結構
8.	機械式及有機式補救機制對補救績效之影響：以線上購物業者為例
9.	為網絡可達性分析之區塊模式化擴展
10.	線上購物服務失誤類型與補救策略、認知公平與補救後滿意度之關係
11.	從交易成本觀點探討影響持續合購意願之因素
12.	面對網路商店進入下雙通路競爭之定價策略
13.	網路品牌社群認同與投入對消費者行為之影響
14.	混合複數類神經模糊與自動回歸差分平均移動方法之智慧型時間序列預測模型
15.	應用整合型科技接受模式與創新擴散通用模型於企業導入數位學習之多層次分析

QR Code

臺灣人文及社會科學引文索引資料庫系統

詳目顯示

臺灣人文及社會科學引文索引資料庫