:::

詳目顯示

回上一頁
題名:幾個快速挖掘關聯規則的資料探勘方法
書刊名:電子商務學報
作者:陳彥良趙書榮陳禹辰 引用關係
作者(外文):Chen, Yen-liangZhao, Shu-rongChen, Yu-chen
出版日期:2003
卷期:5:2
頁次:頁1-10
主題關鍵詞:資料挖掘關聯規則交易資料庫Data miningAssociation ruleTransaction database
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:74
關聯規則的挖掘,是目前最重要的資料挖掘問題之一,它的目的是要從銷售的交易資料庫中,發現商品項目間的關聯。在過去已經有相當多挖掘關聯規則演算法被提出來,當中FP-tree演算法可說是最主要的演算法之一,並以高執行效率著稱。它的主要概念是不產生candidate itemsets,而將資料庫壓縮在FP-tree的結構中以避免多次的高成本資料庫掃瞄。在本文中,我們針對原本的FP tree演算法,更進一步改進其所用的資料結構以提高挖掘效率。 在本文中共建立了三種資料結構: (一) FP-tree_tail演算法,也就是在head table中增加一個tail欄位,(二) FP-treel hash演算法,乃是以hash function計算出每個node所在位置方式建立FP-tree, (三) FP-treel hash+tail演算法,為結合 (一)、(二) 之優點,所完成之演算法。作者將以上三個演算法與傳統FP_tree演算法一起比較以找出各演算法之優缺點。經由各種實驗數據發現,傳統FP_tree演算法所需花費之時間,為三個改良FP-tree演算法的數十倍。
Mining association rules is one of the most important problems in data mining. Its aim is to discover the associations between items in a large database of sales transactions. In the past, a large number of algorithms for mining association rules have been proposed, and the FP-tree algorithm is one of the most famous ones, known for its efficiency. Unlike the traditional approach that requires many phases of candidate itemsets generation and database scan, the FP-tree algorithm compresses and stores the entire database into a sophisticated tree structure, called FP-tree, by which all the associations can be found by two database scans. In this paper, we attempt to further improve the standard structure of the FP-tree such that the mining performance can be improved. To this end, three variants of the improved FP-tree algorithm are proposed. The first variant is called FP-tree+tail, which adds a tail pointer into the head table of the original FP-tree structure. The second is named as FP-tree hash, which adds a hash table into every node of the FP-tree. Finally, we call the last FP-treel1ash+tail, which is a combination of the first two improvements. Finally, a performance evaluation is done to compare their performances. The result indicates that the three proposed algorithms are about 20-50 times faster than the original FP-tree algorithm.
期刊論文
1.Agarwal, R.、Aggarwal, C.、Prasad, V. V. V.(2000)。A tree projection algorithm for generation of frequent item sets。Journal of Parallel and Distributed Computing。  new window
2.Park, J. S.、Chen, M. S.、Yu, P. S.(1995)。An effective hash-based algorithm for mining association rules。Association for computing machinery special interest group on management of data,24(2),175-186。  new window
3.Pasquier, N.、Bastide, Y.、Taouil, R.、Lakhal, L.(1999)。Efficient Mining of Association Rules using Closed Itemset Lattices。Information Systems,24(1),25-46。  new window
4.Pei, J.、Han, J.(2000)。Mining Frequent Patterns by Pattern-Growth: Methodology and Implications。ACM SIGKDD Explorations,2(2)。  new window
會議論文
1.Brin, S.、Motwani, R.、Ullman, J. D.、Tsur, S.(1997)。Dynamic Itemset Counting and Implication Rules for Market Basket Data。The 1997 ACM SIGMOD international conference on Management of data,255-264。  new window
2.Pei, J.、Han, J.、Mortazavi-Asl, B.、Pinto, H.、Chen, Q.、Dayal, U.、Hsu, M. C.(2001)。PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth。17th International Conference on Data Engineering,215-224。  new window
3.Agrawal, R.、Srikant, R.(1994)。Fast algorithms for mining association rules in large database。The 20th International Conference on Very Large Data Bases。Morgan Kaufmann Publishers Inc.。478-499。  new window
4.Han, J.、Pei, J.、Yin, Y.(2000)。Mining frequent patterns without candidate generation。The 2000 ACM SIGMOD International Conference on Management of Data。ACM。1-12。  new window
5.Bayardo, R. J., Jr.(1998)。Efficiently Mining Long Patterns from Databases。沒有紀錄。85-93。  new window
6.Saluja, S.、Mannila, H.、Gunopulos, G.(1997)。Discovering All Most Specific Sentences by Randomized Algorithms。沒有紀錄。215-229。  new window
7.Yen, S. J.、Chen, A. L. P.(1996)。An Efficient Approach to Discovery Knowledge from Large Database。沒有紀錄。8-18。  new window
8.Savasere, E. O.、Navathe, S.(1995)。An Efficient Algorithm for Mining Association Rules in Large Databases。沒有紀錄。432-444。  new window
9.Han, J.、Pei, J.(2000)。Can We Push More Constraints into Frequent Pattern Mining?。The ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,350-354。  new window
研究報告
1.Zaki, M. J.、Li, W.、Ogihara, M.、Parthasarathy, S.(1996)。Evaluation of Sampling for Data Mining of Association Rules。U. Rochester。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
:::
無相關博士論文
 
無相關書籍
 
無相關著作
 
QR Code
QRCODE