高效率之遞增式資料探勘演算法--ICI__臺灣人文及社會科學引文索引資料庫

:::

詳目顯示

第 1 筆 / 總合 1 筆

/1頁

來源文獻資料
摘要
外文摘要
引文資料

題名：	高效率之遞增式資料探勘演算法--ICI
書刊名：	電子商務學報
作者：	黃仁鵬／錢依佩／郭煌政
作者(外文)：	Huang, Jen-peng／Chien, I-pei／Kuo, Huang-cheng
出版日期：	2006
卷期：	8:3
頁次：	頁393-413
主題關鍵詞：	資料探勘；關聯規則；Apriori演算法；高頻項目集；遞增式資料探勘；Data mining；Association rule；Frequent itemsets；Incremental mining
原始連結：	連回原系統網址
相關次數：	被引用次數:期刊(2) 博士論文(0) 專書(0) 專書論文(0) 排除自我引用:1 共同引用:4 點閱:45

隨著資訊科技的進步、電腦的普及，蒐集資料變得更容易、快速而且方便。但長時間之下，資料庫累積了大量且有隱藏知識的資料。所以，如何將這些被隱藏的知識，做正確又有效率地探勘成為一個重要的議題。因此，資料探勘的技術便應運而生。當中，最被廣為使用的技術為關聯規則之探勘。關聯規則探勘主要是探討如何從龐大資料庫中找出高頻項目集，進而發掘有用的知識。而在關聯規則中最常被使用的方法為Apriori演算法。雖然此方法可以找出關聯規則，但是它有二個最大的缺點：第一點為在找高頻項目集合時，會產生大量的候選項目集合；第二點為執行時必須經常掃瞄整個資料庫，造成執行效率不佳。後續有許多研究皆針對此缺點做改進，但皆未跳脫Apriori 演算法的整體架構，以致於其執行效率並無很大的進展。本研究所提出ICI演算法脫離Apriori演算法的架構，在產生大項目集合時，只需掃描資料庫一次，因此可以有效率地降低I/O的存取時間，並且快速地找出關聯規則，使得探勘更有效率。此外ICI演算法不需要任何修改就可以當作線上即時漸增式資料探勘 (On-line Incremental Data Mining) 的演算法。

以文找文

Due to the improvement of information technologies and popularization of computers, collecting information becomes easier, rapider and more convenient than before. As the time goes by, database accumulates huge and knowledge-hiding information. Therefore, how to correctly uncover and efficiently mining hidden knowledge from those information becomes a very important issue. Hence the technology of data mining becomes one of the solutions. Among the data mining technologies association rules mining is one of the most popular technologies to be used. Association rules mining explores the approaches to extract the frequent itemsets from large database and to derive the knowledge behind implicitly. The Apriori algorithm is one of the most frequently used algorithms. Although the Apriori algorithm can successful derive the association rules from database, the Apriori algorithm has two major defects: First, the Apriori algorithm produces large amounts of candidate itemsets during extracting the frequent itemsets from large database. Secondly, the whole database is scanned many times which leads to inefficient performance. Many researches try to improve the performance of the Apriori algorithm, but still not escape from the frame of the Apriori algorithm and lead to a little improvement of the performance. In this paper we propose ICI (Incremental Combination Itemsets) which escapes the frame of Apriori algorithm, and it only needs to scan whole database once during extracting the frequent itemsets from large database. Therefore, the ICI algorithm efficiently reduces the I/O time, and rapidly extracts the frequent itemsets from large database, and makes data mining more efficient than before. Meanwhile, ICI algorithm doesn’t need to scan database and reconstruct data structure again when database is updated or minimum support is varied. Therefore, it can be applied to online incremental mining applications without any modification.

以文找文

期刊論文
1.	Han, J.、Pei, J.、Yin, Y.、Mao, R.(2004)。Mining frequent patterns without candidate generation: a frequent pattern tree approach。Data Mining and Knowledge Discovery，8(1)，53-87。
2.	Chen, Ming-Syan、Han, Jiawei、Yu, Philip S.(1996)。Data Mining: An Overview from a Database Perspective。IEEE Transactions on Knowledge and Data Engineering，8(6)，866-883。
3.	Park, J. S.、Chen, M.-S.、Yu, P. S.(1997)。Using a Hashed Method with Transaction Trimming and Database Scan Reduction for Mining Association Rules。IEEE Transactions on Knowledge and Data Engineering，19(5)，813-825。

會議論文
1.	Agrawal, R.、Imielinski, T.、Swami, A. N.(1993)。Mining Association Rules between Sets of Items in Large Databases。The 1993 ACM SIGMOD International Conference on Management of Data，207-216。
2.	Ng, R.、Han, J.(1994)。Efficient and Effective Clustering Method for Spatial Data Mining。0。
3.	Srikant, R.、Agrawal, R.(1995)。Mining Sequential Patterns。The Eleventh International Conference on Data Engineering。Taipei：IEEE Computer Society。3-14。
4.	Agrawal, R.、Srikant, R.(1994)。Fast algorithms for mining association rules in large database。The 20th International Conference on Very Large Data Bases。Morgan Kaufmann Publishers Inc.。478-499。
5.	黃仁鵬、錢依佩(2002)。高效率之關聯規則探勘演算法－QDT。0。55-55。延伸查詢
6.	Lin, D.、Kedem, Z. M.(1998)。Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Set。0。105-119。

圖書
1.	Kaufman, Leonard、Rousseeuw, Peter J.(1990)。Finding Groups in Data: an Introduction to Cluster Analysis。John Wiley and Sons, Inc.。

推文
推薦
引用網址
引用嵌入語法
轉寄

top

:::

相關期刊
相關論文
相關專書
相關著作
熱門點閱

1.	教師在職進修研習課程大數據關聯規則研究初探：以2017年全國國中語文領域國文科教師為例
2.	女性阿茲海默症病人之照護目標組合的研究：以中部某醫學中心為例
3.	運用關聯規則及改變探勘技術於防火牆政策規則優化
4.	GSPT：使用前序表的高效關聯規則演算法
5.	應用FP-Tree探勘多層次特徵規則
6.	以資料探勘技術建立宅配業之車輛維修及預警決策支援系統
7.	從購買意願資料中挖掘高度相關性的關聯規則
8.	應用高頻項目集探勘技術在DNA晶片雙聚類問題之研究
9.	運用關聯規則和序列型樣探討投資地區之關聯性與遷移--以印刷電路板產業為例
10.	疾病診斷異常之偵測：關聯規則之應用
11.	文化創意產業之資料探勘初探
12.	GSSA：以階段分組排序搜尋機制探勘關聯規則之演算法
13.	應用以約定值為基礎之演算法於關聯規則探勘
14.	關聯推理神經網路
15.	銀行區位選址決策支援系統之研發--以臺北市為例

1.	以資料探勘分析影響國民中小學學習成就因素之研究
2.	從不精準或不確定性資料中挖掘關聯規則
3.	有效率的跨交易關聯規則探勘演算法

無相關書籍

無相關著作

無相關點閱

QR Code

臺灣人文及社會科學引文索引資料庫系統

詳目顯示

臺灣人文及社會科學引文索引資料庫