:::

詳目顯示

回上一頁
題名:模糊關聯規則之研究
作者:王建驊
作者(外文):Chien-Hua Wang
校院名稱:元智大學
系所名稱:資訊管理學系
指導教授:龐金宗
學位類別:博士
出版日期:2013
主題關鍵詞:資料探勘Apriori演算法FP-growth演算法模糊集合模糊分割法模糊關聯規則Data MiningApriori AlgorithmFP-growth AlgorithmFuzzy SetsFuzzy Partition MethodFuzzy Association Rule
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:14
隨著計算機技術的迅速發展,其在資料庫中存儲和管理資料的能力正變得越來越重要了。雖然計算機技術的發展有利於數據處理和簡化存儲介質上的要求,提取可用的隱含信息,以幫助決策,已成為一個新的和具有挑戰性的任務。
然而,資料探勘乃是資料的探索和分析,以便於發現有意義的模式。它可以有效地分析各種的應用,這將有助於在業務決策過程。
在資料探勘中,在交易資料庫裡最常見的為關聯規則。其目的在於尋找資料庫裡項目之間的關係,則反映出項目$(X)$出現時,另一個項目$(Y)$可能會出現。
例如,當一位顧客購買麵包時,而也有可能會買牛奶。因此,關聯規則便能協助決策者了解顧客在購買商品時可能的搭配,以便能促進規劃行銷策略。
在本論文,我們結合FP-growth與模糊集合之概念來進行模糊關聯規則探勘。
使用模糊集合是因為它可以自然語言的方式來描述模糊知識,而這也相當符合人們的主觀思考,也更助於增加使用者在制定決策時之彈性。
因此,模糊分割法可模糊的呈現方式是否為使用者所理解。
其次,由於Apriori演算法對於大量資料無法有效率處理的缺點以及時間複雜度會隨著資料愈大而急速的成長,因此在關聯規則的技術上選擇以FP-growth為主。
本論文的目的在於發展處理數量化的模糊資料探勘方法。
基於模糊分割法,由屬性資料中找出可理解與潛在性有用的模糊知識,並進一步結合FP-growth演算法來解決不同的決策問題。
第一個探勘方法是從交易資料中萃取出模糊關聯規則。在此方法中,是利用模糊分割法先轉換成語意的模糊知識,再藉由FP-growth演算法進行探勘。
而在第二個方法中,則是在FP-growth演算法運算的過程中,加入了表格結構來計算,因而能比第一個方法更有效率。
另外,對於需所的參數值及管理者所評估的項目重要性,皆以語意型態來表達,此方式對人類而言是較自然及易於了解。
因此,在第三個方法裡,我們便提出了模糊權重式關聯規則來進行探勘。
最後,本論文所提出的三種不同方式也與其他的方法相比較;其實驗結果也證實本論文所提的方法是具有良好之效率。
除此之外,我們所提的方法皆優於Apriori演算法,而第二個方法則比第一個方法有較佳的效率。
As computer technology progresses rapidly, its capacity to store and manage data in database has become crucial.
Though computer technology development facilitates data processing and eases demands on storage media, extraction of available implicit information to aid decision making has turn into a new and challenging task.
However, data mining is the exploration and analysis of data in order to discover meaningful patterns.
It can effectively be applied on all varieties of analysis and assist the process of decision-making in businesses.
In data mining, finding association rules in transaction database is most commonly seen.
The purpose is to search for the relation that exists among items of database.
The relation reflects that items ($X$) appear, other items ($Y$) are likely to appear as well.
For instance, when a customer purchases bread, one might also get milk along with it.
Accordingly, association rules can assist decision makers to scoop out the possible items that are likely to be purchased by consumers in the hopes to facilitate marketing strategies.
In this dissertation, we combine FP-growth with the concept of fuzzy sets to mine fuzzy association rules.
Using fuzzy sets means that we consider fuzzy knowledge representations described by the natural language are well suited for the subjective thinking of human subjects and will assist users in making decisions flexibly.
Therefore, the fuzzy partition method can be comprehensible by human users.
Next, because Apriori algorithm is not efficient in handing drawbacks for huge data and its time complexity with greater information and rapid growth, the technology of association rule was selected by FP-growth algorithm.
The main aim of this dissertation is to develop novel fuzzy data mining techniques for quantitative data to find comprehensible and hidden useful fuzzy knowledge based on the fuzzy partition method and FP-growth to solve various decision problems.
In the first algorithm, the fuzzy association rules are extracted from transaction database.
In this algorithm, we use fuzzy partition method to transform quantitative data into fuzzy knowledge and apply FP-growth algorithm to mine.
In the second algorithm, we increase the table structure in FP-growth algorithm, thus it is more efficient than the first method.
In addition, the parameters needed in the mining process and the importance of items evaluated by managers are given as linguistic terms, which are more natural and understandable for human beings.
Hence, in the third method, we proposed fuzzy weighted association rule to mine. Finally, we propose three different mining methods to compare with other methods, and the experiment results were verified that the methods proposed have great efficiency.
Besides, the proposed approaches are superior to Apriori algorithm. Then the second approach also has more efficiency than the first one.
[1] G. Piatetsky-Shapiro, “Knowledge discovery in real databases: A report on thenew window
ijcai-89 workshop,” AI Magazine, vol. 11, no. 4, pp. 68–70, 1990.
[2] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, “From data mining to knowledge
discovery in databases,” Advances in Knowledge Discovery and Data Mining,
vol. 17, no. 3, pp. 1–34, 1996.
[3] A. A. Freitas, Data mining and knowledge discovery with evolutionary algorithms.
Springer, New Year, 2002.
[4] M. J. Berry and G. Linoff, Data mining techniques: for marketing, sales, and
customer support. John Wiley &; Sons, NY,, 1997.
[5] R. Agrawal, T. Imielinksi, and A. Swami, “Mining association rules between
sets of items in large databases,” in The 1993 ACM SIGMOD Conference,
vol. 22, no. 2. ACM, 1993, pp. 207–216.
[6] R. Agrawal, T. Imielinski, and A. Swami, “Database mining: A performance
perspective,” Knowledge and Data Engineering, IEEE Transactions on, vol. 5,
no. 6, pp. 914–925, 1993.
[7] R. Agrawal, R. Srikant et al., “Fast algorithms for mining association rules,”
in Proc. 20th Int. Conf. Very Large Data Bases, VLDB, vol. 1215, 1994, pp.
487–499.
[8] J. Han and Kamber, Data mining: concepts and techniques. Morgan kaufmann,
San Francisisco, 2001.
[9] H.-J. Zimmermann, Fuzzy sets, decision making and expert systems. Kluwer
Academic Pub, Boston, 1991, vol. 10.
[10] Y.-C. Hu, R.-S. Chen, and G.-H. Tzeng, “Finding fuzzy classification rules
using data mining techniques,” Pattern Recognition Letters, vol. 24, no. 1, pp.new window
509–519, 2003.
11] H. Ishibuchi, T. Nakashima, and T. Murata, “Performance evaluation of fuzzy
classifier systems for multidimensional pattern classification problems,” Systems,
Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on,
vol. 29, no. 5, pp. 601–618, 1999.
[12] J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,”
in in Proc. ACM SIGMOD Int. Conf. on Management of Data, vol. 29,
no. 2. ACM, 2000, pp. 1–12.
[13] S.-M. Chen and C.-M. Huang, “A new approach to generate weighted fuzzy
rules using genetic algorithms for estimating null values,” Expert Systems with
Applications, vol. 35, no. 3, pp. 905–917, 2008.
[14] T.-P. Hong and J.-B. Chen, “Finding relevant attributes and membership functions,”
Fuzzy Sets and Systems, vol. 103, no. 3, pp. 389–404, 1999.
[15] T.-P. Hong, C.-S. Kuo, and S.-C. Chi, “A data mining algorithm for transaction
data with quantitative values,” International Data Analysis, vol. 3, no. 5, pp.
363–376, 1999.
[16] T.-P. Hong, M.-J. Chiang, and S.-L. Wang, “Fuzzy weighted data mining from
quantitative transactions with linguistic minimum supports and confidences,”
International Journal of Fuzzy Systems, vol. 8, no. 4, pp. 173–182, 2006.
[17] T.-P. Hong, C.-H. Chen, Y.-L. Wu, and Y.-C. Lee, “A ga-based fuzzy mining
approach to achieve a trade-off between number of rules and suitability of
membership functions,” Soft Computing, vol. 10, no. 11, pp. 1091–1101, 2006.
[18] Y.-C. Hu, “Mining association rules at a concept hierarchy using fuzzy partition,”
Journal of Information Management, vol. 13, no. 3, pp. 63–80, 2006.
[19] Y.-C. Hu, “Finding useful fuzzy concepts for pattern classification using genetic
algorithm,” Information Sciences, vol. 175, no. 1, pp. 1–19, 2005.new window
[20] Y.-C. Hu, “Sugeno fuzzy integral for finding fuzzy if-then classification rules,”
Applied Mathematics and Computation, vol. 185, no. 1, pp. 72–83, 2007.new window
[21] M. Spiliopoulou, “Web usage mining for web site evaluation,” Communications
of the ACM, vol. 43, no. 8, pp. 127–134, 2000.
[22] P.-N. Tan et al., Introduction to data mining. Pearson Education India, Boston,
2005.
[23] R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, A. I. Verkamo et al., “Fast
discovery of association rules,” Advances in Knowledge Discovery and Data
Mining, vol. 12, pp. 307–328, 1996.
[24] L. A. Zadeh, “Fuzzy sets,” Information and Control, vol. 8, no. 3, pp. 338–353,
1965.
[25] L.-X. Wang, “Adaptive fuzzy systems and control- design and stability analysis(
book),” Englewood Cliffs, NJ: PTR Prentice Hall, 1994., 1994.
[26] H. O. Wang, K. Tanaka, and M. F. Griffin, “An approach to fuzzy control of
nonlinear systems: stability and design issues,” Fuzzy Systems, IEEE Transactions
on, vol. 4, no. 1, pp. 14–23, 1996.new window
[27] R. Babuska, “Fuzzy modeling-a control engineering perspective,” in Fuzzy Systems,
1995. International Joint Conference of the Fourth IEEE International
Conference on Fuzzy Systems and The Second International Fuzzy Engineering
Symposium., Proceedings of 1995 IEEE International Conference on, vol. 4.
IEEE, 1995, pp. 1897–1902.
[28] E. Kim and H. Lee, “New approaches to relaxed quadratic stability condition
of fuzzy control systems,” Fuzzy Systems, IEEE Transactions on, vol. 8, no. 5,
pp. 523–534, 2000.
[29] C.-H. Wang and C.-T. Pang, “Applying fuzzy data mining for an application
crm,” Bulletin of Networking, Computing, Systems, and Software, vol. 1, no. 1,new window
pp. 46–51, 2012.
[30] C.-H. Wang and C.-T. Pang, “Applying fuzzy data mining for an application
crm,” in The Second International Conference on Networking and Computing,
Nov. 2011.
[31] L. A. Zadeh, “The role of fuzzy logic in the management of uncertainty in
expert systems,” Fuzzy Sets and Systems, vol. 11, no. 1, pp. 197–198, 1983.new window
[32] L. Wei-Yi, “Fuzzy data dependencies and implication of fuzzy data dependencies,”
Fuzzy Sets and Systems, vol. 92, no. 3, pp. 341–348, 1997.
[33] B. P. Buckles and F. E. Petry, “A fuzzy representation of data for relational
databases,” Fuzzy Sets and Systems, vol. 7, no. 3, pp. 213–226, 1982.
[34] S. Liao, H. Wang, and W. Liu, “Functional dependencies with null values, fuzzy
values, and crisp values,” Fuzzy Systems, IEEE Transactions on, vol. 7, no. 1,new window
pp. 97–103, 1999.
[35] W.-H. Lee and C.-T. Pang, “An extension of semantic proximity for fuzzy functional
dependencies,” in Fuzzy Information Processing Society, 2009. NAFIPS
2009. Annual Meeting of the North American. IEEE, 2009, pp. 1–6.
[36] W.-H. Lee, C.-H. Wang, and C.-T. Pang, “Evaluating service quality of online
auction by fuzzy mcdm,” International Conference on Intelligent Systems, 2010.
[37] C.-H. Wang and C.-T. Pang, “Using vikor method for evaluating service quality
of online auction under fuzzy environment,” International Journal of Computer
Science and Emerging Technologies, vol. 1, pp. 307–314, 2011.new window
[38] C.-H. Wang, S.-H. Liu, and C.-T. Pang, “Using genetic algorithm improve the
consistency of fuzzy analytic hierarchy process,” in Soft Computing and Intelligent
Systems (SCIS) and 13th International Symposium on Advanced Intelligent
Systems (ISIS), 2012 Joint 6th International Conference on. IEEE, 2012, pp.
977–982.
[39] C.-H. Wang, M.-Y. Chou, and C.-T. Pang, “Applying fuzzy analytic hierarchy
process for evaluating service quality of online auction,” in International
Conference on Intelligent Systems and Technologies, May 2012.
[40] L. A. Zadeh, “The concept of a linguistic variable and its application to approximate
reasoning (part 1),” Information Sciences, vol. 8, no. 3, pp. 199–249,new window
1975.
[41] L. A. Zadeh, “The concept of a linguistic variable and its application to approximate
reasoning (part 2),” Information Sciences, vol. 8, pp. 301–357, 1975.
[42] L. A. Zadeh, “The concept of a linguistic variable and its application to approximate
reasoning (part 3),” Information Sciences, vol. 9, pp. 43–80, 1975.
[43] H. J. Zimmermann, Fuzzy set theory and its applications. Kluwer Academic
Pub, Boston, 1996.
[44] H. Ishibuchi, K. Nozaki, and H. Tanaka, “Distributed representation of fuzzy
rules and its application to pattern classification,” Fuzzy Sets and Systems,
vol. 52, no. 1, pp. 21–32, 1992.new window
[45] H. Ishibuchi, K. Nozaki, N. Yamamoto, and H. Tanaka, “Selecting fuzzy ifthen
rules for classification problems using genetic algorithms,” Fuzzy Systems,
IEEE Transactions on, vol. 3, no. 3, pp. 260–270, 1995.
[46] G.-H. Tzeng and J.-J. Huang, Multiple Attribute Decision Making: Method and
Applciations, C. Press, Ed. CRC Press, 2011.
[47] C.-H. Wang, W.-H. Lee, and C.-T. Pang, “Applying fuzzy fp-growth to mine
fuzzy association rules,” in International Conference on Intelligent Systems,
vol. 65. World Academy of Science, Engineering and Technology, May 2010.
[48] C.-H. Wang, S.-H. Liu, and C.-T. Pang, “Mining association rules uses fuzzy
weighted fp-growth,” in Soft Computing and Intelligent Systems (SCIS) and
13th International Symposium on Advanced Intelligent Systems (ISIS), 2012
Joint 6th International Conference on. IEEE, 2012, pp. 983–988.
[49] C.-W. Lin, T.-P. Hong, and W.-H. Lu, “An efficient tree-based fuzzy data
mining approach,” International Journal of Fuzzy Systems, vol. 12, no. 2, pp.
150–157, 2010.
[50] S. Papadimitriou and S. Mavroudi, “The fuzzy frequent pattern tree,” in Proceedings
of the 9th WSEAS International Conference on Computers. World
Scientific and Engineering Academy and Society (WSEAS), 2005, pp. 1–7.
[51] C.-H. Wang, W.-H. Lee, and C.-T. Pang, “Fuzzy data mining for quantitative
transactions with fp-growth,” Journal of Nonlinear and Convex Analysis,
vol. 14, no. 1, pp. 193–207, 2013.new window
[52] K. C. Chan and W.-H. Au, “Mining fuzzy association rules,” in Proceedings of
the sixth International Conference on Information and knowledge management.
ACM, 1997, pp. 209–215.
[53] C. M. Kuok, A. Fu, and M. H. Wong, “Mining fuzzy association rules in
databases,” ACM Sigmod Record, vol. 27, no. 1, pp. 41–46, 1998.new window
[54] P.-L. Huang, “Research on mining fuzzy quantitative algorithm,” Master’s thesis,
Master Thesis, Institute of Information Management, I-Shou University,
Kaohsiung, 2009.
[55] C.-H. Wang and C.-T. Pang, “Finding fuzzy association rules using fwfp-gwoth
with linguistic supports and confidences,” International Journal of Information
and Mathematical Sciences, vol. 5, pp. 300–308, 2009.
[56] C.-H.Wang and C.-T. Pang, “Finding fuzzy association rules using fwfp-growth
with linguistic supports and confidences,” in International Conference on Computer
and Information Technology, vol. 53. WORLD Academy of Science,
Engineering and Technology, May 2009, pp. 1139–1147.
[57] M. Kaya and R. Alhajj, “Utilizing genetic algorithms to optimize membership
functions for fuzzy weighted association rules mining,” Applied Intelligence,
vol. 24, no. 1, pp. 7–15, 2006.new window
82
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE