| Peer-Reviewed

Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms

Received: 10 October 2014     Accepted: 14 October 2014     Published: 6 November 2014
Views:       Downloads:
Abstract

In several papers, clustering has been used for preprocessing datasets before applying classification algorithms in order to enhance classification results. A strong clustered dataset as input to classification algorithms can significantly improve the computation time. This can be particularly useful in “Big Data” where computation time is equally or more important than accuracy. However, there is a trade-off between computation time (speed) and accuracy among clustering algorithms. Specifically, general type-2 fuzzy c-means (GT2 FCM) is considered to be a highly accurate clustering approach, but it is computationally intensive. To improve its computation time we propose a hybrid clustering algorithm called KFGT2FCM that combines GT2 FCM with two fast algorithms k-means and Fuzzy C-means algorithm for input data preprocessing of classification algorithms. The proposed algorithm shows improved computation time when compared with GT2 FCM on five benchmarks from university of California Irvine (UCI) library.

Published in International Journal of Intelligent Information Systems (Volume 3, Issue 6-1)

This article belongs to the Special Issue Research and Practices in Information Systems and Technologies in Developing Countries

DOI 10.11648/j.ijiis.s.2014030601.27
Page(s) 91-97
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2014. Published by Science Publishing Group

Keywords

Classification, Input Data Preprocessing, Clustering, General Type-2 Fuzzy Logic, Fuzzy C-Means (FCM), K-Means

References
[1] Ting liu, Jurrus, E., Seyedhosseini, M., Ellisman, M., Tasdizen, T.,Watershed merge tree classification for electron microscopy image segmentation. 21st International Conference on Pattern Recognition (ICPR), 11-15 Nov. 2012.
[2] Zhang Bin,Ma, Guorui,Zhang, Zhi,Qin, Qianqing, Region-based classification by combining MS segmentation and MRF for POLSAR images , Journal of Systems Engineering and Electronics. 2013 (Volume:24, Issue:3).
[3] Zhou Q. ,Tong G. ,Xie D. ,Li B. , A Seismic-Based Feature Extraction Algorithm for Robust Ground Target Classification . Signal Processing Letters, IEEE. 2012 (Volume: 19, Issue: 10).
[4] Garcia Bermudez, F.L., Julian, R.C., Haldane, D.W., Abbeel P., Performance analysis and terrain classification for a legged robot over rough terrain , International Conference on Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ.7-12. 2012, Vilamoura, Portugal.
[5] Nasrollahi P., Jafari S. , Ebrahimi, M., Action classification of humanoid soccer robots using machine learning , 16th CSI International Symposium on Artificial Intelligence and Signal Processing (AISP), 2-3 May 2012, Shiraz, Iran.
[6] B., H.B., N., J.C., Hierarchical classification using a Competitive Neural Network , Eighth International Conference on Natural Computation (ICNC), 2012. 29-31 May 2012, Chongqing ,China.
[7] W. Yang , K. Wang , W. Zuo, Prediction of protein secondary structure using large margin nearest neighbor classification . Advanced Computer Control (ICACC), 2011 3rd International Conference, 18-20 Jan. 2011, Harbin, China.
[8] Yuvaraj, N. ,Vivekanandan, P., An efficient SVM based tumor classification with symmetry Non-negative Matrix Factorization using gene expression data, International Conference on Information Communication and Embedded Systems (ICICES), 2013. 21-22. 2013, Chennai, India.
[9] Swangnetr, M.,Kaber, D.B., Emotional State Classification in Patient–Robot Interaction Using Wavelet Analysis and Statistics-Based Feature Selection, IEEE Transactions on Human-Machine Systems (Volume:43,Issue:1),.2013.
[10] W Cai ;S. Chen ;D. Zhang. A Multi-objective Simultaneous Learning Framework for Clustering and Classification . IEEE Transactions on Neural Networks, Volume: 21 , Issue: 2 2010.
[11] E. R., Pfahringer, B., Holmes, G., Clustering for classification . Information Technology in Asia (CITA 11), 2011 7th International Conference on Digital Object Identifier: 10.1109/CITA.2011.5998839. 2011 , Page(s): 1 – 8.
[12] A. Shahi, R. Binti Atan and M-D. Nasir Sulaiman, An effective fuzzy c-mean and type-2 fuzzy logic for weather forecasting . Journal of Theoretical and Applied Information Technology. 2009, Vol. 5 Issue 5, p550 . Malaysia.
[13] Q. Liang and J. Mendel, Decision Feedback Equalizer for Nonlinear Time-Varying Channels Using Type-2 fizzy Adaptive Filters . Fuzzy Systems, 2000..
[14] G. Zhengetal, A Similarity Measure between Interval Type-2 Fuzzy Sets . Proceedings of the 2010 IEEE, International Conference on Mechatronics and Automation 2011.
[15] Kimito Funatsu and Kiyoshi Hasegawa, New fundamental technologies in data mining. First published January, 2011. Printed in India.
[16] Rui Xu, Donald Wunsch II, Survey of Clustering Algorithms . IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 16, NO. 3, 2005.
[17] Huaxiang Zhang , Jing Lu , Creating ensembles of classifiers via fuzzy clustering and deflection . Fuzzy Sets and Systems, Volume 161, Issue 13, 1 2010, Pages 1790–1802.
[18] Ondrej Linda, Milos Manic, General Type-2 Fuzzy C-Means Algorithm for Uncertain Fuzzy Clustering . Fuzzy Systems, IEEE Transactions. 13 2012, ISSN : 1063-6706.
[19] Frank, A. Asuncion, UCI Machine Learning Repository [http://archive.ics.uci.edu/ml], Irvine, CA: University of California, School of Informatics and Computer Science.
[20] R. Athauda, M. Tissera, C. Fernando. ” Data Mining Applications: Promise and Challenges”. Data Mining and Knowledge Discovery in Real Life Applications, ISBN 978-3-902613-53-0, pp. 438, 2009, I-Tech, Vienna, Austria.
[21] M. H. Fazel Zarandi, I. B. Turksen, O. Torabi Kasbi,” Type-2 fuzzy modeling for desulphurization of steel process”. Expert Systems with Applications 32 (2007) 157–171.
[22] D. Wu , J. Mendel.” Enhanced Karnik-Mendel Algorithms for Interval Type-2 Fuzzy Sets and Systems”. Fuzzy Information Processing Society, 2007. Annual Meeting of the North American.
[23] Der-Chen Lin, Miin-Shen Yang,” A similarity measure between type-2 fuzzy sets with its application to clustering”. Fourth International Conference on Fuzzy Systems and Knowledge Discovery, 2007
[24] Wen-liang Hung, Miin-shen Yang ,” Similarity Measures Between Type-2 Fuzzy Sets”. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems. Vol. 12, No. 6 (2004) 827-841.
[25] Miin-Shen Yang, Der-Chen Lin. “On similarity and inclusion measures between type-2 fuzzy sets with an application to clustering”. Computers and Mathematics with Applications 57 (2009) 896_907.
[26] Hwang C.-M., Yang M.-S., Hung W.-L., “On similarity, inclusion measure and entropy between type-2 fuzzy sets”. International Journal of Uncertainty, Fuzziness and Knowlege-Based Systems 2012. Volume 53, Issues 9–10.
Cite This Article
  • APA Style

    Vahid Nouri, Mohammad Reza Akbarzadeh, Tootoonchi, Alireza Rowhanimanesh. (2014). Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms. International Journal of Intelligent Information Systems, 3(6-1), 91-97. https://doi.org/10.11648/j.ijiis.s.2014030601.27

    Copy | Download

    ACS Style

    Vahid Nouri; Mohammad Reza Akbarzadeh; Tootoonchi; Alireza Rowhanimanesh. Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms. Int. J. Intell. Inf. Syst. 2014, 3(6-1), 91-97. doi: 10.11648/j.ijiis.s.2014030601.27

    Copy | Download

    AMA Style

    Vahid Nouri, Mohammad Reza Akbarzadeh, Tootoonchi, Alireza Rowhanimanesh. Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms. Int J Intell Inf Syst. 2014;3(6-1):91-97. doi: 10.11648/j.ijiis.s.2014030601.27

    Copy | Download

  • @article{10.11648/j.ijiis.s.2014030601.27,
      author = {Vahid Nouri and Mohammad Reza Akbarzadeh and Tootoonchi and Alireza Rowhanimanesh},
      title = {Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms},
      journal = {International Journal of Intelligent Information Systems},
      volume = {3},
      number = {6-1},
      pages = {91-97},
      doi = {10.11648/j.ijiis.s.2014030601.27},
      url = {https://doi.org/10.11648/j.ijiis.s.2014030601.27},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijiis.s.2014030601.27},
      abstract = {In several papers, clustering has been used for preprocessing datasets before applying classification algorithms in order to enhance classification results. A strong clustered dataset as input to classification algorithms can significantly improve the computation time. This can be particularly useful in “Big Data” where computation time is equally or more important than accuracy. However, there is a trade-off between computation time (speed) and accuracy among clustering algorithms. Specifically, general type-2 fuzzy c-means (GT2 FCM) is considered to be a highly accurate clustering approach, but it is computationally intensive. To improve its computation time we propose a hybrid clustering algorithm called KFGT2FCM that combines GT2 FCM with two fast algorithms k-means and Fuzzy C-means algorithm for input data preprocessing of classification algorithms. The proposed algorithm shows improved computation time when compared with GT2 FCM on five benchmarks from university of California Irvine (UCI) library.},
     year = {2014}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - Integrating Type-1 Fuzzy and Type-2 Fuzzy Clustering with K-Means for Pre-Processing Input Data in Classification Algorithms
    AU  - Vahid Nouri
    AU  - Mohammad Reza Akbarzadeh
    AU  - Tootoonchi
    AU  - Alireza Rowhanimanesh
    Y1  - 2014/11/06
    PY  - 2014
    N1  - https://doi.org/10.11648/j.ijiis.s.2014030601.27
    DO  - 10.11648/j.ijiis.s.2014030601.27
    T2  - International Journal of Intelligent Information Systems
    JF  - International Journal of Intelligent Information Systems
    JO  - International Journal of Intelligent Information Systems
    SP  - 91
    EP  - 97
    PB  - Science Publishing Group
    SN  - 2328-7683
    UR  - https://doi.org/10.11648/j.ijiis.s.2014030601.27
    AB  - In several papers, clustering has been used for preprocessing datasets before applying classification algorithms in order to enhance classification results. A strong clustered dataset as input to classification algorithms can significantly improve the computation time. This can be particularly useful in “Big Data” where computation time is equally or more important than accuracy. However, there is a trade-off between computation time (speed) and accuracy among clustering algorithms. Specifically, general type-2 fuzzy c-means (GT2 FCM) is considered to be a highly accurate clustering approach, but it is computationally intensive. To improve its computation time we propose a hybrid clustering algorithm called KFGT2FCM that combines GT2 FCM with two fast algorithms k-means and Fuzzy C-means algorithm for input data preprocessing of classification algorithms. The proposed algorithm shows improved computation time when compared with GT2 FCM on five benchmarks from university of California Irvine (UCI) library.
    VL  - 3
    IS  - 6-1
    ER  - 

    Copy | Download

Author Information
  • Department of Computer Engineering, Islamic Azad University, Mashhad Branch, Mashhad, Iran

  • Department of Electrical Engineering, University of Neyshabur, Neyshabur, Iran

  • Sections