• Article
  • | |
  • Metrics
  • |
  • Reference [17]
  • |
  • Related [20]
  • |
  • Cited by [3]
  • | |
  • Comments
    Abstract:

    Based on the antibody clonal selection theory, an immune clonal data reduction algorithm is proposed for instance selection problems of data reduction. The theory of Markov chain proves that the new algorithm is convergent with probability 1. The experimental studies on seven standard data sets of UCI repository show that the algorithm proposed in this paper is effective. The best domain of the weight parameter λ is determined by analyzing its effect on algorithm’s performance. Furthermore, an encoding method based on the stratified strategy is introduced to accelerate the convergence speed when solving large scale data reduction problems. The experimental studies based on seven large scale data sets show that the improved method is superior to the primary one. Finally,the best domain of the number of stratums t is determined by analyzing its effect on algorithm’s performance based on the data sets Letter and DNA.

    Reference
    [1] Liu H, Motoda H. Instance Selection and Construction for Data Mining. New York: Kluwer Academic Publishers, 2001. 3?20.
    [2] Takashi F, Akio D. A Study of data reduction method with data accuracy for triangle data. In: Barolli L, ed. Proc. of the 11th Int’l Conf. on Parallel and Distributed Systems. Washington: IEEE Computer Society, 2005. 210?213.
    [3] Charu CA. An efficient subspace sampling framework for high-dimensional data reduction, selectivity estimation, and nearest-neighbor search. IEEE Trans. on Knowledge and Data Engineering, 2004,16(10):1247?1262.
    [4] Lynch RS, Willett P K. A theoretical performance analysis of the Bayesian data reduction algorithm. In: Proc. of the 2005 IEEE Int’l Symposium on Systems, Man, and Cybernetics. Piscataway: IEEE Systems, Man, and Cybernetics Society, 2005. 330?335.
    [5] Tahani H, Plummer B, Hemamalini NS. A new data reduction algorithm for pattern classification. In: Proc. of the 1996 IEEE Int’l Conf. on Acoustics, Speech and Signal Processing. Piscataway: IEEE Signal Processing Society, 1996. 3446?3449.
    [6] Cano JR, Herrera F, Lozano M. Using evolutionary algorithms as instance selection for data reduction in KDD: An experimental study. IEEE Trans. on Evolutionary Computation, 2003,7(6):561?575.
    [7] Liu H, Motoda H. Feature Selection for Knowledge Discovery and Data Mining. New York: Kluwer Academic Publishers, 1998.
    [8] Liu H, Motoda H. On issues of instance selection. Data Mining and Knowledge Discovery, 2002,6(2):115?130.
    [9] Cano JR, Herrera F, Lozano M. On the combination of evolutionary algorithm and stratified strategies for training set selection in data mining. Applied Soft Computation, 2006,6(3):323?332.
    [10] Hart PE. The condensed nearest neighbor rule. IEEE Trans. on Information Theory, 1968,IT-14(3):515?516.
    [11] Kibbler D, Aha DW. Learning representative exemplars of concepts: An initial case of study. In: Shavlik JW, Dietterich TG, eds.Readings in Machine Learning. San Fransisco: Morgan Kaufmann Publishers, 1990. 108?114.
    [12] Aha DW, Kibbler D, Albert MK. Instance-Based learning algorithms. Machine Learning, 1991,6(1):37?66.
    [13] Wilson DR, Martinez TR. Instance pruning techniques. In: Fisher DH, ed. Proc. of the 14th Int’l Conf. on Machine Learning. SanFransisco: Morgan Kaufmann Publishers, 1997. 403?411.
    [14] Skalak DB. Prototype and feature selection by sampling and random mutation hill climbing algorithms. In: Cohen WC, Hirsh H,eds. Proc. of the 11th Int’l Conf. on Machine Learning. San Fransisco: Morgan Kaufmann Publishers, 1994. 293?301.
    [15] De Castro LN, Von Zuben FJ. Learning and optimization using the clonal selection principle. IEEE Trans. on EvolutionaryComputation, 2002,6(3):239?251.
    [16] Jiao LC, Du HF, Liu F, Gong MG. Immunological Computation for Optimization, Learning and Recognition. Beijing: SciencePress, 2006 (in Chinese). 附中文参考文献:
    [16] 焦李成,杜海峰,刘芳,公茂果.免疫优化计算、学习与识别.北京:科学出版社,2006.
    Comments
    Comments
    分享到微博
    Submit
Get Citation

公茂果,郝琳,焦李成,王晓华,孙奕菲.基于人工免疫系统的数据简化.软件学报,2009,20(4):804-814

Copy
Share
Article Metrics
  • Abstract:5755
  • PDF: 7377
  • HTML: 0
  • Cited by: 0
History
  • Received:June 15,2007
  • Revised:January 10,2008
You are the first2044101Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063