A Clustering-Based Algorithm for Data Reduction

5th International Workshop on Computational Intelligence & Applications Proceedings : IWCIA 2009 Page 65-70 published_at 2009-11

アクセス数 : 633 件

ダウンロード数 : 87 件

今月のアクセス数 : 0 件

今月のダウンロード数 : 0 件

この文献の参照には次のURLをご利用ください : https://ir.lib.hiroshima-u.ac.jp/00028454

File	A1202.pdf 270 KB 種類 : fulltext
Title ( eng )	A Clustering-Based Algorithm for Data Reduction
Creator	Yeh Chi-Yuan Ouyang Jeng Lee Shie-Jue
Source Title	5th International Workshop on Computational Intelligence & Applications Proceedings : IWCIA 2009
Start Page	65
End Page	70
Abstract	Finding an efficient data reduction method for largescale problems is an imperative task. In this paper, we propose a similarity-based self-constructing fuzzy clustering algorithm to do the sampling of instances for the classification task. Instances that are similar to each other are grouped into the same cluster. When all the instances have been fed in, a number of clusters are formed automatically. Then the statistical mean for each cluster will be regarded as representing all the instances covered in the cluster. This approach has two advantages. One is that it can be faster and uses less storage memory. The other is that the number of new representative instances need not be specified in advance by the user. Experiments on real-world datasets show that our method can run faster and obtain better reduction rate than other methods.
Keywords	Large-scale dataset fuzzy similarity data reduction prototype reduction instance-filtering instance-abstraction
NDC	Technology. Engineering [ 500 ]
Language	eng
Resource Type	conference paper
Publisher	IEEE SMC Hiroshima Chapter
Date of Issued	2009-11
Rights	(c) Copyright by IEEE SMC Hiroshima Chapter.
Publish Type	Version of Record
Access Rights	open access
Source Identifier	[ISSN] 1883-3977 [URI] http://www.hil.hiroshima-u.ac.jp/iwcia/2009/