TY - GEN
T1 - Cluster detection of special datasets using the PYRAMID algorithm
AU - Tout, Samir
AU - Sun, Junping
AU - Sverdlik, William
PY - 2007
Y1 - 2007
N2 - Clustering is the art of discovering patterns in large data sets. With the constantly growing computational demands of modern applications, such data sets have grown tremendously in size and complexity, imposing further challenges on clustering algorithms. This includes outlier handling, detection of arbitrary shaped clusters, processing speed, and dependence on user-supplied parameters. The latest decade has witnessed many studies that have addressed one or more of these challenges. One of these is PYRAMID, or parallel hybrid clustering using genetic programming and multi-objective fitness with density, which we introduced in a previous research. PYRAMID is an algorithm that addresses some of the above challenges by employing a combination of data parallelism, a form of genetic programming, and a multi-objective density-based fitness function in the context of clustering. This study adds to our previous research by analyzing some of the detection characteristics of PYRAMID with respect to a set of challenging datasets and drawing conclusions as well as future directions.
AB - Clustering is the art of discovering patterns in large data sets. With the constantly growing computational demands of modern applications, such data sets have grown tremendously in size and complexity, imposing further challenges on clustering algorithms. This includes outlier handling, detection of arbitrary shaped clusters, processing speed, and dependence on user-supplied parameters. The latest decade has witnessed many studies that have addressed one or more of these challenges. One of these is PYRAMID, or parallel hybrid clustering using genetic programming and multi-objective fitness with density, which we introduced in a previous research. PYRAMID is an algorithm that addresses some of the above challenges by employing a combination of data parallelism, a form of genetic programming, and a multi-objective density-based fitness function in the context of clustering. This study adds to our previous research by analyzing some of the detection characteristics of PYRAMID with respect to a set of challenging datasets and drawing conclusions as well as future directions.
KW - Clustering
KW - Data mining
KW - Density
KW - Genetic programming
KW - Parallelism
UR - https://www.scopus.com/pages/publications/84888371605
UR - https://www.scopus.com/pages/publications/84888371605#tab=citedBy
M3 - Conference contribution
AN - SCOPUS:84888371605
SN - 9789889867140
T3 - Lecture Notes in Engineering and Computer Science
SP - 812
EP - 817
BT - IMECS 2007 - International MultiConference of Engineers and Computer Scientists 2007
T2 - International MultiConference of Engineers and Computer Scientists 2007, IMECS 2007
Y2 - 21 March 2007 through 23 March 2007
ER -