An Efficient Data Pre-processing Procedure for Support Vector Clustering


Findings of  Dr. K. Raghava Rao, Y.A.Siva Prasad, B.Venugopal on "An Efficient Data Pre-processing Procedure for Support Vector Clustering" have been selected for NATIONAL CONFERENCE on "Imaging, Computing, Object and Mining (ICOM'10)" Organized by Department of Computer Science and Engineering ANNA UNIVERSITY, TIRUCHIRAPPALLI scheduled on May 7th and 8th 2010 at Anna University Tiruchirappalli,TAMILNADU

Abstract:

This paper presents an efficient data pre-processing procedure for the support of vector clustering (SVC) to reduce the size of a training dataset. Solvingthe optimization  problem and labeling the data points with cluster labels are time-consuming in the SVC training procedure. This makes using SVC to process largedatasets inefficient. We proposed a data pre-processing procedure to solve the problem.  The procedure contains a shared nearest neighbour (SNN) algorithm, andutilizes the concept of unit vectors for eliminating insignificant data points from the datasetComputer simulations have been conducted on artificial and benchmarkdatasets to demonstrate the effectiveness of the proposed method.

Keywords? Support Vector Clustering, Shared Nearest Neighbours, Noise Elimination.