Computer Science – Information Retrieval
Scientific paper
2003-04-03
10 pages, 2 figures, latex format
Cluster analysis often serves as the initial step in the process of data classification. In this paper, the problem of clustering input data of different lengths is considered. The distance measure used is the edit distance, defined as the minimum number of elementary edit operations needed to transform one vector into another. A heuristic for clustering vectors of unequal length, analogous to the well-known k-means algorithm, is described and analyzed. This heuristic determines cluster centroids by expanding, in a specific way, the shorter vectors in each cluster to the length of the longest vector in that cluster. It is shown that the time and space complexities of the heuristic are linear in the number of input vectors. Experimental results on real data originating from a system for classification of Web attacks are given.
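To make the distance measure concrete, the following is a minimal sketch of an edit distance between two vectors of unequal length, together with the assignment step of a k-means-style loop. It rests on assumptions not stated in the abstract: the elementary operations are taken to be insertion, deletion, and substitution, each with unit cost, and the function names `edit_distance` and `assign_to_nearest` are hypothetical. The paper's centroid computation (the expansion of shorter vectors) is not reproduced here, since the abstract does not specify it.

```python
from typing import List, Sequence


def edit_distance(a: Sequence, b: Sequence) -> int:
    """Edit distance between a and b.

    Assumption: insert, delete, and substitute each cost 1.
    """
    # prev[j] is the distance between the first i-1 items of a
    # and the first j items of b (row i-1 of the DP table).
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # transforming i items into an empty vector costs i deletions
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(
                prev[j] + 1,         # delete x
                curr[j - 1] + 1,     # insert y
                prev[j - 1] + cost,  # substitute x by y, or match
            ))
        prev = curr
    return prev[-1]


def assign_to_nearest(vectors: Sequence[Sequence],
                      centers: Sequence[Sequence]) -> List[int]:
    """For each vector, return the index of the center with the smallest edit distance."""
    return [
        min(range(len(centers)), key=lambda k: edit_distance(v, centers[k]))
        for v in vectors
    ]
```

Only one row of the dynamic-programming table is kept at a time, so the memory used per comparison is proportional to the length of the second vector. For example, `edit_distance("kitten", "sitting")` evaluates to 3.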
Alvarez Gonzalo
Petrovic Slobodan
A Method for Clustering Web Attacks Using Edit Distance