Compression-based Anomaly Detection using K-means Clustering

http://www.ndsl.kr/ndsl/search/detail/article/articleSearchResultDetail.do?cn=JAKO201227932999832

This study presents a new method for storing large log data, and simultaneously, detecting anomaly data. To achieve this, the well-known K-means clustering algorithm is used for the anomaly detection. In K-means algorithm, the dissimilarity between data is calculated on the space transformed by the Logpack compression algorithm. We also performed a feature selection using genetic algorithms to obtain an informative subset of features relevant to anomaly events. Through various tests, it is observed that the proposed method is superior to conventional algorithms.

View details

August 1, 2012 ∙ anomaly detection log compression K-means clustering genetic algorithm

Need a freelance expert to plan and build your product? Available to founders, teams, and businesses from product framing through launch.

Work With Me Get in touch →