Data Mining for Network Intrusion Detection: A Comparison of Alternative Methods
Intrusion detection systems help network administrators prepare for and deal with network security attacks. These systems collect information from a variety of systems and network sources, and analyze them for signs of intrusion and misuse. A variety of techniques have been employed for analysis ranging from traditional statistical methods to new data mining approaches. In this study the performance of three data mining methods in detecting network intrusion is examined. An experimental design is created to evaluate the impact of three data mining methods, two data representation formats, and two data proportion schemes on the classification accuracy of intrusion detection systems. The results indicate that data mining methods and data proportion have a significant impact on classification accuracy. Within data mining methods, rough sets provide better accuracy, followed by neural networks and inductive learning. Balanced data proportion performs better than unbalanced data proportion. There are no major differences in performance between binary and integer data representation.
Data Mining, Inductive Learning, Intrusion Detection, Network Security, Neural Networks, Rough Sets, and Telecommunications
Computer Sciences | Management Information Systems | Numerical Analysis and Scientific Computing
Information Systems and Management
ZHU, Dan; Premkumar, G.; ZHANG, Xiaoning; and CHU, Chao-Hsien.
Data Mining for Network Intrusion Detection: A Comparison of Alternative Methods. (2007). Decision Sciences. 32, (4), 635-660. Research Collection School Of Information Systems.
Available at: http://ink.library.smu.edu.sg/sis_research/1764