Building Efficient Intrusion Detection System Using Factor Analysis and Support Vector Machines

P Indira Priyadarsini; I Ramesh Babu

doi:10.17577/IJERTV3IS041928

Volume 03, Issue 04 (April 2014)


Open Access
Authors : P Indira Priyadarsini, I Ramesh Babu
Paper ID : IJERTV3IS041928
Volume & Issue : Volume 03, Issue 04 (April 2014)
Published (First Online): 26-04-2014
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License


P Indira Priyadarsini
Dept of Computer Science & Engg., Acharya Nagarjuna University, Guntur, A.P., India

I Ramesh Babu
Dept of Computer Science & Engg., Acharya Nagarjuna University, Guntur, A.P., India

Abstract – Intrusion detection is a critical issue in network security for protecting network resources. An accurate system for detecting intrusions must therefore be built to safeguard information in any organization, public or private. The main goal is to increase the detection rate and reduce the false alarm rate. Because existing Intrusion Detection Systems (IDSs) use all the features to detect known intrusions, they achieve poor results. We propose an algorithm, Factor Analysis based Support Vector Machine (FA-SVM), for developing an efficient IDS. It uses a popular statistical technique called Factor Analysis (FA), through which the features are analyzed as factors. Designing effective and efficient IDSs also requires a well-chosen classifier, so we use Support Vector Machines (SVMs), which have high generalization ability. The experiments are conducted on the Knowledge Discovery and Data Mining (KDD) Cup 99 dataset. The performance of this approach was analyzed and compared with existing approaches: Principal Component Analysis (PCA) with SVM, and classification with SVM alone without feature selection. The results show that the proposed method improves intrusion detection and outperforms the existing approaches, yielding a computationally efficient IDS with minimal false positive rates.

Key words: Intrusion Detection System (IDS), Network Security, Factor Analysis (FA), Support Vector Machines (SVMs), Principal Component Analysis (PCA).

1. INTRODUCTION

Intrusion detection is a critical issue in network security for protecting network resources. An accurate system for detecting intrusions must therefore be built to safeguard information in any organization, public or private. The main goal is to increase the detection rate and reduce the false alarm rate. An Intrusion Detection System (IDS) dynamically monitors the events occurring in a system and decides whether these events are signs of an attack or constitute an authorized use of the system [1] [2] [3]. In terms of how network traffic is monitored, there are several types of IDS, such as the Network Intrusion Detection System (NIDS), the Host Based Intrusion Detection System (HIDS) and the Hybrid Intrusion Detection System.

An IDS has to monitor a large amount of audit data even for a small network, which makes analysis difficult and leads to poor detection of suspicious activities. The features are also related to one another in diverse ways. An IDS therefore has to decrease the quantity of data to be processed by removing the features that contain false correlations and redundant information. This results in better accuracy and lower computation time. The IDS task is commonly modeled as a classification problem in a machine-learning context. Many methods have been proposed for developing an efficient IDS; among them, Support Vector Machines (SVMs) with various kernels have gained significant importance [4]. In modeling an efficient IDS it is necessary to reduce the features, which has been shown to change performance considerably [5].

Research on constructing an Intrusion Detection System mainly falls into two areas: detection model generation and intrusion feature selection. To achieve accurate results, preprocessing techniques such as feature selection and feature reduction have become crucial in Intrusion Detection Systems [6]. A recent study reported an improved false positive rate using Artificial Neural Networks (ANN) for intrusion detection with Principal Component Analysis (PCA) as the feature selection strategy [7]. Numerous studies show reasonably good results with feature reduction using the Support Vector Machine (SVM) as the classifier [8][9][10][11]. In another study, using Classification and Regression Trees (CART) and Bayesian Networks (BN), Chebrolu et al. gave ensemble feature selection algorithms that result in a lightweight IDS [12]. More recently, a study on Generalized Discriminant Analysis as a feature selection technique achieved good results [17].

Even though SVM is a good classification technique, many problems arise when it is applied to massive datasets. Since training an SVM amounts to solving a quadratic optimization problem, it needs a large amount of computation time and memory as the dimensionality increases. Moreover, for a pattern classification problem such as intrusion detection, it is difficult to decide which features are useful for distinguishing attacks from normal activity. In an IDS there are both a large number of dimensions d and a large number of examples k, which leads to inaccurate results. There is therefore a need to select the most significant features and apply high performance classifiers like SVMs, which results in low false alarm rates.

In this paper we use a popular statistical technique called Factor Analysis (FA) as a dimensionality reduction technique, through which the features are analyzed as factors. The rest of the paper is organized as follows. Section 2 gives an overview of Support Vector Machines and Factor Analysis. Section 3 describes the proposed IDS model with a novel algorithm, and Section 4 gives the experimental results, followed by conclusions and future work.

2. MACHINE LEARNING PERSPECTIVE: AN OVERVIEW

2.1. SUPPORT VECTOR MACHINES

Classification in an IDS mainly deals with reducing false positives and distinguishing between normal and attack patterns, and Support Vector Machines (SVMs) are well suited to this task. SVMs are supervised learning techniques. The SVM is based on statistical learning theory and was developed by Vapnik [13][14][15]. SVMs are built from support vectors, which determine the classification of data points with the Maximal Marginal Hyperplane (MMH). The main aim is to classify the data points using the MMH by solving a quadratic optimization problem [16]. SVMs have short running times and give highly accurate classification results. Part of the attractiveness of SVMs lies in their mathematical formulation and pictorial illustrations.

An SVM is constructed from support vectors, which are the decisive points in each of the two classes. Once the support vectors are identified, it is easy to draw the hyperplane that separates the positive and negative classes. This is how classification is done in an SVM. Because it uses class labels, it is a supervised learning technique. Training the model yields the weight vector and bias values, which are used to identify the support vectors. An SVM can be constructed both when the data is linearly separable and when it is linearly inseparable. When the data is linearly separable, the MMH is constructed based on the training points and the class boundary. When the data is linearly inseparable, the data is mapped to a high dimensional feature space and classification is done there. This mapping is carried out through a kernel function. Figure 1 illustrates SVM classification.

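Figure 1: SVM classification. The MMH is the hyperplane w.x+b=0; the support vectors lie on the margin boundaries w.x+b=1 and w.x+b=-1, each at distance 1/||w|| from the MMH.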

done to improve learning methods using SVMs. One approach is to optimize the SVM algorithm [20, 21] to solve the convex optimization problem. Other approaches include a simplification phase that reduces the training set size [22, 23]. Model selection is crucial when training an SVM. Even though SVM algorithms are less sensitive to the curse of dimensionality, dimensionality reduction techniques can enhance the efficiency of SVMs. The generalization ability of an SVM depends on the choice of its parameters.

When training on a dataset with SVMs, the user must specify the type of kernel function to apply [21]. There are several kernel functions, including linear, sigmoid, polynomial and radial basis (Gaussian). The performance of an SVM depends mainly on the kernel selected. Studies have shown that the Radial Basis Function (RBF) is the most popular choice of kernel because of its localized and finite responses across the entire range of the real x-axis [2]. The SVM workflow is given by the following algorithm [16].

SVM Algorithm

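Input: $D = \{(x_1, y_1), (x_2, y_2), \ldots, (x_l, y_l)\}$, $x_i \in R^n$, $y_i \in \{-1, +1\}$

Define: $w$, $b$, $\alpha_i$, where $w$ is the weight vector, $b$ is the bias, $\alpha_i$ is the Lagrange multiplier of the $i$-th instance, $n$ is the number of attributes and $l$ is the number of instances.

Solve: $L_D = \sum_{i=1}^{l} \alpha_i - \frac{1}{2} \sum_{i=1}^{l} \sum_{j=1}^{l} \alpha_i \alpha_j y_i y_j \, (x_i \cdot x_j)$, where $L_D$ is the dual form. It is maximized subject to $\alpha_i \ge 0$ and $\sum_{i=1}^{l} \alpha_i y_i = 0$ to obtain the $\alpha_i$.

Calculate: $w$ and $b$ are obtained from the instances with $\alpha_i > 0$ (the support vectors): $w = \sum_{i=1}^{l} \alpha_i y_i x_i$, and $b$ follows from $\alpha_i \, (y_i (w \cdot x_i + b) - 1) = 0$.

Classifier: $f(x) = \mathrm{sgn}(w^* \cdot x + b^*)$. If the sign is positive the class is positive; if the sign is negative the class is negative.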
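
The steps of the SVM Algorithm can be illustrated on a toy problem. The following is a minimal sketch, not the implementation used in this paper: the two training points, the multipliers 0.25 and 0.25 (which maximize the dual for this particular pair under the linear kernel), the value of `gamma`, and all function names are assumptions chosen for illustration. It recovers the weight vector and bias from the multipliers and evaluates the classifier in its kernel form, so that an RBF kernel could be substituted once the dual has been re-solved for that kernel.

```python
import math

def linear_kernel(u, v):
    """Plain dot product; corresponds to the linearly separable case."""
    return sum(a * b for a, b in zip(u, v))

def rbf_kernel(u, v, gamma=0.5):
    """RBF kernel exp(-gamma * ||u - v||^2); gamma=0.5 is an assumed value."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * sq_dist)

def weight_vector(alphas, X, y):
    """w = sum_i alpha_i * y_i * x_i (meaningful for the linear kernel only)."""
    n = len(X[0])
    return [sum(a * yi * xi[k] for a, xi, yi in zip(alphas, X, y)) for k in range(n)]

def bias(alphas, X, y, kernel=linear_kernel):
    """Solve y_s * (sum_i alpha_i y_i K(x_i, x_s) + b) = 1 for the first support vector x_s."""
    s = next(i for i, a in enumerate(alphas) if a > 0)
    return y[s] - sum(a * yi * kernel(xi, X[s]) for a, xi, yi in zip(alphas, X, y))

def classify(x, alphas, X, y, b, kernel=linear_kernel):
    """f(x) = sgn(sum_i alpha_i y_i K(x_i, x) + b); returns +1 or -1."""
    score = sum(a * yi * kernel(xi, x) for a, xi, yi in zip(alphas, X, y)) + b
    return 1 if score >= 0 else -1

# Hypothetical toy training set: one positive and one negative instance.
X = [[1.0, 1.0], [-1.0, -1.0]]
y = [1, -1]
alphas = [0.25, 0.25]  # assumed dual solution for the linear kernel on this pair

w = weight_vector(alphas, X, y)
b = bias(alphas, X, y)
label = classify([2.0, 0.0], alphas, X, y, b)
```

In practice the multipliers come from a quadratic programming solver rather than being written down by hand; the sketch only shows how the quantities in the algorithm relate to each other.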
3. PROPOSED INTRUSION DETECTION SYSTEM

SVMs are powerful classifiers that have yielded good results when applied to intrusion detection. They can be applied to data with a large number of features, but their performance improves substantially when the number of features is reduced [19]. The IDS is built on the KDD Cup 99 dataset, which is a benchmark in the area of intrusion detection and security evaluation frameworks. An IDS is generally treated as a classification technique in a machine-learning framework. In the proposed model we add a further phase that reduces the number of features before the classification task is performed. The key objective is to increase the detection rate and reduce the false alarm rate. The model consists of five phases: collection of the raw KDD Cup 99 dataset, pre-processing, feature reduction, parameter selection using SVM, and testing. The proposed model of the IDS is shown in Figure 2.

Collection of Raw KDD Cup dataset -> Pre-Processing the dataset -> Feature Reduction Scheme -> Parameter Selection using SVM -> Testing

Figure 2: Proposed Model of Intrusion Detection System
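
The feature reduction phase relies on factor analysis. As a reminder, and only as a sketch in conventional textbook notation rather than the notation of this paper, the orthogonal factor model expresses each $p$-dimensional record $x$ (here $p = 41$ attributes) through $m < p$ common factors. The symbols $\mu$, $\Lambda$, $f$, $\varepsilon$, $\Psi$ and $\Sigma$ are assumptions introduced for illustration.

```latex
\begin{align*}
x &= \mu + \Lambda f + \varepsilon, \\
\mathrm{E}[f] &= 0, \quad \mathrm{Cov}(f) = I_m, \\
\mathrm{E}[\varepsilon] &= 0, \quad \mathrm{Cov}(\varepsilon) = \Psi \ \text{(diagonal)}, \quad \mathrm{Cov}(f, \varepsilon) = 0, \\
\Sigma &= \mathrm{Cov}(x) = \Lambda \Lambda^{\top} + \Psi .
\end{align*}
```

Here $\Lambda$ is the $p \times m$ matrix of factor loadings and $\Psi$ holds the variance specific to each attribute. Under this reading, the estimated factor scores would replace the original attributes as classifier inputs.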
In this approach we conduct 10-fold cross validation. The dataset is partitioned at random into 10 equal parts in which the classes are represented in approximately the same proportions as in the full dataset. Each part is held out in turn, training is conducted on the remaining 9 parts, and the error rate is computed on the holdout set. The training procedure is thus carried out 10 times on different training sets, and finally the 10 error rates are averaged to give an overall error estimate.
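
The cross validation procedure described here can be sketched as follows. This is an illustrative sketch, not the code used in the experiments: the function names, the round-robin way of stratifying, the fixed `seed`, and the majority-class stand-in for the classifier are all assumptions. It also assumes there are at least as many records as folds.

```python
import random
from collections import defaultdict

def stratified_folds(labels, k=10, seed=0):
    """Split record indices into k folds, keeping class proportions roughly equal."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    offset = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal each class round-robin, continuing where the previous class
        # stopped so that fold sizes stay balanced.
        for pos, idx in enumerate(indices):
            folds[(offset + pos) % k].append(idx)
        offset += len(indices)
    return folds

def cross_validated_error(records, labels, train_fn, predict_fn, k=10, seed=0):
    """Average the hold-out error rate over k train/test rounds."""
    folds = stratified_folds(labels, k, seed)
    error_rates = []
    for held_out in folds:
        held = set(held_out)
        train_idx = [i for i in range(len(labels)) if i not in held]
        model = train_fn([records[i] for i in train_idx],
                         [labels[i] for i in train_idx])
        wrong = sum(predict_fn(model, records[i]) != labels[i] for i in held_out)
        error_rates.append(wrong / len(held_out))
    return sum(error_rates) / k

# Hypothetical usage with a trivial majority-class "classifier".
def train_majority(train_records, train_labels):
    return max(set(train_labels), key=train_labels.count)

def predict_majority(model, record):
    return model

labels = ["normal"] * 80 + ["attack"] * 20
records = list(range(100))
error = cross_validated_error(records, labels, train_majority, predict_majority)
```

Any classifier can be plugged in through `train_fn` and `predict_fn`; the majority-class stand-in is there only to keep the sketch self-contained.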
4. EXPERIMENTS CONDUCTED

4.1. Dataset Description

The Knowledge Discovery and Data Mining (KDD) Cup 99 dataset [18] was used for conducting the experiments and examining the results. It was taken from the Third International Knowledge Discovery and Data Mining Tools Competition. Each connection record in the dataset consists of 41 attributes [2], which include both continuous and discrete variables. There are 22 categories of attacks, which fall into the following four classes: Denial of Service (DOS), Remote to Local (R2L), User to Root (U2R), and Probe. The dataset holds 391458 DOS attack records, 97278 normal records, 4107 Probe attack records, 1126 R2L attack records and 52 U2R attack records [17].
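
The class counts quoted above are heavily imbalanced, which may be worth keeping in mind when reading per-class results for the rare R2L and U2R classes. A small sketch of the arithmetic, using only the counts given in the text (the variable names are illustrative):

```python
# Record counts per class as quoted in the dataset description.
counts = {"DOS": 391458, "Normal": 97278, "Probe": 4107, "R2L": 1126, "U2R": 52}

total = sum(counts.values())
shares = {name: 100.0 * n / total for name, n in counts.items()}

# Print classes from most to least frequent with their share of all records.
for name, pct in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{name:<7}{counts[name]:>8}  {pct:7.3f}%")
```

The counts sum to 494021 records, of which DOS accounts for roughly 79 percent and U2R for about 0.01 percent.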
We have performed three types of experiments.
1. In the first experiment, SVM was applied to a dataset of 14027 records with no feature selection, i.e. using all 41 attributes.
2. In the second experiment we have applied Principal
TABLE II: FALSE ALARM RATE OBTAINED

The detection rates and false alarm rates of the three experiments (SVM, PCA+SVM, FA+SVM) are depicted in the charts in Figure 4 and Figure 5 so that the results can be evaluated precisely.

Figure 4: Comparison of Performance Results: Detection Rate (bar chart of SVM, PCA+SVM and FA+SVM across the traffic classes; vertical axis 0 to 100)

Figure 5: Performance of Existing Techniques and Proposed Technique: False Alarm Rate (bar chart of SVM, PCA+SVM and FA+SVM for Normal, DOS, Probe, R2L and U2R; vertical axis 0 to 20)
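
The paper's own definitions of detection rate and false alarm rate are not reproduced in this text. The following minimal sketch uses the commonly used definitions, which are an assumption here: detection rate = TP / (TP + FN) and false alarm rate = FP / (FP + TN), computed for one class against all others. The function name and the example labels are hypothetical.

```python
def detection_and_false_alarm_rate(actual, predicted, target):
    """Return (detection rate, false alarm rate) in percent for one class, one-vs-rest."""
    pairs = list(zip(actual, predicted))
    tp = sum(a == target and p == target for a, p in pairs)  # attacks caught
    fn = sum(a == target and p != target for a, p in pairs)  # attacks missed
    fp = sum(a != target and p == target for a, p in pairs)  # false alarms
    tn = sum(a != target and p != target for a, p in pairs)  # correctly ignored
    dr = 100.0 * tp / (tp + fn) if tp + fn else 0.0
    far = 100.0 * fp / (fp + tn) if fp + tn else 0.0
    return dr, far

# Hypothetical labels for ten connection records.
actual = ["dos", "dos", "dos", "dos", "normal",
          "normal", "normal", "normal", "probe", "probe"]
predicted = ["dos", "dos", "dos", "normal", "normal",
             "normal", "dos", "normal", "probe", "normal"]

dr, far = detection_and_false_alarm_rate(actual, predicted, "dos")
```

For the hypothetical labels, three of the four DOS records are detected and one of the six non-DOS records raises a false alarm.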
5. CONCLUSION

The main goal of factor analysis is to reduce high-dimensional data, which is advantageous when the dataset being processed is large and has many feature variables. Designing an efficient Intrusion Detection System calls for dimension reduction, and the FA-SVM algorithm is well suited to detecting intrusive behavior. The results obtained in this study showed better accuracy and lower computation time. Dimensionality reduction techniques therefore deserve attention for improving and building proficient Intrusion Detection Systems (IDSs). Future research will explore variations and refinements of the proposed method to achieve enhanced performance and automation by developing classifiers that are more accurate for the detection of attacks.

REFERENCES

[1] Ghosh A. K., Learning Program Behavior Profiles for Intrusion Detection, USENIX, 1999.
[2] Mukkamala S., Janoski G., Sung A. H., Intrusion Detection Using Neural Networks and Support Vector Machines, Proceedings of IEEE International Joint Conference on Neural Networks, 2002, pp. 1702-1707.
[3] H. Debar, M. Dacier and A. Wespi, Towards a taxonomy of intrusion-detection systems, Computer Networks, vol. 31, pp. 805-822, 1999.
[4] Wun-Hwa Chen, Sheng-Hsun Hsu, Application of SVM and ANN for intrusion detection, Computers & Operations Research, Elsevier, 2005.
[5] Rupali Datti, Bhupendra Verma, Feature Reduction for Intrusion Detection Using Linear Discriminant Analysis, International Journal on Computer Science and Engineering (IJCSE), Vol. 02, No. 04, 2010, pp. 1072-1078.
[6] Andrew Sung, S. Mukkamala, Feature Selection for Intrusion Detection using Neural Networks and Support Vector Machines, Transportation Research Record: Journal of the Transportation Research Board 1822.1, 2003, pp. 33-39.
[7] Ravi Kiran Varma, V. Valli Kumari, Feature Optimization and Performance Improvement of a Multiclass Intrusion Detection System using PCA and ANN, International Journal of Computer Applications (0975-8887), Vol. 44, No. 13, April 2012.
[8] Safaa Zaman and Fakhri Karray, Features Selection for Intrusion Detection Systems Based on Support Vector Machines, 6th IEEE Consumer Communications and Networking Conference (CCNC 2009), 2009.
[9] Gopi K. Kuchimanchi, Vir V. Phoha, Kiran S. Balagani, Shekhar R. Gaddam, Dimension Reduction Using Feature Extraction Methods for Real-time Misuse Detection Systems, Proceedings of the 2004 IEEE Workshop on Information Assurance and Security, United States Military Academy, West Point, NY, June 2004.
[10] Heba F. Eid, Ashraf Darwish, Aboul Ella Hassanien, and Ajith Abraham, Principle Components Analysis and Support Vector Machine based Intrusion Detection System, ISDA 2010, pp. 363-367.
[11] Zhang Xue-qin, Gu Chun-hua and Lin Jia-jun, Intrusion Detection System Based On Feature Selection And Support Vector Machine, IEEE, 2006.
[12] Srilatha Chebrolu, Ajith Abraham, and Johnson P. Thomas, Hybrid Feature Selection for Modeling Intrusion Detection Systems, Springer, 2004, pp. 1020-1025.
[13] Vapnik V., The Nature of Statistical Learning Theory, Springer-Verlag, New York, 1995.
[14] Cortes C., Vapnik V., Support vector networks, Machine Learning, vol. 20, pp. 273-297, 1995.
[15] Boser, Guyon, and Vapnik, A training algorithm for optimal margin classifiers, Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144-152, 1992.
[16] P Indira Priyadarsini, Nagaraju Devarakonda, I Ramesh Babu, A Chock-Full Survey on Support Vector Machines, International Journal of Computer Science and Software Engineering, Vol. 3, Issue 10, 2013.
[17] P Indira Priyadarsini, I Ramesh Babu, Modeling Intrusion Detection System based on Generalized Discriminant Analysis and Support Vector Machines, International Conference on Recent Trends in Engineering and Technology Sciences, 2014, pp. 8-12.
[18] Mahbod Tavallaee, Ebrahim Bagheri, Wei Lu, and Ali A. Ghorbani, A Detailed Analysis of the KDD CUP 99 Data Set, Proceedings of the 2009 IEEE Symposium on Computational Intelligence in Security and Defense Applications (CISDA 2009).
[19] Iftikhar Ahmad, Muhammad Hussain, Abdullah Alghamdi, Abdulhameed Alelaiwi, Enhancing SVM performance in intrusion detection using optimal feature subset selection based on genetic principal components, Springer, 2012.
[20] Platt, J., Fast training of SVMs using sequential minimal optimization, in Advances in Kernel Methods: Support Vector Learning, MIT Press, 1999, pp. 185-208.
[21] Chang, C.C., Lin, C.J., Libsvm: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
[22] Yu, H., Yang, J., Han, J., Classifying large data sets using SVM with hierarchical clusters, in SIGKDD, 2003, pp. 306-315.
[23] Lebrun, G., Charrier, C., Cardot, H., SVM training time reduction using vector quantization, in ICPR, Volume 1, 2004, pp. 160-163.
[24] Nitin Khosla, Dimensionality reduction using factor analysis, Masters Thesis, http://researchhub.griffith.edu.au/display/n26993f96c6bc6146d5444ea116009424, 2006.
[25] R. A. Johnson and D. W. Wichern, Applied Multivariate Statistical Analysis, Prentice Hall, New Jersey, 1998.
[26] M. Hall, et al., The WEKA data mining software: an update, ACM SIGKDD Explorations Newsletter, vol. 11, pp. 10-18, 2009.
[27] http://www.cs.cmu.edu/~pmuthuku/mlsp_page/lectures/slides/JFA_presentation_final.pdf

Detection rate (%) by class:

Method	Normal	DOS	Probe	R2L	U2R
SVM	93.5	79.4	77.7	9.4	9.6
PCA+SVM	95.2	84.5	84.4	16.4	17.3
FA+SVM	96.7	93.8	95.1	35	25

False alarm rate (%) by class (Table II):

Method	Normal	DOS	Probe	R2L	U2R
SVM	19.3	9.0	1.24	0.91	0.09
PCA+SVM	13.4	7.3	2.5	0.3	0.02
FA+SVM	6.03	5.5	0.9	0.15	0
