Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/1951
Title: Integrating genetic algorithm with random forest for improving the classification performance of web log data
Authors: Mittal R
Malik V
Singh V
Singh J
Kaur A.
Keywords: Web Mining
Classification
RFGA
Random Forest
Issue Date: 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Abstract: Web mining is an important approach to retrieve and analyse the information from web server log data. In the internet-driven information age, a lot of data is present on the web in many ways and analysing such data using the web mining methods cam result in some novel insights. Such data can be extracted from the server log files and can be preprocessed to be used for various web mining functionalities. In this paper authors used the data from web server log files, preprocessed it and then applied various classification algorithms such as Na�ve bayes,KNN,decision tree,random forest and analysed the results. The best approach was then chosen to further improve the performance of the classifier by integrating it with genetic algorithm. In this context, a hybrid approach, namely RFGA was used integrating Random forest and genetic algorithm on the dataset and the results of different machine learning classifiers were compared with RFGA in terms of the predictive accuracy.
URI: 10.1109/PDGC50313.2020.9315807
http://hdl.handle.net/123456789/1951
Appears in Collections:Conferences

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.