Publication: An efficient data transformation technique for web log
Loading...
Date
2017
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Newswood Limited
Abstract
Growth of data over time especially in term of volume, velocity, value, veracity and variety led to many challenges especially in extracting useful information from it. Furthermore, managing and transforming raw data into a readable format is crucial for subsequent analysis. Therefore, this paper presents a new web server log file classification and an efficient way of transforming raw web log files by using knowledge database discovery (KDD) technique into a readable format for data mining analysis. An experiment was conducted to the raw web log files, in a controlled lab environment, by using KDD technique and k-nearest neighbor (IBk) algorithm. Based on the experiment conducted, the IBk algorithm generates 99.66% for true positive rate (TPR) and 0.34% for false positive rate (FPR) which indicates the significant efficiency of the new web log file classification and data transformation technique used in this paper.
Description
Keywords
Big Data, Data transformation, Knowledge database discovery, Log analysis, Big data, Blogs, Classification (of information), Metadata, Nearest neighbor search, Data transformation, False positive rates, K-nearest neighbors, Knowledge database, Log analysis, True positive rates, Web log file, Web server logs, Data mining