Publication:
An efficient data transformation technique for web log

Loading...
Thumbnail Image

Date

2017

Journal Title

Journal ISSN

Volume Title

Publisher

Newswood Limited

Research Projects

Organizational Units

Journal Issue

Abstract

Growth of data over time especially in term of volume, velocity, value, veracity and variety led to many challenges especially in extracting useful information from it. Furthermore, managing and transforming raw data into a readable format is crucial for subsequent analysis. Therefore, this paper presents a new web server log file classification and an efficient way of transforming raw web log files by using knowledge database discovery (KDD) technique into a readable format for data mining analysis. An experiment was conducted to the raw web log files, in a controlled lab environment, by using KDD technique and k-nearest neighbor (IBk) algorithm. Based on the experiment conducted, the IBk algorithm generates 99.66% for true positive rate (TPR) and 0.34% for false positive rate (FPR) which indicates the significant efficiency of the new web log file classification and data transformation technique used in this paper.

Description

Keywords

Big Data, Data transformation, Knowledge database discovery, Log analysis, Big data, Blogs, Classification (of information), Metadata, Nearest neighbor search, Data transformation, False positive rates, K-nearest neighbors, Knowledge database, Log analysis, True positive rates, Web log file, Web server logs, Data mining

Citation

Collections