Repository logo
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Српски
  • Yкраї́нська
  • Log In
    New user? Click here to register.Have you forgotten your password?
Repository logo
    Communities & Collections
    Research Outputs
    Fundings & Projects
    People
    Statistics
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Српски
  • Yкраї́нська
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Staff Publications
  3. Scopus
  4. A Hybrid Approach For Web Search Result Clustering Based On Genetic Algorithm With K-means
 
  • Details
Options

A Hybrid Approach For Web Search Result Clustering Based On Genetic Algorithm With K-means

Journal
Journal of Theoretical and Applied Information Technology
Date Issued
2021-06-15
Author(s)
Norita Md Norwawi
Bourair Al-attar
Ahmed J. Allami
Ali Thoulfikar A. Imeer
Yusor Fadhil Alasadi
Hawraa M. Kadhim
Abstract
Nowadays, search engines tend to use the latest technologies in enhancing the personalization of web
searches, which leads to a better understanding of user needs. One of these technologies is web search results
clustering which returns meaningful labeled clusters from a set of Web snippets retrieved from any Web
search engine for a given user’s query. Search result clustering aims to improve searching for information
from the potentially huge amount of search results. These search results consist of URLs, titles, and snippets
(descriptions or summaries) of web pages. Dealing with search results is considered as treating large-scale
data, which indeed has a significant impact on effectiveness and efficiency. However, unlike traditional text
mining, queries and snippets tend to be shorter which leads to more ambiguity. K-means tend to converge to
local optima and depend on the initial value of cluster centers. In the past, many heuristic algorithms have
been introduced to overcome this local optima problem. Nevertheless, these algorithms suffer several
shortcomings. In this paper, we present an efficient hybrid web search results clustering algorithm referred
to as G-K-M, whereby, we combine K-means with a modified genetic algorithm. The AOL standard dataset
is used for evaluating web data log clustering. ODP-239 and MORESQUE are used as the main gold standards
for the evaluation of search results clustering algorithms. The experimental results show that the proposed
approach demonstrates its significant advantages over traditional clustering. Besides, results show that
proposed methods are promising approaches that can make search results more understandable to the users
and yield promising benefits in terms of personalization.
File(s)
Loading...
Thumbnail Image
Name

A Hybrid Approach For Web Search Result Clustering Based On Genetic Algorithm With K-means.pdf

Size

310.35 KB

Format

Adobe PDF

Checksum

(MD5):0653f53cfc6ffb8ae883fcf381510403

Welcome to SRP

"A platform where you can access full-text research
papers, journal articles, conference papers, book
chapters, and theses by USIM researchers and students.”

Contact:
  • ddms@usim.edu.my
  • 06-798 6206 / 6221
  • USIM Library
Follow Us:
READ MORE Copyright © 2024 Universiti Sains Islam Malaysia