Publication:
Query Cost-Reduction For Quranic-Arabic Information Retrieval Using Hexadecimal Conversion Algorithm

Abstract

The digital Quran is a natural-language document that is represented using either Arabic fonts or images of the verses. The Al-Quran contains 18,994 unique words, so the image approach uses a significant amount of memory. However, little work has been done on using machine translation (MT) techniques for Quranic representation. This paper proposes Arabic information retrieval based on keyword search in a hexadecimal representation, using Al-Quran verses as the test case. All Quranic words are transliterated into machine language in binary format after diacritics and duplicates are removed. This machine-language approach to representing the digital Quran reduces storage size by around 47-54% and retrieval time by up to 20%, thereby reducing the query cost for Arabic information retrieval in general.
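The pipeline described in the abstract (remove diacritics, remove duplicates, convert each word to hexadecimal, then search by keyword) can be illustrated with a minimal Python sketch. This is not the paper's algorithm: the function names, the use of UTF-8 bytes as the source of the hexadecimal digits, and the definition of a diacritic as any Unicode nonspacing mark (category `Mn`) are all assumptions made for illustration.

```python
import unicodedata


def strip_diacritics(word: str) -> str:
    """Drop nonspacing marks (Unicode category Mn), which covers Arabic harakat.

    Assumption: the paper's exact diacritic list is not given in the abstract.
    """
    return "".join(ch for ch in word if unicodedata.category(ch) != "Mn")


def to_hex_key(word: str) -> str:
    """Return a hexadecimal key for a word after removing diacritics.

    Assumption: the key is the hex form of the word's UTF-8 bytes.
    """
    return strip_diacritics(word).encode("utf-8").hex()


def build_index(verses: list[str]) -> dict[str, list[int]]:
    """Map each unique hex key to the positions of the verses containing it."""
    index: dict[str, list[int]] = {}
    for position, verse in enumerate(verses):
        # A set removes duplicate words within the same verse.
        for key in {to_hex_key(word) for word in verse.split()}:
            index.setdefault(key, []).append(position)
    return index


def search(index: dict[str, list[int]], keyword: str) -> list[int]:
    """Look up a keyword by its hex key; diacritics in the query are ignored."""
    return index.get(to_hex_key(keyword), [])
```

Because the query keyword passes through the same normalisation as the indexed words, a search with or without diacritics resolves to the same key, and each lookup is a single dictionary access rather than a scan of the verse text.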

Keywords

hexadecimal conversion, Arabic information retrieval, query cost reduction, digital Quran representation
