Publication:
Query Translation for Multilingual Content with Semantic Technique

Research Projects

Organizational Units

Journal Issue

Abstract

Cross-lingual information retrieval (CLLR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query document, or hybrid query-document. However, query translation is very challenging due to the polysemy problem. Different linguistic nature of the languages will lead to ambiguity of meaning subsequently user's true intention could be misinterpreted. This paper presents a semantic technique on query translation for a multilingual knowledge repository to improve the query processing. Offline translated documents or parallel corpora in English, Arabic, and Malay language including Jawi text was used as the data. Set of keywords were constructed preidentified by expert related to prophetic, food. These keywords were annotated with the relevant Quranic verses, Hadith texts, Manuscript text images and scientific article determined by expert. The synonym and context-based translation was annotated together with the specific keyword. A query will do a three-way pattern match based on the keyword indexing list that link to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept using sources are from al-Quran, Hadith, classical manuscript, and scientific articles verified by experts to ensure the content authenticity and integrity.

Description

Volume 49 No: 9

Keywords

Cross lingual information retrieval, one stop knowledge repository, prophetic food, query translation, semantic technique

Citation