Improved Stemming Methods for Arabic Language for Enhanced Search Engine Efficiency
Keywords:
Natural language processing, Arabic information retrieval systems, Arabic morphology, Light Stemming algorithm, Arabic Stemming algorithms, Rule based stemming, Indexing, Word segmentation.Abstract
Year after year, many methods are being published to overcome the Arabic stem problem for successful retrieval of documents. Therefore, this research present a novel method to extracting Arabic stem. In our method we investigate Arabic morphology features. The main goal and advantage of our approach is to generate/extract stem by applying a set of rules and matches the relationship between some Arabic letters to find the root/stem of the respective words in order to uses as indexing term for the text searching in Arabic retrieval systems. Consequently, our method can be considered to operate around minimum morphological complexity, and also solve problems of conjunctions in Arabic such as prepositions and stopword that are linked directly to the word.
Indeed, these tasks are very hard and require an understanding of the meaning of a text and the ability to reason over relevant facts. Using only supporting facts. Thus, we have been tested our method using the EveTAR (2016) dataset on Arabic tweets and the obtained results show that our method results outperform the state-of-the-art results. Therefore, our method has been able to improve performance of Arabic stem and increases retrieval as well as being active against any type of stem and we believe that it’s difficult to develop new Arabic system retrieval method without uses a good morphology analysis support it.
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
-
Copyright Retention: Authors retain copyright and grant the journal right of first publication.
-
Licensing: The work is simultaneously licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
-
Third-Party Rights: This license allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. Commercial use of the work is not permitted without explicit permission.
-
Self-Archiving: Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) subsequent to publication, as it can lead to productive exchanges, as well as earlier and greater citation of published work.




