Please use this identifier to cite or link to this item:
http://ir.futminna.edu.ng:8080/jspui/handle/123456789/27527
Title: | Systematic Review on Text Normalization Techniques and its Approach to Non-Standard Words |
Authors: | Aliero, Abubakar Ahmad Bashir, Sulaimon Adebayo Aliyu, Hamzat Olanrewaju Tafida, Amina Gogo Bashar, Umar Kangiwa Nasiru, Muhammad Dankolo |
Keywords: | Text Normalization, Techniques, Method, Approach, Rulebased, Statistical Method, Neural Network, Similarity-based, Context-based |
Issue Date: | Sep-2023 |
Publisher: | International Journal of Computer Applications. |
Citation: | Abubakar Ahmad Aliero, Bashir Sulaimon Adebayo, Hamzat Olanrewaju Aliyu, Amina Gogo Tafida, Bashar Umar Kangiwa, Nasiru Muhammad Dankolo . Systematic Review on Text Normalization Techniques and its Approach to Non-Standard Words. International Journal of Computer Applications. 185, 33 ( Sep 2023), 44-55. DOI=10.5120/ijca2023923106 |
Abstract: | Text normalization is the process of transforming text into a standardized and canonical form. It involves correcting spelling errors, expanding abbreviations, resolving contractions, normalizing punctuation, capitalization, and other linguistic variations to ensure consistent and coherent representations of textual data. The goal of text normalization is to reduce the lexical and orthographic variations in text, making it easier to process, analyze, and understand. It is a critical preprocessing step in many natural language processing (NLP) tasks, such as machine translation, text-to-speech synthesis, sentiment analysis, and information retrieval. Many techniques and approaches have been used for normalizing different kind of text including the User-Generated Content (UGC). This normalization helps to improve the performance of NLP downstream task. This paper provides a broad picture of the state-of-the-art researches in the area of text normalization from 2018 to 2022. About 54 journal and conference papers was selected to identifies and analyzed the trends of the text normalization techniques, approaches and issues in the related field. The use of dataset and evaluation metrics were excluded for future research |
URI: | http://repository.futminna.edu.ng:8080/jspui/handle/123456789/27527 |
Appears in Collections: | Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
aliero-2023-ijca-923106.pdf | 451.29 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.