Please use this identifier to cite or link to this item: http://repository.futminna.edu.ng:8080/jspui/handle/123456789/18898
Full metadata record
DC FieldValueLanguage
dc.contributor.authorUgwuoke, Uchenna Cosmas-
dc.contributor.authorAminu, Enesi Femi-
dc.contributor.authorEkundayo, Ayobami-
dc.date.accessioned2023-05-12T20:46:41Z-
dc.date.available2023-05-12T20:46:41Z-
dc.date.issued2022-10-
dc.identifier.issnELSEVIER-SSRN - ISSN-1556-5068-
dc.identifier.urihttp://repository.futminna.edu.ng:8080/jspui/handle/123456789/18898-
dc.descriptionProceedings of International Conference on Information systems and Emerging Technologies, 2022.en_US
dc.description.abstractIn natural language processing, text classification forms an essential task to be performed; as such, the use of machine learning algorithms have constantly become indispensable and significance to the research drive. However, the problem of solving text classification with the traditional models gets more challenging because of ambiguities associated with natural languages. A typical example is synonyms’ concept mismatch, and other related issues that accurately attribute text to their related contexts. While a more robust model with an increased number of hidden layers such as LSTM is essential, because of the volume of data involved; exploration of strategies for data augmentation is highly significant. To this end, this research aims to employs semantic lexical database, called WordNet as strategy to augment the BBC news textual data obtained from kaggle repository. This is to pave way for a more efficient news data classification based on the proposed LSTM model. The total BBC news samples are 2,225 data points, and each data point is grouped into five different news categories, which include, technology news, business news, sport news, entertainment news, and political news. Experimental evaluations are carried out using the benchmark BBC news dataset; and the newly augmented dataset within the scope of this study. Consequently, the accuracy of the classification LSTM model for original news dataset and the augmented dataset are 90% and 95% respectively. Therefore, the proposed data augmentation strategy is promising for textual datasets.en_US
dc.language.isoenen_US
dc.publisherELSEVIER-SSRNen_US
dc.relation.ispartofseriesISSN-1556-5068;-
dc.subjectData augmentationen_US
dc.subjectWordNeten_US
dc.subjectBBC news dataen_US
dc.subjectLSTM modelen_US
dc.titlePerforming Data Augmentation Experiment to Enhance Model Accuracy: A Case Study of BBC News’ Dataen_US
dc.typeArticleen_US
Appears in Collections:Computer Science

Files in This Item:
File Description SizeFormat 
BBC.pdfPerforming Data Augmentation Experiment to Enhance Model Accuracy561.68 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.