Urdu word sense disambiguation using machine learning approach

dc.contributor.authorAbid, Muhammad
dc.contributor.authorHabib, Asad
dc.contributor.authorAshraf, Jawad
dc.contributor.authorShahid, Abdul
dc.date.accessioned2024-10-29T15:21:26Z
dc.date.available2024-10-29T15:21:26Z
dc.date.issued2017-06-20
dc.description.abstractThis paper focuses on the word sense disambiguation (WSD) problem in the context of Urdu language. Word sense disambiguation (WSD) is a phenomena for disambiguating the text so that machine (computer) would be capable to deduce correct sense of individual given word(s). WSD is critical for solving natural language engineering (NLE) tasks such as machine translation and speech processing etc. It also increase the performance of other tasks such as text retrieval, document classification and document clustering etc. Research work in WSD has been conducted up to different extents in computationally developed languages of the world. In the context of Urdu language the NLE research in general and the WSD research in particular is still in the infancy stage due to the rich morphological structure of Urdu. In this paper, we use machine learning (ML) approaches such as Bayes net classifier (BN), support vector machine (SVM) and decision tree (DT) for WSD in native script Urdu text. The results shown that BN has better F-measure than SVM and DT. The maximum F-measure of 0.711 over 2.5 million words raw Urdu corpus was recorded for the Bayes net classifier.
dc.funderNo external funder
dc.identifier.citationAbid, M., Habib, A., Ashraf, J. et al. (2018) Urdu word sense disambiguation using machine learning approach. Cluster Computing, 21, pp. 515–522
dc.identifier.doihttps://doi.org/10.1007/s10586-017-0918-0
dc.identifier.issn1386-7857
dc.identifier.issn1573-7543
dc.identifier.urihttps://hdl.handle.net/2086/24418
dc.publisherSpringer
dc.relation.ispartofCluster Computing
dc.titleUrdu word sense disambiguation using machine learning approach
dc.typeArticle
oaire.citation.issue1
oaire.citation.volume21

Files

License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.2 KB
Format:
Item-specific license agreed upon to submission
Description: