Cited time in webofscience Cited time in scopus

Urdu Named Entity Recognition with Attention Bi-LSTM-CRF Model

Title
Urdu Named Entity Recognition with Attention Bi-LSTM-CRF Model
Author(s)
Ullah, FidaUllah, IhsanKolesnikova, Olga
Issued Date
2022-10-27
Citation
21st Mexican International Conference on Artificial Intelligence, MICAI 2022, pp.3 - 17
Type
Conference Paper
ISBN
9783031194955
ISSN
0302-9743
Abstract
The named entity recognition (NER) task is a challenging problem in natural language processing (NLP), especially for languages with very few annotated corpora such as Urdu. In this paper we proposed an Attention-Bi-LSTM-CRF method and applied it to the MK-PUCIT Corpus which is the latest NER dataset available for the Urdu language. In addition to word-level embedding, we used an embedding-level focus mechanism. The output of the embedding layer was fed into a bidirectional-LSTM encoder unit, accompanied by another self-attention layer to boost the system’s accuracy. Our Attention-Bi-LSTM-CRF model demonstrated an F1-score of 92%. The cumulative findings of the experiments show that our approach outperforms existing methods, thus yielding a new UNER (Urdu Named Entity Recognition) state-of-the-art performance. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
URI
http://hdl.handle.net/20.500.11750/46788
DOI
10.1007/978-3-031-19496-2_1
Publisher
Springer Science and Business Media Deutschland GmbH
Files in This Item:

There are no files associated with this item.

Appears in Collections:
ETC 2. Conference Papers

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE