ETSU Faculty Works

A Deep Neural Network-Based Model for Named Entity Recognition for Hindi Language

Richa Sharma, Banasthali Vidyapith
Sudha Morwal, Banasthali Vidyapith
Basant Agarwal, Council of Indian Institutes of Information Technology
Ramesh Chandra, Norges teknisk-naturvitenskapelige universitet
Mohammad S. Khan, East Tennessee State University

Document Type

Article

Publication Date

10-1-2020

Description

The aim of this work is to develop efficient named entity recognition (NER) for text, which in turn improves the performance of systems that rely on natural language processing (NLP). The performance of IoT-based devices such as Alexa and Cortana depends significantly on an efficient NLP model, and NER tools play an important role in increasing the capability of smart IoT devices to comprehend natural language. In general, NER is a two-step process: proper nouns are first identified in the text and then classified into predefined entity categories such as person, location, measure, organization and time. NER is often performed as a subtask of natural language processing, where it increases the accuracy of the overall NLP task. In this paper, we propose a deep neural network architecture for named entity recognition for the resource-scarce language Hindi, based on a convolutional neural network (CNN), a bidirectional long short-term memory (Bi-LSTM) neural network and a conditional random field (CRF). In the proposed approach, we first use the skip-gram word2vec model and the GloVe model to represent words as semantic vectors, which are then used in different deep neural network-based architectures. We use character- and word-level embeddings to represent the text, which captures information at a fine-grained level. Because of the character-level embeddings, the proposed model is robust to out-of-vocabulary words. Experimental results show that the combination of Bi-LSTM, CNN and CRF performs better than baseline methods such as a recurrent neural network, long short-term memory and Bi-LSTM used individually.
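As a rough illustration of what a CRF layer can contribute on top of per-token scores, the sketch below runs Viterbi decoding over a toy sentence. It is not the authors' implementation: the tag set, the emission scores (which in a trained model would come from the Bi-LSTM) and the hand-set transition penalty (which a CRF would learn) are all assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical BIO tag set; the paper's actual label inventory may differ.
TAGS = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC"]


def transition_score(prev: str, curr: str) -> float:
    """Hand-set stand-in for learned CRF transitions.

    An I-X tag is heavily penalized unless it follows B-X or I-X.
    """
    if curr.startswith("I-"):
        entity = curr[2:]
        if prev not in (f"B-{entity}", f"I-{entity}"):
            return -1e4
    return 0.0


def greedy(emissions: List[Dict[str, float]]) -> List[str]:
    """Pick the best tag for each token independently (no CRF)."""
    return [max(TAGS, key=lambda t: s.get(t, 0.0)) for s in emissions]


def viterbi(emissions: List[Dict[str, float]]) -> List[str]:
    """Return the highest-scoring tag sequence under emissions + transitions."""
    if not emissions:
        return []
    # Treat the position before the sentence as tagged "O".
    best = {t: emissions[0].get(t, 0.0) + transition_score("O", t) for t in TAGS}
    backpointers = []
    for scores in emissions[1:]:
        new_best, pointer = {}, {}
        for curr in TAGS:
            prev = max(TAGS, key=lambda p: best[p] + transition_score(p, curr))
            new_best[curr] = (
                best[prev] + transition_score(prev, curr) + scores.get(curr, 0.0)
            )
            pointer[curr] = prev
        best = new_best
        backpointers.append(pointer)
    # Follow the back-pointers from the best final tag.
    path = [max(TAGS, key=lambda t: best[t])]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path


# Made-up emission scores for the tokens "वह नई दिल्ली गया" ("He went to New Delhi").
EMISSIONS = [
    {"O": 2.0},
    {"O": 1.0, "B-LOC": 0.9},
    {"I-LOC": 2.0, "O": 0.5, "B-LOC": 0.3},
    {"O": 2.0},
]

if __name__ == "__main__":
    print("greedy :", greedy(EMISSIONS))
    print("viterbi:", viterbi(EMISSIONS))
```

With these made-up scores, per-token decoding yields `O O I-LOC O`, which starts an entity with an inside tag, while Viterbi decoding yields `O B-LOC I-LOC O`. This kind of sequence-level consistency is one plausible reason a CRF output layer can help over Bi-LSTM scores alone.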

Citation Information

Sharma, Richa; Morwal, Sudha; Agarwal, Basant; Chandra, Ramesh; and Khan, Mohammad S. 2020. A Deep Neural Network-Based Model for Named Entity Recognition for Hindi Language. Neural Computing and Applications. Vol.32(20). 16191-16203. https://doi.org/10.1007/s00521-020-04881-z ISSN: 0941-0643

Digital Commons @ East Tennessee State University
