Text Summarization Technique for Punjabi Language Using Neural Networks

authors

JAIN, ARTI
ARORA, ANUJA
YADAV, DIVAKAR
MORATO LARA, JORGE LUIS
KAUR, AMANPREET

published in

International Arab Journal of Information Technology Journal

publication date

November 2021

start page

807

end page

818

issue

6

volume

18

Digital Object Identifier (DOI)

https://doi.org/10.34028/iajit/18/6/8

full text

http://hdl.handle.net/10016/37752

International Standard Serial Number (ISSN)

1683-3198

abstract

In the contemporary world, utilization of digital content has risen exponentially. For example, newspaper and web
articles, status updates, advertisements etc. have become an integral part of our daily routine. Thus, there is a need to build
an automated system to summarize such large documents of text in order to save time and effort. Although, there are
summarizers for languages such as English since the work has started in the 1950s and at present has led it up to a matured
stage but there are several languages that still need special attention such as Punjabi language. The Punjabi language is
highly rich in morphological structure as compared to English and other foreign languages. In this work, we provide three
phase extractive summarization methodology using neural networks. It induces compendious summary of Punjabi single text
document. The methodology incorporates pre-processing phase that cleans the text; processing phase that extracts statistical
and linguistic features; and classification phase. The classification based neural network applies an activation function-
sigmoid and weighted error reduction-gradient descent optimization to generate the resultant output summary. The proposed
summarization system is applied over monolingual Punjabi text corpus from Indian languages corpora initiative phase-II.
The precision, recall and F-measure are achieved as 90.0%, 89.28% an 89.65% respectively which is reasonably good in
comparison to the performance of other existing Indian languages" summarizers.

Text Summarization Technique for Punjabi Language Using Neural Networks Articles

Overview

authors

published in

publication date

start page

end page

issue

volume

Digital Object Identifier (DOI)

full text

International Standard Serial Number (ISSN)

abstract

Classification

subjects

keywords