Automated text summarization of scientific documents

dc.contributor.authorAkmal Jahan, M. A. C.
dc.contributor.authorGunathilaka, R. D. R. M.
dc.date.accessioned2022-12-06T11:13:12Z
dc.date.available2022-12-06T11:13:12Z
dc.date.issued2022-11-15
dc.description.abstractText summarization plays a major role in natural language processing, especially in scientific communities like researchers, students, and so on. Due to the number of scientific publications available online rapidly rising, it takes too much time to identify the most appropriate, quality, and relevant materials for their search out of thousands. Therefore, there should be an alternative way to sort out and simplify the search and get a quality and appropriate document based on our search. The aim of this work is to generate an online platform for a digital library that provides a good summary of any scientific document which is subscribed to by the library of the institution. Therefore, we need to find an appropriate and best suitable text summarization algorithm out of some state-of-the-art text processing algorithms such as the Text Rank algorithm, TF-IDF algorithm, and K-Means algorithm, which have been used in different text processing scenarios. To evaluate and select the best suitable algorithm, we used a publicly available scientific dataset and manually generated a summary from the dataset. From the experiments processed, the Text Rank algorithm performed better than the other algorithms.en_US
dc.identifier.citation11th Annual Science Research Sessions 2022 (ASRS-2022) Proceedings on "“Scientific Engagement for Sustainable Futuristic Innovations”. 15th November 2022. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai, Sri Lanka. pp. 24.en_US
dc.identifier.isbn978-624-5736-60-7
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/6355
dc.language.isoen_USen_US
dc.publisherFaculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.en_US
dc.subjectText Summarizationen_US
dc.subjectText-Ranken_US
dc.subjectTF-IDFen_US
dc.subjectK-Meansen_US
dc.titleAutomated text summarization of scientific documentsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Computer Sc 6.pdf
Size:
410.18 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: