In today's information age, vast amounts of data are readily available. While this abundance holds immense value, extracting key insights and essential information from large volumes of text can be a significant time investment. News articles, in particular, often contain a wealth of information, but reading them in their entirety can be cumbersome. Ideally, a solution could exist to condense these texts and efficiently extract the most important details.
Fortunately, numerous text summarization techniques have emerged. These methods range from basic approaches, relying on simpler logic for summarizing text, to more advanced methods utilizing complex architectures and pre-trained models. Each technique undoubtedly offers unique advantages and limitations.
This paper aims to explore the capabilities of several prominent summarization techniques. Specifically, we will investigate the effectiveness of Luhn summarization, BART, Pegasus, and Claude LLM in summarizing news articles. By utilizing open-source news article resources, this research will evaluate the strengths and weaknesses of each method in extracting crucial information from news content. To achieve these next task must be done:
• Provide a text summarization model for each technique that will be analyzed.
• Analyze the performance of each text summarization model using the results of each specific metric.
• Examine by comparing the outcome metrics obtained from each of the summarization models.
The objective of this bachelor thesis paper is to explore the capabilities of Luhn Summarization, BART, Pegasus, and Claude LLM using open-source news articles. The specific objectives are to provide a text summarization model or technique, to then obtain the evaluation metrics scores of each of the models or techniques, and to conclude by comparing and concluding the result of each of the evaluation metrics of the text summarization model or technique.
The techniques, technologies, and tools mentioned in this paper can be leveraged to great effect when exploring text summarization models and techniques for generating summaries of news articles.
This paper successfully explores the capabilities of Luhn Summarization, BART, Pegasus, and Claude LLM in generating summaries for news articles. It concludes that Pegasus is the most suitable text summarization model or technique compared to the other three for generating summaries for news articles.