Copyright and Licensing
Articles accepted for publication will be licensed under the Creative Commons BY-NC-SA. Authors must sign a non-exclusive distribution agreement after article acceptance.
Automatic text summarization (ATS) is a critical area in natural language processing that aims to condense long documents into concise and meaningful summaries. Manual text summarization is a time-consuming, labor-intensive, and costly process. Researchers have been actively working to enhance ATS techniques, focusing on either extractive, abstractive, or hybrid methods. This review explores different summarization techniques, including extractive and abstract approaches, with a focus on recent developments using machine learning and deep learning models. The paper also discusses the challenges in generating high-quality summaries, including coherence, redundancy, and information loss. Finally, this review provides a comprehensive survey for researchers, presenting the various aspects of ATS, including approaches, taxonomy analysis, datasets, evaluation methods, and future research trends.
Tewari, K., Yadav, A.K., Kumar, M., Yadav, D. (2023). Extractive Text Summarization Using Statistical Approach. In: Tistarelli, M., Dubey, S.R., Singh, S.K., Jiang, X. (eds) Computer Vision and Machine Intelligence. Lecture Notes in Networks and Systems, vol 586. Springer, Singapore. https://doi.org/10.1007/978-981-19-7867-8_52
Wafaa S. El-Kassas, Cherif R. Salama, Ahmed A. Rafea, Hoda K. Mohamed,Automatic text summarization: A comprehensive survey, Expert Systems with Applications,Volume 165,2021,113679,ISSN 0957-4174,https://doi.org/10.1016/j.eswa.2020.113679.
Vilca, G. C. V., & Cabezudo, M. A. S. (2017). A Study of Abstractive Summarization Using Semantic Representations and Discourse Level Information. Paper presented at the 20th International Conference on Text, Speech, and Dialogue, Prague, Czech Republic.
Patel, V., & Tabrizi, N. (2022). An automatic text summarization: a systematic review. Computación Y Sistemas, 26(3). https://doi.org/10.13053/cys-26-3-4347
Mani, I. and Maybury, M.T., Eds. (1999) Advances in Automatic Text Summarization. MIT Press, Cambridge.
Issam, K. a. R., Patel, S., & N, S. C. (2020). Topic Modeling Based Extractive Text Summarization. International Journal of Innovative Technology and Exploring Engineering, 9(6), 1710–1719. https://doi.org/10.35940/ijitee.f4611.049620
Wang, T., Yang, C., Zou, M. et al. A study of extractive summarization of long documents incorporating local topic and hierarchical information. Sci Rep 14, 10140 (2024). https://doi.org/10.1038/s41598-024-60779-z
Nikolaos Gialitsis, Nikiforos Pittaras, and Panagiotis Stamatopoulos. 2019. A topic-based sentence representation for extractive text summarization. In Proceedings of the Workshop MultiLing 2019: Summarization Across Languages, Genres and Sources, pages 26–34, Varna, Bulgaria. INCOMA Ltd.
Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, and Ido Dagan. 2022. Proposition-Level Clustering for Multi-Document Summarization. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1765–1779, Seattle, United States. Association for Computational Linguistics.
Yadav, A.K., Ranvijay, Yadav, R.S. et al. Graph-based extractive text summarization based on single document. Multimed Tools Appl 83, 18987–19013 (2024). https://doi.org/10.1007/s11042-023-16199-8
Waseemullah; Fatima, Z.; Zardari, S.; Fahim, M.; Andleeb Siddiqui, M.; Ibrahim, A.A.A.; Nisar, K.; Naz, L.F. A Novel Approach for Semantic Extractive Text Summarization. Appl. Sci. 2022, 12, 4479. https://doi.org/10.3390/app12094479
S. Mandal, P. Achary, S. Phalke, K. Poorvaja and M. Kulkarni, "Extractive Text Summarization Using Supervised Learning and Natural Language Processing," 2021 International Conference on Intelligent Technologies (CONIT), Hubli, India, 2021, pp. 1-7, doi: 10.1109/CONIT51480.2021.9498322.
Alami Merrouni, Z., Frikh, B. & Ouhbi, B. EXABSUM: a new text summarization approach for generating extractive and abstractive summaries. J Big Data 10, 163 (2023). https://doi.org/10.1186/s40537-023-00836-y
Yadav, A.K., Singh, A., Dhiman, M. et al. Extractive text summarization using deep learning approach. Int. j. inf. tecnol. 14, 2407–2415 (2022). https://doi.org/10.1007/s41870-022-00863-7
Chowdhury, S.R., Sarkar, K. A New Method for Extractive Text Summarization Using Neural Networks. SN COMPUT. SCI. 4, 384 (2023). https://doi.org/10.1007/s42979-023-01806-0
Sharma, B., Katyal, N., Kumar, V., Shivani, Lathwal, A. (2020). Automatic Text Summarization Using Fuzzy Extraction. In: Khanna, A., Gupta, D., Bhattacharyya, S., Snasel, V., Platos, J., Hassanien, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1087. Springer, Singapore. https://doi.org/10.1007/978-981-15-1286-5_33
Tomer, M., Kumar, M., Hashmi, A. et al. Enhancing metaheuristic based extractive text summarization with fuzzy logic. Neural Comput & Applic 35, 9711–9723 (2023). https://doi.org/10.1007/s00521-023-08209-5
Debnath, D., Das, R. & Pakray, P. Single document text summarization addressed with a cat swarm optimization approach. Appl Intell 53, 12268–12287 (2023). https://doi.org/10.1007/s10489-022-04149-0
Jiwei Tan, Xiaojun Wan, and Jianguo Xiao. 2017. Abstractive Document Summarization with a Graph-Based Attentional Neural Model. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1171–1181, Vancouver, Canada. Association for Computational Linguistics.
T. Yu and S. Gao, "Abstractive Text Summarization With Semantic Dependency Graph," 2023 5th International Academic Exchange Conference on Science and Technology Innovation (IAECST), Guangzhou, China, 2023, pp. 567-570, doi: 10.1109/IAECST60924.2023.10502964.
Karuna, G., Akshith, M., Dinesh, P. S., Vardhan, B. V., Bisht, Y. S., & Narsaiah, M. (2023b). Automated Abstractive Text Summarization using Deep Learning. E3S Web of Conferences, 430, 01021. https://doi.org/10.1051/e3sconf/202343001021
P. Gigioli, N. Sagar, A. Rao and J. Voyles, "Domain-Aware Abstractive Text Summarization for Medical Documents," 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Madrid, Spain, 2018, pp. 2338-2343, doi: 10.1109/BIBM.2018.8621539.
Kouris, P., Alexandridis, G. & Stafylopatis, A. Text summarization based on semantic graphs: an abstract meaning representation graph-to-text deep learning approach. J Big Data 11, 95 (2024). https://doi.org/10.1186/s40537-024-00950-5
Hardy, H., Ballesteros, M., Ladhak, F., Khalifa, M., Castelli, V., & McKeown, K. (2022). Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2211.04903
Shyamala, K., & Evangeline, M. M. (2023). A Framework for Extractive Text Summarization of Single Text Document in Tamil Language Using Frequency Based Feature Extraction Technique. In Lecture notes in networks and systems (pp. 155–164). https://doi.org/10.1007/978-981-99-6755-1_12
Zhang, W., Xue, Z., & Zhao, L. (2021). Recent Advances in Tokenization Methods for Natural Language Processing. Computational Linguistics Journal, 11(5), 357-370.
Dong, Y. (2018). A Survey on Neural Network-Based Summarization Methods. arXiv (Cornell University). https://doi.org/10.48550/arxiv.1804.04589
Supriyono, N., Wibawa, A. P., Suyono, N., & Kurniawan, F. (2024). A survey of text summarization: Techniques, evaluation and challenges. Natural Language Processing Journal, 7, 100070. https://doi.org/10.1016/j.nlp.2024.100070
Mendoza, Griselda Areli Matias, Ledeneva, Yulia, and García-Hernández, Rene Arnulfo. ‘Determining the Importance of Sentence Position for Automatic Text Summarization’. 1 Jan. 2020: 2421 – 2431.
M. Jain and H. Rastogi, "Automatic Text Summarization using Soft-Cosine Similarity and Centrality Measures," 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2020, pp. 1021-1028, doi: 10.1109/ICECA49313.2020.9297583.
Cao, M. (2022). A Survey on Neural Abstractive Summarization Methods and Factual Consistency of Summarization. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2204.09519
Hoa Trang Dang, Karolina Owczarzak, et al. 2008. Overview of the TAC 2008 update summarization task. In TAC.
Ani Nenkova and Rebecca J Passonneau. 2004. Evaluating content selection in summarization: The pyramid method. In Proceedings of the human language technology conference of the north american chapter of the association for computational linguistics: Hlt-naacl 2004. 145–152.
Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, and Jiawei Han. 2022. Towards a Unified Multi-Dimensional Evaluator for Text Generation. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2023–2038
Alexander R Fabbri, Wojciech Kryściński, Bryan McCann, Caiming Xiong, Richard Socher, and Dragomir Radev. 2021. Summeval: Re-evaluating summarization evaluation. Transactions of the Association for Computational Linguistics 9 (2021), 391–409.
Chin-Yew Lin and Eduard Hovy. 2003. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. 150–157.
Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q Weinberger, and Yoav Artzi. 2019. BERTScore: Evaluating Text Generation with BERT. In International Conference on Learning Representations.
Wei Zhao, Maxime Peyrard, Fei Liu, Yang Gao, Christian M Meyer, and Steffen Eger. 2019. MoverScore: Text generation evaluating with contextualized embeddings and earth mover distance. arXiv preprint arXiv:1909.02622 (2019).
Elizabeth Clark, Asli Celikyilmaz, and Noah A Smith. 2019. Sentence mover’s similarity: Automatic evaluation for multi-sentence texts. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2748–2760.
Weizhe Yuan, Graham Neubig, and Pengfei Liu. 2021. Bartscore: Evaluating generated text as text generation. Advances in Neural Information Processing Systems 34 (2021), 27263–27277.
Joshua Maynez, Shashi Narayan, Bernd Bohnet, and Ryan McDonald. 2020. On faithfulness and factuality in abstractive summarization. arXiv preprint arXiv:2005.00661 (2020).
Wojciech Kryscinski, Bryan McCann, Caiming Xiong, and Richard Socher. 2020. Evaluating the Factual Consistency of Abstractive Text Summarization. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 9332–9346. https://doi.org/10.18653/v1/2020. emnlp-main.750
Esin Durmus, He He, and Mona Diab. 2020. FEQA: A question answering evaluation framework for faithfulness assessment in abstractive summarization. arXiv preprint arXiv:2005.03754 (2020).
Alex Wang, Kyunghyun Cho, and Mike Lewis. 2020. Asking and answering questions to evaluate the factual consistency of summaries. arXiv preprint arXiv:2004.04228 (2020).
Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang, and Patrick Gallinari. 2021. QuestEval: Summarization Asks for Fact-based Evaluation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 6594–6604
Tanya Goyal, Junyi Jessy Li, and Greg Durrett. 2022. SNaC: Coherence Error Detection for Narrative Summarization. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 444–463.
Maxime Peyrard, Teresa Botschen, and Iryna Gurevych. 2017. Learning to score system summaries for better content selection evaluation.. In Proceedings of the Workshop on New Frontiers in Summarization. 74–84.
Wen Xiao and Giuseppe Carenini. 2020. Systematically Exploring Redundancy Reduction in Summarizing Long Documents. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing. 516–528.
G. R. Sundar, C. S. Nageswari, J. Antony Vijay, S. R. Devi, N. Basker and N. Pushpa, "Machine Learning based Extractive Text Summarization Techniques," 2023 International Conference on Integrated Intelligence and Communication Systems (ICIICS), Kalaburagi, India, 2023, pp. 1-6, doi: 10.1109/ICIICS59993.2023.10421192.
C. Badgujar, V. Jethani and T. Ghorpade, "Abstractive Summarization using Graph Based Methods," 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India, 2018, pp. 803-807, doi: 10.1109/ICICCT.2018.8473315.
Afzal, A., Vladika, J., Braun, D., & Matthes, F. (2023). Challenges in Domain-Specific Abstractive Summarization and How to Overcome Them. -, 682–689. https://doi.org/10.5220/0011744500003393
Gambhir, M., Gupta, V. Improved hybrid text summarization system using deep contextualized embeddings and statistical features. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19524-x
R. Habu, R. Ratnaparkhi, A. Askhedkar and S. Kulkarni, "A Hybrid Extractive-Abstractive Framework with Pre & Post-Processing Techniques To Enhance Text Summarization," 2023 13th International Conference on Advanced Computer Information Technologies (ACIT), Wrocław, Poland, 2023, pp. 529-533, doi: 10.1109/ACIT58437.2023.10275584.
Liu, W., Gao, Y., Li, J., & Yang, Y. (2021). A Combined Extractive With Abstractive Model for Summarization. IEEE Access, 9, 43970–43980. https://doi.org/10.1109/access.2021.3066484
Karanja, J. M., & Matheka, A. (2022). A Hybrid Model for Text Summarization Using Natural Language Processing. Open Journal for Information Technology, 5(2), 65–80. https://doi.org/10.32591/coas.ojit.0502.03065k
Muniraj, P., Sabarmathi, K., Leelavathi, R., & B, S. B. (2023). HNTSumm: Hybrid text summarization of transliterated news articles. International Journal of Intelligent Networks, 4, 53–61. https://doi.org/10.1016/j.ijin.2023.03.001
Copyright (c) 2025 Sara Zayed, Mostafa Ezzat, Hesham A. Hefny

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Articles accepted for publication will be licensed under the Creative Commons BY-NC-SA. Authors must sign a non-exclusive distribution agreement after article acceptance.
You may also start an advanced similarity search for this article.
Review Stats:
Mean Time to First Response: 89 days
Mean Time to Acceptance Response: 114 days
Member of:

ISSN
1666-6038 (Online)
1666-6046 (Print)