Distillation of Large Language Models for Text Simplification
DOI:
https://doi.org/10.31713/MCIT.2023.071Abstract
This work presents a comprehensive methodology for harnessing the capabilities of Large Language Models to address specific Natural Language Processing tasks, with a focus on Text Simplification. While LLMs have demonstrated their prowess in tackling a wide range of NLP challenges, their demanding computational requirements can render them impractical for real-time online inference. In response to this limitation, we suggest the concept of text distillation, a technique aimed at effectively transferring the knowledge stored within LLMs to more compact and computationally efficient neural networks.
Downloads
Published
2023-11-22
How to Cite
Скуржанський, О. (2023). Distillation of Large Language Models for Text Simplification. Modeling, Control and Information Technologies: Proceedings of International Scientific and Practical Conference, (6), 230–231. https://doi.org/10.31713/MCIT.2023.071
Issue
Section
Internet of Things (IoT) and artificial intelligence