Distillation of Large Language Models for Text Simplification

Олександр Скуржанський

doi:10.31713/MCIT.2023.071

Distillation of Large Language Models for Text Simplification

Authors

Олександр Скуржанський КНУ ім. Тараса Шевченка

DOI:

https://doi.org/10.31713/MCIT.2023.071

Abstract

This work presents a comprehensive methodology for harnessing the capabilities of Large Language Models to address specific Natural Language Processing tasks, with a focus on Text Simplification. While LLMs have demonstrated their prowess in tackling a wide range of NLP challenges, their demanding computational requirements can render them impractical for real-time online inference. In response to this limitation, we suggest the concept of text distillation, a technique aimed at effectively transferring the knowledge stored within LLMs to more compact and computationally efficient neural networks.

Downloads

Published

2023-11-22

How to Cite

Скуржанський, О. (2023). Distillation of Large Language Models for Text Simplification. Modeling, Control and Information Technologies: Proceedings of International Scientific and Practical Conference, (6), 230–231. https://doi.org/10.31713/MCIT.2023.071

Download Citation

Issue

No. 6 (2023): Modeling, control and information technologies: Proceedings of VI International scientific and practical conference

Section

Internet of Things (IoT) and artificial intelligence