Spivavtor: An Instruction Tuned Ukrainian Text Editing Model

Document detail

ID

oai:arXiv.org:2404.18880

Topic

Computer Science - Computation and...

Author

Saini, Aman Chernodub, Artem Raheja, Vipul Kulkarni, Vivek

Year

2024

listing date

5/1/2024

Keywords

ukrainian editing spivavtor

Metrics

Abstract

We introduce Spivavtor, a dataset, and instruction-tuned models for text editing focused on the Ukrainian language.

Spivavtor is the Ukrainian-focused adaptation of the English-only CoEdIT model.

Similar to CoEdIT, Spivavtor performs text editing tasks by following instructions in Ukrainian.

This paper describes the details of the Spivavtor-Instruct dataset and Spivavtor models.

We evaluate Spivavtor on a variety of text editing tasks in Ukrainian, such as Grammatical Error Correction (GEC), Text Simplification, Coherence, and Paraphrasing, and demonstrate its superior performance on all of them.

We publicly release our best-performing models and data as resources to the community to advance further research in this space.

;Comment: Accepted to UNLP Workshop 2024

Saini, Aman,Chernodub, Artem,Raheja, Vipul,Kulkarni, Vivek, 2024, Spivavtor: An Instruction Tuned Ukrainian Text Editing Model

Document

Open

Source

Articles recommended by ES/IODE AI

Computer Science