Document detail
ID

oai:arXiv.org:2404.18880

Topic
Computer Science - Computation and...
Author
Saini, Aman Chernodub, Artem Raheja, Vipul Kulkarni, Vivek
Category

Computer Science

Year

2024

listing date

5/1/2024

Keywords
ukrainian editing spivavtor
Metrics

Abstract

We introduce Spivavtor, a dataset, and instruction-tuned models for text editing focused on the Ukrainian language.

Spivavtor is the Ukrainian-focused adaptation of the English-only CoEdIT model.

Similar to CoEdIT, Spivavtor performs text editing tasks by following instructions in Ukrainian.

This paper describes the details of the Spivavtor-Instruct dataset and Spivavtor models.

We evaluate Spivavtor on a variety of text editing tasks in Ukrainian, such as Grammatical Error Correction (GEC), Text Simplification, Coherence, and Paraphrasing, and demonstrate its superior performance on all of them.

We publicly release our best-performing models and data as resources to the community to advance further research in this space.

;Comment: Accepted to UNLP Workshop 2024

Saini, Aman,Chernodub, Artem,Raheja, Vipul,Kulkarni, Vivek, 2024, Spivavtor: An Instruction Tuned Ukrainian Text Editing Model

Document

Open

Share

Source

Articles recommended by ES/IODE AI

An Updated Overview of Existing Cancer Databases and Identified Needs
advancements insights assess review lipidomics glycomics proteomics databases research cancer