ParaRook

ParaRook - parallel corpora based on GRAC


German-Ukrainian / Ukrainian-German: 8,875,783 / 7,805,822
English-Ukrainian / Ukrainian-English: 975,969 / 812,126
French-Ukrainian / Ukrainian-French: 901,445 / 708,650
Spanish-Ukrainian / Ukrainian-Spanish: 423,627 / 344,615
Chinese-Ukrainian / Ukrainian-Chinese: 40,101 / 36,011
Persian-Ukrainian / Ukrainian-Persian: 8,495 / 7,410
Japanese-Ukrainian / Ukrainian-Japanese: 7,343 / 5,201


Manual alignment with the program InterText

Morphosyntactic annotation: UDPipe2

SEARCH via NoSketch Engine

Demonstration (Video)

2023-2024


Drawing by Edward Lear


DEVELOPERS

Coordination and technical support of ParaRook: Arsenii Lukashevskyi (National University “Kharkiv Polytechnic Institute”, Department of Intelligent Computer Systems), Maria Shvedova

ParaRook||DE→UK

Collection arrangement and alignment: Maria Shvedova (National University “Kharkiv Polytechnic Institute”, University of Jena)

Also contributed to the text alignment:

Tetiana Sharma (Taras Shevchenko National University of Kyiv)

Students of the National University “Kharkiv Polytechnic Institute”: Oleksandr Yena, Anastasiia Brylova, Ruslan Korotkikh, Mariia Kecheva, Kseniia Mykhailiuk, Uliana Slobodenko, Dina Baidalova, Bohdan Kolesnikov, Anastasiia Tarasova, Maryna Popova, Anastasiia Trotska, Yevheniia Donets, Diana Klymenko, Ilya Dynkin, Lika Tusheva, Oleksii Chornobrovkin.

Students of the online course “Corpus Linguistics” (National University “Kharkiv Polytechnic Institute”, University of Jena, with the support of the DAAD Foundation): Daria Lapina, Daryna Vlasenko, Arsenii Ogurtsov, Oleksandra Karavan, Daria Temna (National Aerospace University “Kharkiv Aviation Institute”), Yuliia Hrytsek, Stanislav Bushel, Anastasiia Savushkina (Admiral Makarov National University of Shipbuilding), Kira Kryzhanovska (Kherson National Technical University), Olga Popova (Borys Grinchenko Kyiv Metropolitan University), Kateryna Osypenko, Iryna Hameza (National University of Kyiv-Mohyla Academy), Vitalina Skalyga (Taras Shevchenko National University of Kyiv), Veronika Pylypenko (Vasyl Stus Donetsk National University).

ParaRook||EN→UK

Collection arrangement: Mykhailo Nazarenko (Taras Shevchenko National University of Kyiv)

Coordination and alignment: Alisa Korolova (Horlivka Institute for Foreign Languages “Donbas State Pedagogical University”), Dmytro Reshetnikov (Kryvyi Rih State Pedagogical University)

Also contributed to the text alignment:

Students of the online course “Corpus Linguistics” (National University “Kharkiv Polytechnic Institute”, University of Jena, with the support of the DAAD Foundation): Valeria Horetlieva, Milana Adut, Tetiana Chaplenko (Kryvyi Rih State Pedagogical University), Anastasiia Voronova (Taras Shevchenko National University of Kyiv), Oleksandra Larionova, Alyona Khmel (Vasyl Stus Donetsk National University), Bohdan Mudriievskyi, Oleksandra Nimets (Ivan Franko Zhytomyr State University), Stanislav Bushel (Admiral Makarov National University of Shipbuilding).

ParaRook||FR→UK

Text alignment:

Students of the National University “Kharkiv Polytechnic Institute”: Olena Zaikina, Tetiana Selezniova, Maria Moloshna, Oleksandra Isaieva, Olha Perova, Kristina Sidnenko, Kateryna Kalashnikova, Yurii Povesma, Dmytro Petrushov, Yelyzaveta Lubenets, Dana Bakanova, Viktoriia Sereda, Alina Schmidt.

ParaRook||ES→UK

Collection arrangement and alignment: Milad Zikran, Sofia Pavlenko (Taras Shevchenko National University of Kyiv)

ParaRook||ZH→UK

Collection arrangement and alignment: Bohdan Sachuk (Taras Shevchenko National University of Kyiv)

ParaRook||FA→UK

Collection arrangement and alignment: Oleksii Kashmarchuk (Ivan Franko National University of Lviv)

ParaRook||JA→UK

Collection arrangement and alignment: Nazar Kotsur, Bohdan Laba (Ivan Franko National University of Lviv)


Publications

Maria Shvedova and Arsenii Lukashevskyi. 2024. Creating Parallel Corpora for Ukrainian: A German-Ukrainian Parallel Corpus (ParaRook||DE-UK). In Proceedings of the Third Ukrainian Natural Language Processing Workshop (UNLP) @ LREC-COLING 2024, pages 14–22, Torino, Italia. ELRA and ICCL.

