Source of text

The corpus records the source of each text: place and year of publication, publishing house, and URL.

The full metadata of a text can be viewed in the search results. To see it, click the line with the link to the source of a quotation.

Full information may not be available for every text.

The majority of texts come from printed sources. There are also smaller subcorpora of Internet texts (news, Facebook), visual media texts (translations and subtitles for movies and TV shows) and family correspondence. The source type can be specified on the search page with the attribute DOC.SOURCE. Abbreviations:

PRI — printed source

WEB — Internet

TEL — TV

FAM — family archive

REC — transcribed audio recording
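
As a rough illustration of how these codes might be handled in a script, the sketch below keeps the abbreviations in a lookup table and formats a filter string in the doc.source="..." form that appears in this text. The names SOURCE_TYPES and source_filter are hypothetical and are not part of the corpus interface; the exact way the filter is entered on the search page may differ.

```python
# Source-type codes and descriptions copied from the list above.
SOURCE_TYPES = {
    "PRI": "printed source",
    "WEB": "Internet",
    "TEL": "TV",
    "FAM": "family archive",
    "REC": "transcribed audio recording",
}


def source_filter(code: str) -> str:
    """Return a filter string such as doc.source="REC".

    Hypothetical helper: it only validates the code and formats the string.
    """
    code = code.upper()
    if code not in SOURCE_TYPES:
        raise ValueError(f"unknown source type: {code!r}")
    return f'doc.source="{code}"'


print(source_filter("rec"))  # doc.source="REC"
```

Validating against the table first means a mistyped code fails immediately instead of silently producing a filter that matches nothing.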

The following symbols are used in the transcripts (doc.source="REC"):

In transcription

Meaning

pause in speech

---

unintelligible fragment

a foreign language fragment longer than a sentence

сказа-

broken-off word: one hyphen is placed at the end

е-е, мм, ну, ну-у, а-а, о-о, угу

typical pause fillers

з-здається, бага-а-то, дале-е-ко, або-о-о, мене-е-е

an elongated sound inside or at the end of a word

гм, гм-м, ай-яй-яй, альо, апхчи, ау, бр-р, бу-бу-бу, вау, хе-хе-хе

interjections (including repeated ones)

вєлік, якшо, всьо, тіпа, камон, нічо, нє-а, ну да ладно

non-standard pronunciation or surzhyk: transcribed as it sounds. If a Russian or other foreign word is pronounced in the Ukrainian way (for example, мій вєлік, всі пеньонзи), it is written in Ukrainian.

який

standard spelling of the Ukrainian word, regardless of minor peculiarities of individual pronunciation ([йakий] or [йakиій])

обезопасить, oh yes, good job, coś takiego, Gebühr

If a Russian or other foreign word is pronounced as in its own language, it is written in that language.

<expletive>

obscenity

🙂

laughter (when it is audible)

🤝

agreement (the speaker nods or otherwise expresses agreement without saying anything)

🙁

disagreement (the speaker shakes their head or otherwise expresses disagreement without saying anything)


🙃

false start: the speaker says one or more words, then breaks off the phrase and starts a new one at that point. The symbol is also used when the first word of a phrase is repeated (for example, ці 🙃 ці люди…)
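
The conventions above could be applied programmatically along the following lines. This is a minimal sketch, not the corpus's own tooling: the names MARKERS, classify and annotate are hypothetical, the three face and handshake emoji are assumed to map to laughter, agreement and disagreement in the order they are listed, and markers whose symbol is not unambiguous in this text are left out.

```python
import re

# Marker-to-meaning table based on the conventions above.
# Assumption: emoji meanings follow the order in which they are listed.
MARKERS = {
    "<expletive>": "obscenity",
    "🙂": "laughter",
    "🤝": "agreement",
    "🙁": "disagreement",
    "🙃": "false start",
}

# A broken-off word: word characters followed by exactly one final hyphen.
BROKEN_WORD = re.compile(r"\w+-")


def classify(token: str) -> str:
    """Label one whitespace-separated transcript token."""
    if token in MARKERS:
        return MARKERS[token]
    if BROKEN_WORD.fullmatch(token):
        return "broken word"
    return "other"  # ordinary word or a marker not covered by this sketch


def annotate(line: str) -> list[tuple[str, str]]:
    """Split a transcript line on whitespace and label each token."""
    return [(tok, classify(tok)) for tok in line.split()]


print(annotate("ці 🙃 ці люди сказа-"))
```

Elongated sounds and fillers such as бага-а-то or ну-у fall into "other" here, because their hyphens are not at the very end of the token.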
