LiLaH: The linguistic landscape of hate speech on social media (Flemish-Slovene bilateral, 2019-2023)...
The first free speech-to-text system for Croatian (based on the XLS-R model) has been published; the dataset will be published in early 2022...
MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages (EU CEF, 2021-2023)