Central and South-East European Resources (CESAR)

Funding institution: European Union

ID: 271022
Closed; international tender; consortium tender

Project Goal: The project aims to map the linguistic resources (corpora of written and spoken language, dictionaries, ontologies) and tools (morphological, syntactic, and other linguistic analyzers) of six Central and South-East European languages (Bulgarian, Croatian, Polish, Hungarian, Serbian, and Slovak), to standardize them, ensure their interoperability, and make them more accessible. The status of Hungarian in this context is detailed in the book “The Hungarian Language in the Digital Age,” published in 2012.

Outcomes: More than 2,000 resources and tools have been added to the META-SHARE repository with appropriate licenses. As a result, researchers in particular can access linguistic resources and tools that were previously hard to reach, which makes them more widely usable not only in language technology but also in industrial applications.

Participating institutions

Institute for Bulgarian Language "Prof. Lyubomir Andreychin"

University of Zagreb, Faculty of Humanities and Social Sciences

Institute of Computer Science, Polish Academy of Sciences

Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics

University of Lodz

Faculty of Mathematics, University of Belgrade

Institute “Mihailo Pupin”

Jazykovedný ústav Ľ. Štúra Slovenskej akadémie vied
