Towards the Compilation of Linguistic Linguistic Resources for Digital Technologies of Basque
EUSKOR
Duration:
10.06.2024 - 30.09.2025
ChatGPT Plus
The EUSKORPUS project aims to ensure the future of the Basque language in an increasingly digital society by promoting its presence and functionality in artificial intelligence and digital environments. Led by the Basque Government, the project seeks to compile and structure a large digital corpus of Basque linguistic data—including written texts, spoken language, conversations, technical documents, and colloquial expressions—that will serve as a foundation for training language models and automated systems. This will enable the development of multilingual AI models capable of understanding, generating, and translating Basque, ensuring the language is not left behind in the current technological revolution. The project also plans to release linguistic resources and models as open-source tools, making them accessible to companies, developers, public institutions, and citizens. The goal is for Basque to be fully integrated into applications such as virtual assistants, chatbots, phone support systems, digital public services, automatic translators, and smart devices. Additionally, EUSKORPUS fosters public-private collaboration, involving universities, tech companies, media, research centers, and cultural agents of the Basque language. With initial funding exceeding 5.5 million euros through 2026, the project also aims to position the Basque Country as a European leader in language technology, guaranteeing the right of Basque citizens to live their digital lives in their own language. Ultimately, EUSKORPUS is not only a technological initiative but also a political, cultural, and social commitment to the vitality, equality, and survival of Basque in the digital age.
Looking for support for your next project? Contact us, we are looking forward to helping you.