🇨🇿 BenCzechMark – Can your LLM Understand Czech?
The 🇨🇿 BenCzechMark is the first and most comprehensive evaluation suite for assessing the abilities of Large Language Models (LLMs) in the Czech language. It aims to test how well LLMs can: Reason and perform complex tasks in Czech. Generate and verify grammatically and semantically correct Czech. Extract information and store knowledge by answering questions about Czech culture and Czech-related facts. Do what language models were originally trained for—estimate the probability of Czech texts. To achieve this, we’ve sourced 50 tasks […]
Read more