Popular repositories Loading
-
-
lm-evaluation-harness
lm-evaluation-harness PublicForked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python
-
Repositories
Showing 8 of 8 repositories
- MERA_MULTIMODAL Public
MERA-Evaluation/MERA_MULTIMODAL’s past year of commit activity - repotest Public
MERA-Evaluation/repotest’s past year of commit activity - MERA_CODE Public
MERA Code — the first comprehensive open benchmark for evaluating large language models (LLMs) in applied programming tasks in Russian.
MERA-Evaluation/MERA_CODE’s past year of commit activity - MERA Public
MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA models.
MERA-Evaluation/MERA’s past year of commit activity - lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
MERA-Evaluation/lm-evaluation-harness’s past year of commit activity - demo-swe-mera Public
MERA-Evaluation/demo-swe-mera’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…