CMMLU | GPTtopic

CMMLU is a comprehensive Chinese language assessment benchmark specifically designed to evaluate the knowledge and reasoning ability of language models in Chinese contexts, covering 67 topics from basic disciplines to advanced professional levels. It includes natural sciences that require computation and reasoning, humanities and social sciences that require knowledge, and Chinese driving rules that require common sense of life. In addition, many tasks in CMMLU have Chinese specific answers and may not be universally applicable in other regions or languages. Therefore, it is a completely Chinese language testing benchmark.

data statistics

Relevant Navigation

MMLU

Large scale multitasking language comprehension benchmark

Awesome ChatGPT Prompts

ChatGPT Prompts set

词魂

Word Soul is an AIGC boutique prompt word library where you can find various prompt words and spells for AI painting, helping you better use AI tools, quickly achieve desired effects, and improve work efficiency. If you are an excellent prompt word creator, you can also sell your own prompt words here.

提示工程指南

The Prompt Engineering Guide is provided by DAIR The AI initiated project aims to assist research and development and industry professionals in understanding reminder engineering.

C-Eval

C-Eval is a multi-level and multidisciplinary Chinese assessment kit suitable for large language models

Open LLM Leaderboard

The Open LLM Leaderboard is the largest open source big model ranking released by the HuggingFace community, based on the Eleuther AI Language Model Evaluation Harnesspackage.

No comments

No comments...