AI Model Evaluation

SuperCLUE

SuperCLUE is a comprehensive evaluation benchmark for Chinese general-purpose large models. It evaluates model capabilities along three dimensions: basic abilities, professional abilities, and Chinese-characteristic abilities.
Basic abilities: semantic understanding, dialogue, logical reasoning, role simulation, coding, and generation and creation, among others.
Professional abilities: high school, university, and professional exams, covering more than 50 abilities ranging from mathematics, physics, and geography to the social sciences.
Chinese-characteristic abilities: tasks with distinctly Chinese characteristics, covering 10 abilities including Chinese idioms, poetry, literature, and character shapes.
