SuperCLUE | GPTtopic

SuperCLUE is a comprehensive evaluation benchmark for Chinese general large models, which evaluates the capabilities of models from three different dimensions: basic ability, professional ability, and Chinese characteristic ability.
The basic abilities include: semantic understanding, dialogue, logical reasoning, role simulation, coding, generation and creation, among others.
Professional abilities include: including high school, university, and professional exams, covering more than 50 abilities from mathematics, physics, geography to social sciences.
Chinese Language Ability: Targeting tasks with Chinese characteristics, it includes 10 different abilities including Chinese idioms, poetry, literature, and character shapes.

data statistics

Relevant Navigation

MMBench

MMBench is a multimodal benchmark test developed by researchers from Shanghai Artificial Intelligence Laboratory, Nanyang Technological University,

提示工程指南

The Prompt Engineering Guide is provided by DAIR The AI initiated project aims to assist research and development and industry professionals in understanding reminder engineering.

PubMedQA

PubMedQA is a biomedical research question and answer dataset that includes 1K expert annotated, 61.2K unlabeled, and 211.3K manually generated QA instances. The ranking currently includes medical test scores for 18 models.

绘AI

Hua AI is committed to exploring and creating a new way to showcase and earn profits from the creation of AI prompt words.

Awesome ChatGPT Prompts

ChatGPT Prompts set

CMMLU

CMMLU is a comprehensive Chinese language assessment benchmark specifically designed to evaluate the knowledge and reasoning ability of language models in Chinese contexts,

No comments

No comments...