PubMedQA
Tags: AI Model Evaluation, AI Test
PubMedQA is a biomedical research question answering dataset that includes 1K expert-annotated, 61.2K unlabeled, and 211.3K artificially generated QA instances. The ranking currently includes medical test scores for 18 models.