StableVicuna

StableVicuna is the first large-scale open-source chatbot trained with reinforcement learning from human feedback (RLHF). It was released by Stability AI, the company behind Stable Diffusion. StableVicuna is Vicuna v0 13B with further instruction fine-tuning and RLHF training; Vicuna v0 13B is itself an instruction-fine-tuned LLaMA 13B model.
