StableVicuna
Tags: AI Training Model, AI Model
StableVicuna is the first large-scale open-source chatbot trained with reinforcement learning from human feedback (RLHF). It was released by Stability AI, the company behind Stable Diffusion. StableVicuna is a further instruction-fine-tuned and RLHF-trained version of Vicuna v0 13B, which is itself an instruction-fine-tuned LLaMA model with 13 billion parameters.