Vicuna

ShareGPT 에서 모은 약 125K 개의 사용자 대화 데이터를 기반으로 파인튜닝한 Llama 기반 모델

B) Training

B.1) 데이터셋

ShareGPT 데이터셋은 공개하지 않음

B.1.1) Preprocessing

  • To ensure data quality, we convert the HTML back to markdown and filter out some inappropriate or low-quality samples.
  • Additionally, we divide lengthy conversations into smaller segments that fit the model’s maximum context length.

C) Related

D) References

  • GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.