Conversation
sharegpt
conversations where from
is human
/gpt
. (optional: first row with role system
to override default system prompt)
Note: type: sharegpt
opens special configs: - conversation
: enables conversions to many Conversation types. Refer to the ‘name’ here for options. - roles
: allows you to specify the roles for input and output. This is useful for datasets with custom roles such as tool
etc to support masking. - field_human
: specify the key to use instead of human
in the conversation. - field_model
: specify the key to use instead of gpt
in the conversation.
datasets:
path: ...
type: sharegpt
conversation: # Options (see Conversation 'name'): https://github.com/lm-sys/FastChat/blob/main/fastchat/conversation.py
field_human: # Optional[str]. Human key to use for conversation.
field_model: # Optional[str]. Assistant key to use for conversation.
# Add additional keys from your dataset as input or output roles
roles:
input: # Optional[List[str]]. These will be masked based on train_on_input
output: # Optional[List[str]].
pygmalion
sharegpt.load_role
conversations where role
is used instead of from
sharegpt.load_guanaco
conversations where from
is prompter
assistant
instead of default sharegpt
sharegpt_jokes
creates a chat where bot is asked to tell a joke, then explain why the joke is funny