Custom Pre-Tokenized Dataset
How to use a custom pre-tokenized dataset.
- Do not pass a type: in your axolotl config.
- Columns in the dataset must be exactly input_ids, attention_mask, labels.
config.yml
- path: ...
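As a rough illustration of the column requirement above, the following sketch builds one pre-tokenized row and writes it as JSON Lines. Everything here is an assumption made for illustration: toy_tokenize is a hypothetical stand-in for a real tokenizer, the file name pretokenized.jsonl is made up, and masking prompt tokens with -100 follows the common convention for label values ignored by the loss rather than anything stated on this page.

```python
import json
import os
import tempfile

# Assumption: -100 is the label value conventionally ignored by the loss.
IGNORE_INDEX = -100


def toy_tokenize(text, vocab):
    """Hypothetical tokenizer: assigns an integer id to each whitespace-split word."""
    return [vocab.setdefault(word, len(vocab)) for word in text.split()]


def build_row(prompt, completion, vocab):
    """Build one row containing exactly input_ids, attention_mask, and labels."""
    prompt_ids = toy_tokenize(prompt, vocab)
    completion_ids = toy_tokenize(completion, vocab)
    input_ids = prompt_ids + completion_ids
    return {
        "input_ids": input_ids,
        # Every position is a real token here, so the mask is all ones.
        "attention_mask": [1] * len(input_ids),
        # Prompt positions are masked out; only the completion is trained on.
        "labels": [IGNORE_INDEX] * len(prompt_ids) + completion_ids,
    }


def write_jsonl(rows, path):
    """Write rows as JSON Lines, rejecting any row with missing or extra columns."""
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            assert set(row) == {"input_ids", "attention_mask", "labels"}
            f.write(json.dumps(row) + "\n")


vocab = {}
rows = [build_row("What is 2 + 2 ?", "4", vocab)]

# Hypothetical output location; a temporary directory keeps the sketch self-contained.
out_path = os.path.join(tempfile.mkdtemp(), "pretokenized.jsonl")
write_jsonl(rows, out_path)
print(rows[0])
```

In practice the ids would come from the tokenizer of the model being trained, and all three lists in a row must have the same length.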