Custom Pre-Tokenized Dataset

How to use a custom pre-tokenized dataset.
config.yml
- path: ...
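
For illustration, a file that a `path` entry could point to might be produced along the following lines. This is only a sketch under stated assumptions: the column names `input_ids`, `attention_mask`, and `labels`, the JSON Lines layout, the toy vocabulary, and the output file name `pretokenized.jsonl` do not come from this page and are hypothetical. The value `-100` is the default `ignore_index` of PyTorch's `CrossEntropyLoss`, used here to exclude prompt tokens from the loss.

```python
import json
from pathlib import Path

# Default ignore_index of PyTorch's CrossEntropyLoss; labels with this
# value are skipped when the loss is computed.
IGNORE_INDEX = -100

# Hypothetical toy vocabulary. A real pipeline would use the tokenizer
# that belongs to the model being trained.
VOCAB = {"<unk>": 0, "what": 1, "is": 2, "2+2": 3, "?": 4, "4": 5, "<eos>": 6}


def encode(text):
    """Map whitespace-separated tokens to ids, falling back to <unk>."""
    return [VOCAB.get(tok, VOCAB["<unk>"]) for tok in text.lower().split()]


def build_record(prompt, answer):
    """Build one pre-tokenized example (column names are an assumption)."""
    prompt_ids = encode(prompt)
    answer_ids = encode(answer) + [VOCAB["<eos>"]]
    input_ids = prompt_ids + answer_ids
    return {
        "input_ids": input_ids,
        "attention_mask": [1] * len(input_ids),
        # Mask the prompt so only the answer tokens contribute to the loss.
        "labels": [IGNORE_INDEX] * len(prompt_ids) + answer_ids,
    }


def write_jsonl(records, path):
    """Write one JSON object per line."""
    with Path(path).open("w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record) + "\n")


if __name__ == "__main__":
    example = build_record("What is 2+2 ?", "4")
    write_jsonl([example], "pretokenized.jsonl")  # hypothetical file name
    print(example)
```

Masking the prompt with `-100` is one common choice. Copying `input_ids` into `labels` unchanged would train on every token instead.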