Learn how to configure local or Hugging Face data as input for preprocessing.
type
to local
.
source
to provide the path to the input directory.
type
to huggingface
.
source
to specify the dataset name from the Hugging Face hub.
split
to specify the dataset split.
load_dataset
API.
load_dataset
documentation here.