Learn how to use the ModelZoo CLI.
fit
Trains a model using the specified configuration file.
cszoo fit params_model.yaml
validate
Validates a model using the specified configuration file.
cszoo validate params_model.yaml
validate_all
Runs upstream and downstream validation
cszoo validate_all params_model.yaml
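Since fit and validate take the same params file, they chain naturally. The following is a minimal sketch, not an official recipe: the `run` helper, the `DRY_RUN` variable, and the `train_and_validate` function are hypothetical names introduced here, and the script only prints the commands unless `DRY_RUN=0` is set.

```shell
# Sketch: train, then validate, with the same params file.
# `run`, DRY_RUN and train_and_validate are illustrative, not part of cszoo.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 = path to the params YAML
train_and_validate() {
  run cszoo fit "$1"
  run cszoo validate "$1"
}

train_and_validate params_model.yaml
```

With `DRY_RUN=0`, `set -e` stops the script at the first command that exits with a non-zero status, so validation does not start if training fails.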
checkpoint
Checkpoint converter
convert
Converts a checkpoint between Huggingface and Cerebras formats or between different Cerebras formats.
cszoo checkpoint convert --model gpt2 --src-fmt cs-auto --tgt-fmt hf --config workdir/params_gpt_tiny.yaml model_dir/checkpoint.mdl
convert-config
Converts a config file between Huggingface and Cerebras formats or between different Cerebras formats.
cszoo checkpoint convert-config --model gpt2 --src-fmt cs-auto --tgt-fmt hf workdir/params_gpt_tiny.yaml
list-converters
Lists all available checkpoint converters. Can also list all checkpoint converters for a specified model.
cszoo checkpoint list-converters
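The values passed to --src-fmt and --tgt-fmt presumably have to match a converter that exists for the model, so printing the converter list before converting is a reasonable habit. A minimal sketch, assuming only the invocations shown above; `run`, `DRY_RUN`, and `convert_checkpoint` are hypothetical helpers, and nothing is executed unless `DRY_RUN=0`.

```shell
# Sketch: show the available converters, then convert one checkpoint.
# `run`, DRY_RUN and convert_checkpoint are illustrative, not part of cszoo.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 = model, $2 = source format, $3 = target format,
# $4 = params YAML, $5 = checkpoint file
convert_checkpoint() {
  run cszoo checkpoint list-converters
  run cszoo checkpoint convert --model "$1" --src-fmt "$2" --tgt-fmt "$3" --config "$4" "$5"
}

convert_checkpoint gpt2 cs-auto hf workdir/params_gpt_tiny.yaml model_dir/checkpoint.mdl
```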
diff
Compares two checkpoints to identify differences.
cszoo checkpoint diff checkpoint_a.mdl checkpoint_b.mdl
info
Displays checkpoint details, including size, number of S3 sub-objects, and the software version used to save it.
cszoo checkpoint info PATH
delete
Deletes a checkpoint and all associated sub-objects (if stored in S3).
cszoo checkpoint delete PATH
copy
Copies a checkpoint between local filesystems or S3 buckets. Useful for uploading existing checkpoints from previous releases to S3.
cszoo checkpoint copy SRC_PATH DST_PATH
move
Moves a checkpoint between local filesystems or S3 buckets, deleting the original source after transfer. Useful for migrating older checkpoints to S3.
cszoo checkpoint move SRC_PATH DST_PATH
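Because move deletes the source, a more cautious route is to copy first and inspect the result before removing anything. The sketch below chains copy, diff, and info from this section; `run`, `DRY_RUN`, `copy_and_check`, and the `backup_dir` path are hypothetical, and the commands are only printed unless `DRY_RUN=0`.

```shell
# Sketch: copy a checkpoint, then compare and inspect the copy.
# `run`, DRY_RUN, copy_and_check and backup_dir are illustrative names.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 = source checkpoint, $2 = destination checkpoint
copy_and_check() {
  run cszoo checkpoint copy "$1" "$2"
  run cszoo checkpoint diff "$1" "$2"
  run cszoo checkpoint info "$2"
}

copy_and_check model_dir/checkpoint.mdl backup_dir/checkpoint.mdl
```

The sketch deliberately leaves out `cszoo checkpoint delete`: whether diff signals differences through its exit status is not stated here, so review its output before deleting the source.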
model
Query information on ModelZoo models
list
Displays all supported models.
cszoo model list
info
Shows detailed model information.
cszoo model info llama
describe
Displays model configuration parameters.
cszoo model describe llama
init_checkpoint
Creates initial model checkpoint.
cszoo model init_checkpoint <model_name>
data_preprocess
Preprocess data
list
Shows available preprocessing configurations.
cszoo data_preprocess list
pull
Copies a preprocessing configuration file to a local directory.
cszoo data_preprocess pull summarization_preprocessing -o workdir
run
Executes preprocessing using the specified configuration file.
cszoo data_preprocess run --config preprocessing.yaml
data_processor
Query information on ModelZoo data processors
list
Shows available data processors.
cszoo data_processor list
info
Displays data processor information.
cszoo data_processor info GptHDF5DataProcessor
describe
Shows processor configuration parameters.
cszoo data_processor describe GptHDF5DataProcessor
benchmark
Benchmarks a specified dataloader.
cszoo data_processor benchmark params.yaml
config
Save and manage model config files
pull
Copies a model config file to a local directory.
cszoo config pull llama2_7b -o workdir
validate
Validates a specified config file.
cszoo config validate params.yaml
convert_legacy
Upgrades V1 config files to V2 YAML.
cszoo config convert_legacy old_config.yaml
stats
Retrieves relevant statistics for a model using the specified configuration file.
cszoo config stats params.yaml
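Validating a config before asking for its statistics catches malformed files early. A minimal sketch using only the validate and stats invocations above; `run`, `DRY_RUN`, and `check_config` are hypothetical helpers, and the commands are only printed unless `DRY_RUN=0`.

```shell
# Sketch: validate a config file, then print model statistics for it.
# `run`, DRY_RUN and check_config are illustrative, not part of cszoo.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 = path to the config YAML
check_config() {
  run cszoo config validate "$1"
  run cszoo config stats "$1"
}

check_config params.yaml
```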
lm_eval
Runs evaluation using the Eleuther Eval Harness.
cszoo lm_eval workdir/params_gpt_tiny.yaml --tasks=winogrande --checkpoint_path=workdir/my_ckpt.mdl
bigcode_eval
Runs code generation benchmarks using the BigCode Eval Harness.
cszoo bigcode_eval workdir/params_gpt_tiny.yaml --tasks=mbpp --checkpoint_path=workdir/my_ckpt.mdl
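To evaluate one checkpoint on several tasks, the lm_eval invocation above can be wrapped in a loop. This is a sketch under assumptions: `run`, `DRY_RUN`, and `eval_tasks` are hypothetical helpers, one task is passed per call because only that form appears above, and any task name beyond winogrande would need to be one the harness actually supports.

```shell
# Sketch: run lm_eval once per task against the same checkpoint.
# `run`, DRY_RUN and eval_tasks are illustrative, not part of cszoo.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 = params YAML, $2 = checkpoint file, remaining arguments = task names
eval_tasks() {
  params="$1"
  ckpt="$2"
  shift 2
  for task in "$@"; do
    run cszoo lm_eval "$params" --tasks="$task" --checkpoint_path="$ckpt"
  done
}

eval_tasks workdir/params_gpt_tiny.yaml workdir/my_ckpt.mdl winogrande
```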
Create model directory
Preprocess the data
Run model
Convert checkpoint to HuggingFace
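The four steps listed above can be sketched with commands from this reference. This is one plausible mapping, not the documented workflow: it assumes "Create model directory" means creating `model_dir`, that "Run model" means `cszoo fit`, and that the file paths reused from the earlier examples exist. `run`, `DRY_RUN`, `workflow`, `PARAMS`, `PREPROC`, and `CKPT` are hypothetical names, and the commands are only printed unless `DRY_RUN=0`.

```shell
# Sketch: the four workflow steps, mapped onto commands from this page.
# All helper and variable names are illustrative; paths are placeholders.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print each command, 0 = execute it
PARAMS="${PARAMS:-workdir/params_gpt_tiny.yaml}"   # model params file
PREPROC="${PREPROC:-preprocessing.yaml}"           # preprocessing config
CKPT="${CKPT:-model_dir/checkpoint.mdl}"           # checkpoint to convert

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

workflow() {
  # 1. Create model directory (assumed to be model_dir)
  run mkdir -p model_dir
  # 2. Preprocess the data
  run cszoo data_preprocess run --config "$PREPROC"
  # 3. Run model (assumed to mean training with fit)
  run cszoo fit "$PARAMS"
  # 4. Convert checkpoint to HuggingFace
  run cszoo checkpoint convert --model gpt2 --src-fmt cs-auto --tgt-fmt hf --config "$PARAMS" "$CKPT"
}

workflow
```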
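To see usage details for any command, pass the --help flag:
cszoo ... -h
assistant
The CLI also includes an assistant subcommand. Use it to ask questions about ModelZoo or to get help with a task:
cszoo assistant "what is the checkpoint converter?"
cszoo assistant "convert my checkpoint from huggingface to cerebras"
The assistant needs a Cerebras API key in the environment:
export CEREBRAS_API_KEY=<your api key>
Don’t have an API key? Follow these instructions.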
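The assistant examples above appear to rely on CEREBRAS_API_KEY being exported, so a small guard can fail early with a clear message when it is missing. A minimal sketch: `ask_assistant` and `DRY_RUN` are hypothetical names, and the command is only printed unless `DRY_RUN=0`.

```shell
# Sketch: call the assistant only when an API key is present.
# `ask_assistant` and DRY_RUN are illustrative, not part of cszoo.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print the command, 0 = execute it

# $1 = the question to send to the assistant
ask_assistant() {
  if [ -z "${CEREBRAS_API_KEY:-}" ]; then
    echo "CEREBRAS_API_KEY is not set" >&2
    return 1
  fi
  if [ "$DRY_RUN" = "1" ]; then
    printf 'cszoo assistant "%s"\n' "$1"
  else
    cszoo assistant "$1"
  fi
}

# `|| true` keeps the script going when the key is missing.
ask_assistant "what is the checkpoint converter?" || true
```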