If you are developing your own data loader, you might need to provide specific dependency packages in the virtual Python environments to support your data loader functions.
--mount_dir
and --python_path
when invoking run.py
to allow the workers inside the appliance to find these dependency packages. This approach is hard to use and can be error-prone.
As of release 1.9.0, the Custom Worker Container Workflow
provides seamless support for importing user-specific dependency packages in the Python environments into Cerebras appliance. With this feature, you do not need any special handling. The run.py
script will automatically find all pip-installed dependency packages on the user node, and apply them on Cerebras appliance when the data loader functions are deployed.
In the event where the custom worker container failed to be created, a fallback policy would take into effect by mounting the site packages from the user node Python virtual environment to the worker environment via a predefined NFS-based cluster volume.
In 1.9.0, both the custom worker container feature and the venv mounting fallback policy has been enabled by default. If you would like to disable the features, there are two Cerebras-specific options that can support that. We call these options debug_args
.
debug_args.debug_usr.skip_image_build
.
Setting this option to True will disable this feature.
debug_args.debug_usr.skip_user_venv_mount
. Setting this option to True will disable the fallback policy.
debug_args_writer.py
on any accessible directory on the user node:run.py
as follows: