$ source venv/bin/activate
(venv)$ python modelzoo/transformers/pytorch/gpt2/run.py \
GPU \
-p gpt2_small_gpu.yaml\
-m train\
-o gpt2_small_gpu_model_dir\
--checkpoint_path gpt2_small_model_dir/ckpt_200.pkl
WARNING:root:The following model params are unused: allow_multireplica
2023-04-28 16:06:55,102 INFO: Loading weights from checkpoint gpt2_small_model_dir/ckpt_200.pkl
2023-04-28 16:06:56,704 INFO: Saving checkpoint at step : 200.
2023-04-28 16:07:13,434 INFO: | Train Device=cuda, Step=205, Loss=8.74219, Time=16:07:13
2023-04-28 16:07:14,815 INFO: | Train Device=cuda, Step=210, Loss=8.52344, Time=16:07:14
2023-04-28 16:07:16,194 INFO: | Train Device=cuda, Step=215, Loss=8.60156, Time=16:07:16
2023-04-28 16:07:17,576 INFO: | Train Device=cuda, Step=220, Loss=8.57812, Time=16:07:17
2023-04-28 16:07:18,959 INFO: | Train Device=cuda, Step=225, Loss=8.84375, Time=16:07:18
2023-04-28 16:07:20,362 INFO: | Train Device=cuda, Step=230, Loss=8.89062, Time=16:07:20
2023-04-28 16:07:21,760 INFO: | Train Device=cuda, Step=235, Loss=8.71875, Time=16:07:21
2023-04-28 16:07:23,143 INFO: | Train Device=cuda, Step=240, Loss=8.32812, Time=16:07:23
2023-04-28 16:07:24,566 INFO: | Train Device=cuda, Step=245, Loss=8.35156, Time=16:07:24
2023-04-28 16:07:25,950 INFO: | Train Device=cuda, Step=250, Loss=8.23438, Time=16:07:25
2023-04-28 16:07:25,951 INFO: Saving checkpoint at step : 250.
2023-04-28 16:07:41,007 INFO: Training Completed Successfully!