create_hdf5_dataset.py
, change the value of the argument --max_seq_length
to the desired value.
For example, for a sequence length of 4096 tokens:
max_sequence_length
and the max_position_embeddings
to the desired value in the model’s configuration YAML file.
For example:
create_hdf5_dataset.py
, change the value of the argument --max_seq_length
to the desired value.
max_sequence_length
to the desired value in the model’s configuration yaml
file.
For example:
max_position_embeddings
parameter of the pretrained model remains the same. Only the max_sequence_length
parameter needs to be changed.
On the other hand, when training with a large context window, both the max_position_embeddings
and max_sequence_length
parameters need to be changed.