Generation Parameters
These docs are outdated! Please check out https://docs.titanml.co for the latest information on the TitanML platform. If there's anything that's not covered there, please contact us on our discord.
The API supports the standard generation parameters. See below for a description.
To use the parameters include them in the json payload:
generate_max_length
The maximum generation length
128
sampling_topk
Sample predictions from the top K most probable candidates
1
sampling_topp
Sample from predictions who's cumulative probability exceeds this value
1.0 (no restriction)
sampling_temperature
Sample with randomness. Bigger temperatures are associated with more randomness and 'creativity'.
1.0
repetition_penalty
Penalise the generation of tokens that have been generated before. Set to > 1 to penalize.
1 (no penalty)
no_repeat_ngram_size
Prevent repetitions of ngrams of this size.
0 (turned off)
Last updated