Store Chat Messages & State Without Managing Infrastructure.Check Out DialogueDB
Skip to content

Generic Options

llm-exe attempts to normalize the inputs for various llm vendors, providing a single interface that can be used to interact with different models. While this is not always possible, since certain vendors may implement features that others don't support, either way only the allowed options make it to the respective api calls.

Options

OptionTypeDefaultDescription
timeoutnumber30000Max execution time of API call to the LLM, in milliseconds.
maxDelaynumber5000Used for retry back-off. Max time to wait between attempts when timeout has been reached, in milliseconds.
numOfAttemptsnumber2Used for retry. How many attempts should be made before throwing error
jitter"none" | "full"noneUsed for retry back-off.
temperaturenumberundefinedMaps to provider-specific temperature parameter.
maxTokensnumberundefinedMaps to provider-specific max tokens parameter.
topPnumberundefinedMaps to provider-specific top_p parameter.
stopSequencesstring[]undefinedMaps to provider-specific stop sequences parameter.
effortstringundefinedMaps to reasoning effort. Valid values: "minimal", "low", "medium", "high". Only supported by providers/models that support reasoning effort (e.g. OpenAI gpt-5, Google Gemini 2.5).
streamboolean | nullnullNote: Not supported yet.