Call an LLM from AWS Lambda or a Cron Job

LLM Calls in Lambda and Cron Jobs

A lot of useful LLM automation is a scheduled or event-driven function: classify each new support ticket, summarize yesterday's signups, triage incoming feedback. No server, no framework — one handler that makes one typed LLM call. This recipe shows the Lambda shape; the same pattern applies to any cron runner or queue consumer.

Step 1 - Create the Executor at Module Scope

Everything that doesn't depend on the event goes outside the handler, so warm invocations reuse it. Two settings matter in serverless specifically: timeout must stay below your Lambda's own timeout (otherwise Lambda kills the invocation before llm-exe can retry or throw cleanly), and the API key comes from the environment.

import {
  useLlm,
  createChatPrompt,
  createParser,
  createLlmExecutor,
} from "llm-exe";

// Created once at module scope, so warm Lambda invocations reuse it.
const classifyFeedback = createLlmExecutor({
  name: "classify-feedback",
  llm: useLlm("openai.gpt-4o-mini", {
    openAiApiKey: process.env.OPENAI_API_KEY,
    timeout: 20000, // stay well below your Lambda's own timeout
    numOfAttempts: 2,
  }),
  prompt: createChatPrompt<{ feedback: string }>(
    "Classify this customer feedback as praise, complaint, or question: {{feedback}}"
  ),
  parser: createParser("stringExtract", {
    enum: ["praise", "complaint", "question"],
  }),
});

Step 2 - The Handler Stays Small

export async function handler(event: { feedback: string }) {
  const category = await classifyFeedback.execute({
    feedback: event.feedback,
  });

  // category is typed as "praise" | "complaint" | "question"
  return {
    statusCode: 200,
    body: JSON.stringify({ category }),
  };
}

The enum-constrained parser means category is a typed union — the handler can route, tag, or store it without validating anything itself. If the model's response doesn't contain one of the three categories, the executor throws and the invocation fails visibly (and your dead-letter queue or retry policy takes over), rather than writing a garbage category to your database.

Serverless checklist

Budget retries inside the function timeout. numOfAttempts: 2 with timeout: 20000 can take ~40+ seconds worst-case — make sure the Lambda timeout accommodates the whole retry budget, or lower it.
Keys from environment/secrets manager, never bundled. All providers accept keys via options (as shown) or standard env vars.
Let failures fail. In event-driven code, a thrown LlmExeError with the event going to your DLQ beats a swallowed error and a silent gap in your data.
Log with hooks. Attach onComplete/onError hooks at module scope for structured logs and latency metrics on every invocation.

For deeper AWS integration — using llm-exe inside Step Functions state machines — that pattern works too; the executor is just an async function, so anything that can run Node can run it.

Add Retries and Timeouts — the timeout/retry options in depth
Executor Hooks — structured logging and latency metrics
Intent Classification — richer classification with custom parsers
Get a Yes/No Decision from an LLM — boolean decisions for workflow gating

LLM Calls in Lambda and Cron Jobs ​

Step 1 - Create the Executor at Module Scope ​

Step 2 - The Handler Stays Small ​

Serverless checklist ​

Related ​

LLM Calls in Lambda and Cron Jobs

Step 1 - Create the Executor at Module Scope

Step 2 - The Handler Stays Small

Serverless checklist

Related