Core Chat

Creating models and sending chat messages.

Quick Start

(require '[langchain4clj.core :as llm])

(def model (llm/create-model {:provider :openai
                               :api-key (System/getenv "OPENAI_API_KEY")}))

(llm/chat model "What is the capital of France?")
;; => "The capital of France is Paris."

Creating Models

;; OpenAI
(def openai (llm/create-model {:provider :openai
                                :api-key "sk-..."
                                :model "gpt-4o"}))

;; Anthropic Claude
(def claude (llm/create-model {:provider :anthropic
                                :api-key "sk-ant-..."
                                :model "claude-3-5-sonnet-20241022"}))

;; Google Gemini
(def gemini (llm/create-model {:provider :google-ai-gemini
                                :api-key "AIza..."
                                :model "gemini-1.5-flash"}))

;; Ollama (local)
(def ollama (llm/create-model {:provider :ollama
                                :model "llama3.1"}))

;; Mistral
(def mistral (llm/create-model {:provider :mistral
                                 :api-key "..."
                                 :model "mistral-medium-2508"}))

Or use provider-specific functions:

(def model (llm/openai-model {:api-key "sk-..."}))
(def model (llm/anthropic-model {:api-key "sk-ant-..."}))
(def model (llm/ollama-model {:model "llama3.1"}))

Chat Function

Simple chat:

(llm/chat model "Hello!")
;; => "Hello! How can I help you today?"

With options:

(llm/chat model "Write a haiku"
  {:temperature 0.9
   :max-tokens 100
   :system-message "You are a poet."})

With message history:

(import '[dev.langchain4j.data.message UserMessage AiMessage])

(def history [(UserMessage. "My name is Alice")
              (AiMessage. "Nice to meet you, Alice!")
              (UserMessage. "What's my name?")])

(llm/chat model history {})
;; => "Your name is Alice."
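For an ongoing conversation, you can carry the history forward yourself by appending each exchange to the vector. A minimal sketch, assuming (as shown above) that llm/chat accepts a vector of LangChain4j messages and returns a string; `ask` is a hypothetical helper name, not part of the library:

```clojure
(import '[dev.langchain4j.data.message UserMessage AiMessage])

(defn ask
  "Send question with accumulated history.
   Returns [reply updated-history] so the next turn can reuse the history."
  [model history question]
  (let [history' (conj history (UserMessage. question))
        reply    (llm/chat model history' {})]
    [reply (conj history' (AiMessage. reply))]))

;; Usage: thread the returned history into the next call
(let [[_ h] (ask model [] "My name is Alice")]
  (first (ask model h "What's my name?")))
```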

Options

| Option | Description |
| --- | --- |
| :temperature | Creativity (0.0-2.0, default 0.7) |
| :max-tokens | Maximum output tokens |
| :system-message | System prompt |
| :response-format | Force JSON output |
| :tools | Tool specifications |
| :listeners | Chat listeners for observability |

Threading-First API

(-> {:api-key "sk-..."}
    (llm/with-model "gpt-4o")
    (llm/with-temperature 0.8)
    (llm/with-timeout 30000)
    (llm/with-logging)
    llm/openai-model)

Chat Listeners

Add listeners for observability and monitoring:

(require '[langchain4clj.listeners :as listeners])

;; Token tracking
(def stats (atom {}))
(def tracker (listeners/token-tracking-listener stats))

;; Create model with listener
(def model
  (llm/create-model
    {:provider :openai
     :api-key "sk-..."
     :listeners [tracker]}))

;; Or using threading
(def model
  (-> {:provider :openai :api-key "sk-..."}
      (llm/with-listeners [(listeners/logging-listener) tracker])
      llm/create-model))

See Chat Listeners for details.

Thinking/Reasoning Modes

Extended thinking support for complex reasoning tasks:

OpenAI (o1, o3 models)

(def model
  (-> {:provider :openai
       :api-key "sk-..."
       :model "o3-mini"}
      (llm/with-thinking {:effort :high    ;; :low :medium :high
                          :return true})   ;; Include reasoning in response
      llm/create-model))

(llm/chat model "Solve this complex math problem...")

Anthropic (Claude 3.7+)

(def model
  (-> {:provider :anthropic
       :api-key "sk-ant-..."
       :model "claude-sonnet-4-20250514"}
      (llm/with-thinking {:enabled true
                          :budget-tokens 4096  ;; Max thinking tokens
                          :return true         ;; Include in response
                          :send true})         ;; Send in multi-turn
      llm/create-model))

Google Gemini (2.5+)

(def model
  (-> {:provider :google-ai-gemini
       :api-key "AIza..."
       :model "gemini-2.5-flash"}
      (llm/with-thinking {:enabled true
                          :effort :medium      ;; Or :budget-tokens 4096
                          :return true})
      llm/create-model))

Thinking Options

| Option | Provider | Description |
| --- | --- | --- |
| :enabled | Anthropic, Gemini | Enable thinking mode |
| :effort | OpenAI, Gemini | Reasoning effort (:low :medium :high) |
| :budget-tokens | Anthropic, Gemini | Max tokens for thinking |
| :return | All | Include thinking in response |
| :send | Anthropic, Gemini | Send thinking in multi-turn |

JSON Mode

;; ResponseFormat is a Java class, so use import rather than require
(import '[dev.langchain4j.model.chat.request ResponseFormat])

(llm/chat model "Return user data"
  {:response-format ResponseFormat/JSON})

Supported by: OpenAI, Google AI Gemini, Ollama, Mistral.
Not supported: Anthropic, Vertex AI.
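JSON mode constrains the output to valid JSON, so the reply can be parsed directly. A sketch, assuming Cheshire is on the classpath (any JSON library works):

```clojure
(require '[cheshire.core :as json])
(import '[dev.langchain4j.model.chat.request ResponseFormat])

;; Ask for structured data, then parse the JSON string into Clojure data
(-> (llm/chat model "Return a user as JSON with keys name and age"
      {:response-format ResponseFormat/JSON})
    (json/parse-string true))
;; => e.g. {:name "Alice", :age 30}
```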

Provider Defaults

| Provider | Default Model |
| --- | --- |
| OpenAI | gpt-4o-mini |
| Anthropic | claude-3-5-sonnet-20241022 |
| Google AI | gemini-1.5-flash |
| Ollama | llama3.1 |
| Mistral | mistral-medium-2508 |
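These defaults apply whenever :model is omitted, so the shortest possible configuration is just a provider and credentials:

```clojure
;; No :model key, so the provider default (gpt-4o-mini for OpenAI) is used
(def model (llm/create-model {:provider :openai
                              :api-key (System/getenv "OPENAI_API_KEY")}))
```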