feat: HTTP-proxy LangGraph checkpoint API by danielmillerp · Pull Request #146 · scaleapi/scale-agentex

danielmillerp · 2026-02-08T17:32:19Z

What this does

This PR adds backend support for LangGraph checkpoint persistence — the mechanism LangGraph uses to save and restore agent state between messages (conversation history, channel values, pending writes, etc.).

Why we need this

LangGraph agents need to persist their state (checkpoints) to a database. The built-in approach (AsyncPostgresSaver) has each agent connect directly to Postgres with its own connection pool. This doesn't scale — as we spin up more LangGraph agent pods, we'd hit connection limits quickly. This is the same problem we already solved for Temporal: instead of agents talking to the DB directly, they go through the backend API, which uses a shared connection pool.

Why Postgres (not MongoDB)

Even though agent state currently lives in MongoDB, we chose Postgres for checkpoint storage. There have been some reliability concerns around MongoDB recently and there's a potential future migration to Postgres. Keeping new storage in Postgres is more future-forward. The checkpoint tables are independent and don't conflict with existing MongoDB state storage.

How it works

The pattern mirrors what we do with Temporal. The agent doesn't know about the database — it talks to the backend API, and the backend handles the DB operations.

Agent (SDK HttpCheckpointSaver)  →  Backend API (/checkpoints/*)  →  Postgres

On the backend side, we:

Created 4 new Postgres tables via ORM models + Alembic migration (checkpoints, checkpoint_blobs, checkpoint_writes, checkpoint_migrations) — these mirror the schema that LangGraph's own AsyncPostgresSaver uses
Built a repository layer that reimplements the same SQL operations from AsyncPostgresSaver using our SQLAlchemy patterns (composite primary keys, JSONB metadata, upserts via ON CONFLICT)
Exposed 5 POST endpoints under /checkpoints (get-tuple, put, put-writes, list, delete-thread) — one for each method on LangGraph's BaseCheckpointSaver
Added 19 integration tests for the repository layer, running against a real Postgres via testcontainers — covering round-trip storage, blob versioning, pending writes (upsert vs skip), metadata filtering, pagination, thread/namespace isolation, and deletion

Binary blob data (serialized Python objects) is base64-encoded for JSON transport. The actual serialization/deserialization stays in the SDK — the backend just stores and retrieves raw JSONB + bytes.

Companion PRs

SDK (scale-agentex-python): feat: HTTP-proxy LangGraph checkpointer scale-agentex-python#258 — replaces AsyncPostgresSaver with HttpCheckpointSaver that calls these endpoints
Agent code: No changes needed — create_checkpointer() API is unchanged

Test plan

19 integration tests passing against real Postgres (testcontainers)
Manually tested end-to-end: backend + langgraph agent, sent messages, confirmed checkpoint stored and conversation history restored
Verified ruff lint + format passes

🤖 Generated with Claude Code

Agents no longer need a direct Postgres connection for LangGraph checkpointing. Instead, checkpoint operations are proxied through 5 new backend endpoints under /checkpoints (get-tuple, put, put-writes, list, delete-thread). Binary blob data is base64-encoded for JSON transport. Includes ORM models for the 4 checkpoint tables, Alembic migration, repository with composite-PK queries, use case layer, Pydantic schemas, and FastAPI routes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

danielmillerp requested a review from a team as a code owner February 8, 2026 17:32

danielmillerp mentioned this pull request Feb 8, 2026

feat: HTTP-proxy LangGraph checkpointer scaleapi/scale-agentex-python#258

Open

3 tasks

danielmillerp force-pushed the dm/langgraph-setup branch from 115e886 to dd5abe4 Compare February 9, 2026 03:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: HTTP-proxy LangGraph checkpoint API#146

feat: HTTP-proxy LangGraph checkpoint API#146
danielmillerp wants to merge 1 commit intomainfrom
dm/langgraph-setup

danielmillerp commented Feb 8, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

danielmillerp commented Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this does

Why we need this

Why Postgres (not MongoDB)

How it works

Companion PRs

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

danielmillerp commented Feb 8, 2026 •

edited

Loading