Akribes by Podesta

The stack we use internally.

Studio, SDKs, the workflow server, the eval system, two-way MCP. The same platform that runs every workflow Podesta delivers under a services engagement — licensed for your team to build on.

Talk to us about a license See how it's priced

What you license

Five surfaces, one stack.

Studio

Visual editor, live event stream, inline debugger, eval panel.

SDKs

TypeScript, Python, Rust. Typed inputs, typed outputs.

Server

Workflow engine, execution persistence, audit trail, REST + SSE + WebSocket.

Eval system

Case sets, judge scripts, scoring, drift detection — first-class, not bolted on.

Two-way MCP

Consume external MCP servers as typed tools. Export your workflows as MCP servers for customer agents.

Where you author

Studio is the surface we live in.

The browser-based IDE every Podesta engineer uses to build workflows. LSP-backed editor, live event stream rendering as the workflow runs, inline debugger that can re-run from any node, an eval panel that scores your workflow against your test set.

Editor with cross-script intelligence

Go to definition jumps across use boundaries. Hover types on workflows in other scripts. Find references on a task. Semantic diff that knows about Akribes, not just text lines.

Live event stream

WorkflowStart, TaskPrompt, AgentOutput (streaming), TaskEnd, Suspended, Resumed, WorkflowEnd. The same protocol your SDK sees.

Inline debugger

Pause on a node, inspect state, rewind, run-from-here. The state inspector renders typed values from the engine, not a JSON dump.

Eval panel

Your case set, your judge script, your scoring rubric. The same number that gates a change internally is the one you show a customer.

Call from your code

Instantiate the client. Call the workflow. Get typed output.

import { AkribesClient } from "akribes-sdk-ts";

const client = new AkribesClient({
  baseUrl: "https://podesta.example.com",
  token: process.env.PODESTA_API_KEY!,
  projectId: 1,
});

const run = client.executions.runStream("contract-clause-extract", {
  inputs: { document: pdfBytes, locale: "en-GB" },
});

const output = await run.output;
// output is typed against the workflow's declared output schema.
console.log(output);

import asyncio, os
from akribes_sdk import AkribesClient

async def main() -> None:
    async with AkribesClient(
        base_url="https://podesta.example.com",
        project_id=1,
        token=os.environ["PODESTA_API_KEY"],
    ) as client:
        _id, output = await client.executions.run_and_await(
            "contract-clause-extract",
            inputs={"document": pdf_bytes, "locale": "en-GB"},
        )
        # output.result is typed against the workflow's declared output schema.
        print(output.result)

asyncio.run(main())

use akribes_sdk_rust::AkribesClient;

#[tokio::main]
async fn main() -> akribes_sdk_rust::Result<()> {
    let client = AkribesClient::builder("https://podesta.example.com")
        .project_id(1)
        .token(&std::env::var("PODESTA_API_KEY")?)
        .build();

    let (_id, output) = client
        .project(1)
        .executions()
        .run("contract-clause-extract")
        .execute_and_await(None)
        .await?;

    // output.result is typed against the workflow's declared output schema.
    println!("{:?}", output.result);
    Ok(())
}

What you don't ship

Three things stay out of your codebase.

No prompts.

Prompts live with the workflow, not in your repo. Your code calls a name and gets a result.

No model upgrades.

When a model improves, the workflow re-tests against your examples. You don't ship a release for a model change.

No AI test deps.

Workflow tests run in Akribes. Your test suite stays clean — model fragility doesn't bleed into your CI.

Two-way MCP

Consume tools. Export your workflows.

MCP is first-class in both directions. A workflow consumes external MCP servers as typed tools (databases, third-party SaaS, anything that speaks the protocol). And a workflow can be exported as an MCP tool — your Akribes script becomes something another team's agent, or another company's, can call without touching your code.

Inbound

Use any MCP server as a typed tool.

Wire a Postgres, a search index, a SaaS API — anything with an MCP surface — and call it from a workflow with the rest of the type system enforcing the shape.

Outbound

Ship your workflow as an MCP server.

Your project gets a hostable MCP endpoint with OAuth via Studio or your own SSO. Customer agents call your workflow as a tool. You ship Applied AI as a tool surface, not just a feature.

How workflows get built

Examples in. Tests pass. Workflow becomes callable.

The arc is the same one we run for services customers, told from the developer's seat. Domain experts upload examples in Studio. The compiler turns them into tests. The workflow generates and iterates until every test passes. Your code keeps calling the same name — when a workflow updates, the update flows without an engineering ticket.

Why this exists

We use the same stack to deliver services.

Every workflow Podesta ships under a services engagement runs on the stack you'd license. We didn't extract Akribes from a side project — we built it for the platform that runs our customer work. The type system, the eval surface, cross-script composition, document ingestion, the audit trail: each piece exists because we needed it to ship correct AI behaviour at scale.

BDPE's first production workflow runs on it. So does the chain that handles their inbound mail. So does the internal evaluation harness — 27 cases, a deterministic judge, the same score that gates every change we ship.

Deployment

Three options. Same stack.

Managed cloud

Run on Podesta's shared multi-tenant infrastructure. Fastest to start.

Dedicated tenant

Single-tenant Podesta-hosted environment. SOC-2 evidence, isolation guarantees.

Customer-controlled cloud

Signed container images run in your cloud account or on-prem. Audit evidence stays with the platform.

Pricing

License pricing on request.

Priced on executions, what you're building, deployment, and support. Workflows, benchmark cases, and evals are not metered — see /pricing#license for the full picture.

Talk to us about a license

Talk to us

Building Applied AI into your product?

Tell us what you're building. We come back with what fits.