---
title: "Pricing for reliable AI agent orchestration runtime"
description: "Pricing for procurement on Duale AI's agent orchestration runtime, with Evaluation, Production and Enterprise tiers, AIOp metric, and model inference billed by your providers."
lang: en
lastUpdated: 2026-06-07
url: https://duale.ai/en/product/pricing
---

## AI-generated summary

Provides Duale AI’s early-access pricing for its runtime that orchestrates AI agents, defines the AIOp metric for billing runtime work, and details the three tiered plans (Evaluation, Production, Enterprise) along with the commercial review process that finalizes pricing, deployment boundaries, and provider setup.

- The pricing separates runtime work (orchestration, retries, state transitions, policy checks, review records) from model inference, which you pay directly to your providers.
- Evaluation tier provides early access to the Python SDK, a managed workspace, basic task identifiers and event signals, and a scoped AIOp volume; pricing is available on request.
- Production tier is usage-based and adds durable orchestration, recovery, and requestable review inputs; model inference is billed directly by your providers.
- Enterprise tier provides custom pricing, deployment-boundary and retention reviews, multi-project/team support, and security and privacy review aligned with rollout scope.
- Commercial review sets the AIOp volume, counting rules, support level, deployment boundary, retention, and provider setup, and creates the order form that governs runtime billing.

Summaries were generated by AI. Generative AI is experimental.

---

# Pricing for reliable agent orchestration

Pricing separates Duale runtime work from model inference: orchestration, retries, state transitions, policy checks, and review records where configured. Model inference is billed by the providers you choose.

[Security posture](https://duale.ai/en/product/security.md)

Duale pricing covers agent orchestration, retries, state transitions, event emission, policy checks, and review records where configured. It does not cover model inference, vector databases, cloud infrastructure outside the managed service, or third-party tools you contract directly.

The useful cost is not the cheapest model call. It is the cost of a reliable result that can be operated, reviewed, and improved over time.
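The split between runtime work and costs billed elsewhere can be sketched in a few lines of Python. This is only an illustration of the boundary described above: the category names and the `split_cost_items` helper are hypothetical, chosen to mirror the wording on this page, and are not part of the Duale AI SDK or its counting rules, which are fixed in the order form.

```python
# Hypothetical category labels mirroring this page's wording.
# They are NOT Duale AI identifiers; the order form defines the real counting rules.
RUNTIME_WORK = {
    "orchestration",
    "retry",
    "state_transition",
    "event_emission",
    "policy_check",
    "review_record",
}

BILLED_ELSEWHERE = {
    "model_inference",                # billed by the model provider you choose
    "vector_database",
    "external_cloud_infrastructure",  # cloud outside the managed service
    "third_party_tool",
}


def split_cost_items(items):
    """Group cost items by who bills them, keeping unrecognized items separate."""
    groups = {"runtime": [], "billed_elsewhere": [], "unclassified": []}
    for item in items:
        if item in RUNTIME_WORK:
            groups["runtime"].append(item)
        elif item in BILLED_ELSEWHERE:
            groups["billed_elsewhere"].append(item)
        else:
            # Anything not named on this page is left for commercial review.
            groups["unclassified"].append(item)
    return groups


if __name__ == "__main__":
    print(split_cost_items(["retry", "model_inference", "policy_check", "gpu_lease"]))
```

Items the page does not name are deliberately left unclassified instead of being guessed into one bucket.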

- Keep commercial control of model providers and switch them as capabilities, prices, and compliance requirements change.
- Decide when a small model is enough, when a stronger model is justified, and when another automated attempt should stop for project review.
- Move from a few pilots to many production agents without building separate visibility, retries, decision gates, and review inputs for each one.
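As a rough illustration of the second point, the sketch below tries a small model first, escalates to a stronger one, and stops for review when no attempt is accepted. It is plain Python under stated assumptions: `Outcome`, `run_with_escalation`, the tier names, and the stub models are all hypothetical and do not represent the Duale AI SDK or its routing policy.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Outcome:
    status: str             # "done" or "needs_review"
    result: Optional[str]   # accepted output, if any
    tiers_tried: List[str]  # model tiers attempted, in order


def run_with_escalation(
    task: str,
    tiers: List[Tuple[str, Callable[[str], str]]],
    accept: Callable[[str], bool],
) -> Outcome:
    """Try each model tier in order (smallest first); stop at the first accepted output.

    If no tier produces an accepted output, hand the task to review
    instead of making another automated attempt.
    """
    tried: List[str] = []
    for name, call_model in tiers:
        tried.append(name)
        output = call_model(task)
        if accept(output):
            return Outcome("done", output, tried)
    return Outcome("needs_review", None, tried)


if __name__ == "__main__":
    # Stub "models" standing in for provider calls (assumptions for the demo).
    small = lambda task: ""                   # returns nothing usable
    strong = lambda task: f"answer to {task}"
    outcome = run_with_escalation(
        "summarize contract",
        [("small", small), ("strong", strong)],
        accept=lambda out: bool(out),
    )
    print(outcome)
```

In this sketch every tier tried would be a separate provider call billed by that provider, while the decision to escalate or stop is the kind of policy check the runtime pricing is described as covering.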

Duale AI fits teams that need durable Python agents, several model providers, task identifiers, operating signals, review inputs, and a shared operating model between engineering, security, and business teams.

It is not aimed at teams looking for a finished no-code agent builder, a marketplace of prebuilt assistants, or a turnkey packaged deployment without design-partner work.

## Pricing questions

### What does the AIOp unit cover and what does it exclude

AIOp meters the Duale AI runtime work that makes an agent operable: orchestration, retries, state transitions, policy checks, runtime event emission, recovery handling, and the records the runtime attaches to a run for review. It excludes model inference, vector databases, customer cloud infrastructure, and third-party tools you contract directly with their providers.

### Who pays for model inference

Unless separately agreed in the order form, you choose and contract with your model providers, and they bill you directly for inference. Duale AI pricing covers the managed runtime and does not bundle a provider fee. External databases, customer cloud, and third-party tools are priced separately.

### Is AIOp the final billing metric

Yes. AIOp is the billing metric Duale AI uses for runtime work. The order form fixes the counting rules and included volume for your deployment alongside retention, support, deployment boundary, and provider setup.

### Is the Production package the right starting point

Pick by stage. Evaluation is for a first Python agent on the evaluation workspace with a scoped AIOp volume. Production is for teams moving an agent out of pilot into operated workflows, with runtime usage, routing policy, and review inputs. Enterprise is for regulated rollouts or multi-team portfolios that need a deployment boundary, retention, and procurement review. Commercial review picks the package against your workflow scope and decision gate.

### What does commercial review decide

Commercial review fixes the AIOp volume and counting rules, the support level, the deployment boundary, the retention duration, the provider setup, and any rollout-specific legal and commercial terms. The output is the order form.

## Related content

- [Production runtime for durable AI agents](https://duale.ai/en/home.md)
- [Turn AI pilots into a production portfolio](https://duale.ai/en/solutions/business.md)
- [Build Python agents with routing, retries, and task IDs](https://duale.ai/en/solutions/developers.md)
- [Role-based solutions for platform, business, and governance](https://duale.ai/en/solutions.md)
- [Govern AI agents in production with shared operating signals](https://duale.ai/en/solutions/governance.md)
- [Terms and conditions for SaaS service](https://duale.ai/en/legal/cgv.md)

---

## Sitemap

See the full [Markdown sitemap](https://duale.ai/sitemap.md) for all pages.