Do you offer discounts for startups?

Talk to our Startup Team for information about startup discounts.

Do you offer discounts for NGOs, universities, or non-profits?

Talk to our Public Sector Team for information about discounts for NGOs, universities, or non-profits.

How do monthly credits work for the Team plan?

Your monthly plan fee is issued back to you as usage credits. In practice, this means your monthly plan cost becomes your minimum monthly spend, and you can use that same amount in usage at no additional charge. Any usage beyond that amount is billed separately.

An action is an individual execution of a task. It represents a specific invocation of a task with particular inputs. If a task runs multiple times (such as inside a loop) you'll see multiple actions, one for each invocation.

Can my team have a forward-deployed engineer (FDE) from Union.ai to help us build?

Book a consultation to discuss having a forward-deployed engineer from Union.ai help your team build.

Do you offer a self-hosted control plane as a deployment option?

Book a consultation to discuss self-hosted control plane deployment options.

How do you calculate GPU, CPU, Memory hours of usage?

We report the allocated resources (CPU, Memory, and GPU accelerator) from each container running the actions within your workflows and apply usage-based pricing down to the second. We do not include the resources consumed by any other services. Therefore, if you run Union on a shared K8s cluster, you are only paying for usage on the resources consumed by your Union tasks and workflows.

Is Union.ai a SaaS service?

No. You deploy the Union operator into a Kubernetes cluster you manage, which securely communicates with the Union control plane to poll for work. Your workflow executions, code, images, data, logs, and secrets all remain in your VPC/cloud, and are inaccessible to Union.

What's the difference between action concurrency and actions/run (i.e., task fanout)?

Fanout is the total number of actions a run creates, while concurrency is how many of those actions are running at the same time. For example, a run might fan out to 50,000 actions but only execute around 100 of them concurrently.

Can I run Union.ai in my own cloud environment?

Yes, Union.ai supports bring-your-own-cloud (BYOC) deployments. You can run it in your own AWS, GCP, Azure, or neo-cloud environment while maintaining full control over your data, security, and infrastructure.

Union.ai is the enterprise Flyte platform.

Experience the AI runtime with the scale, performance, and durability for production.

Try the devbox

Compare Features

Both platforms share the same Python-native authoring, dynamic workflows, and typed exception handling. Flyte workflows run on Union.ai without rewriting.

Flyte 2

Orchestrate & Serve

Compute-aware AI orchestration

Dynamic, python-based workflows

Real-time inference

Ultra-low latency

Durability & Debugging

Self-healing workflows

Automatic retries

Live remote debugging

Debug remote tasks, line-by-line, on actual infrastructure

Scalability

Fanout

~10k actions

50k+ actions

Multi-cluster

DIY

Action-level cluster routing

Multi-cloud workflows

Multi-region

Concurrent actions per run

~500

10,000+

Performance

Cold start latency

~5min

<5s

Reusable containers

<100ms task startup time

Infra maintenance

Whiteglove for BYOC deployments

Control plane operational costs

DIY, more expensive

Included, no extra cost

Developer Experience

Flyte UI and TUI for local runs

Union UI: group runs by task, view task code, create trigger form, rerun form, action level usage metrics (mem, cpu, gpu)

Data lineage

Realtime & persisted UI logging

Through cloud provider

Realtime, persisted logs securely in your cloud

Build container images in your cloud

Images are built and stored in your cloud registry

Compute plugin observability

Ray UI, Spark UI, Spark History server natively hosted

Observability dashboards

Per resource (CPU/Mem/GPU) usage dashboard
Per Node-type-usage dashboard
Cluster health dashboard

Enterprise Features

SSO

Standard (OIDC)

Custom (OIDC, SAML/p)

Role-based access control (RBAC)

Fine-grained

Managed secrets

Securely stored in your cloud

Whiteglove onboarding

Dedicated support

Results, proven in production.

Hopper visualizes 4.4 billion flight prices with pure Python orchestration

Woven by Toyota saves millions and scales autonomous driving with Union.ai

Rezo accelerates drug discovery while saving >90% on compute costs with Union.ai

Frequently asked questions

What makes Union.ai better for production AI?

Union.ai outperforms any OSS alternative on scale and performance in production. It supports 50K+ actions per workflow, 10,000+ concurrent actions per run, and cold start under 5 seconds. Reusable warm-start containers, per-action GPU and CPU profiling, cost attribution per team and workflow, and fail-fast resource validation at launch are the capabilities that separate a platform you can run experiments on from one you can run a business on.

What are reusable containers and when do they matter?

Most orchestrators launch a new Kubernetes pod per action, ~10 seconds of overhead before your code runs. Union.ai supports reusable containers: warm containers you can use across similar tasks. Cold start drops to under 100ms and GPU stays allocated across invocations. For teams building agentic AI, RAG pipelines, or multi-step inference workflows, this adds essential production efficiency.

How hard is it to migrate from Flyte to Union.ai?

Flyte workflows run on Union.ai without rewriting. The SDK is compatible and the authoring model is identical. The migration is mostly operational and straightforward. Most teams run their first workflow on Union.ai within an hour of starting setup.

We’re running open-source Flyte. What’s the real cost?

Flyte OSS is free to license. Operating it (or any open-source orchestrator) is not free. A stable production deployment requires a significant amount of manual maintenance that gets more costly as you scale. Engineers must manage Helm values, Postgres, ingress config, a separate secrets solution, an external log aggregation stack, and ongoing K8s maintenance. Union.ai offloads this maintenance so your team focuses on workflows, not infrastructure. The break-even on engineer time tends to come faster than most teams expect.

Does Union.ai make sense for smaller teams?

Scale is one part of the value. The features that tend to matter first for smaller teams are data lineage, persistent logs and built-in observability, and managed secrets that pass a security review without custom engineering. RBAC and cost attribution matter as soon as a second team starts touching the same platform. The operational overhead of self-managed Flyte tends to grow faster than the team itself does.

Why is Union’s zero trust architecture trusted by extremely security-sensitive industries?

Union’s zero trust security architecture means data NEVER transits outside your secure cloud. No model weights, pipeline outputs, or execution logs leave your environment. This is more secure than the industry status quo, where you’re required to trust a vendor to handle your data safely.

Start today and scale with confidence.

Chat with an engineer