
One compute plane. Every workload.
Orion orchestrates GPU workloads, containers, VMs, and bare metal from one unified compute plane. Your team manages one system instead of five.
The fragmentation problem.
Most teams manage separate systems for containers, VMs, and bare metal. That means three ops workflows, three billing systems, and three sets of failure points.
VMware costs are up 150% to more than 10×
vSphere 7 is already out of support. vSphere 8 reaches end of general support in October 2027. The migration window is narrowing, and every path forward involves rebuilding your infrastructure.
GPU hardware sits idle most of the time
Enterprise on-premises GPU utilization sits at 10–15%. The hardware is paid for. The capacity is there. Most environments lack the orchestration layer to actually use it.
Three clusters for three substrate types
Kubernetes for containers. VMware for VMs. Custom tooling for bare metal. Each has its own admin workflow, billing model, and failure mode. Your team manages the seams, not the work.
Provisioning measured in days, not minutes
Getting a researcher or artist a GPU workstation means a ticket, a queue, and someone from ops in the loop. That friction adds up. Orion provisions in 60 seconds, without IT involvement.
Orchestration as a Service.
One cluster for every workload type.

GPU Operator Automation
Typically 2–4× more workload density
Orion automates NVIDIA GPU Operator installation and configuration. Admins choose their slicing method (MIG, vGPU, or time slicing) through a UI. No YAML. No manual node labeling. AMD and Intel GPU support is on the roadmap via a community plugin. End users get more capacity without needing to know how the GPUs are sliced.
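As a rough illustration of where a density figure like this can come from: with time slicing, each physical GPU is offered to the scheduler as several shareable slots. The sketch below assumes density scales with the number of slots per GPU. The function name and the numbers are hypothetical and are not Orion's API. Time slicing shares a GPU without memory isolation, so real gains depend on the workloads.

```python
def advertised_gpu_slots(physical_gpus: int, replicas_per_gpu: int) -> int:
    """Schedulable GPU slots a node exposes when each GPU is shared N ways."""
    if physical_gpus < 0:
        raise ValueError("physical_gpus must be non-negative")
    if replicas_per_gpu < 1:
        raise ValueError("replicas_per_gpu must be at least 1")
    return physical_gpus * replicas_per_gpu


# Hypothetical node with 4 physical GPUs, shared 1, 2, and 4 ways.
for replicas in (1, 2, 4):
    print(replicas, advertised_gpu_slots(4, replicas))
```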

Autoscaling
Right-node, right-size autoscaling
When demand spikes, Orion selects the right node for the job — not just the largest available. Scale up on burst, scale back when idle. No wasted capacity, no over-provisioning.
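One way to picture "the right node, not the largest" is a best-fit choice: among the nodes that can hold a request, take the one that leaves the least capacity unused. This is a minimal sketch under that assumption, not Orion's actual scheduler. The `Node` type, `pick_node`, and the fleet are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    name: str
    free_gpus: int
    free_mem_gb: int


def pick_node(nodes: list[Node], gpus: int, mem_gb: int) -> Optional[Node]:
    """Return the smallest node that still fits the request (best fit)."""
    candidates = [
        n for n in nodes if n.free_gpus >= gpus and n.free_mem_gb >= mem_gb
    ]
    if not candidates:
        return None  # nothing fits; an autoscaler would add capacity here
    # Least leftover capacity first, so large nodes stay free for large jobs.
    return min(
        candidates,
        key=lambda n: (n.free_gpus - gpus, n.free_mem_gb - mem_gb),
    )


fleet = [
    Node("gpu-large", free_gpus=8, free_mem_gb=512),
    Node("gpu-small", free_gpus=1, free_mem_gb=64),
    Node("gpu-medium", free_gpus=4, free_mem_gb=256),
]
chosen = pick_node(fleet, gpus=1, mem_gb=32)
print(chosen.name if chosen else "no fit")
```

A one-GPU request lands on the one-GPU node, and the eight-GPU node stays available for a job that needs it.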

Load Balancing
Request-aware load balancing
Orion distributes workloads evenly across your fleet by active request count, not just CPU/memory headroom. Your cluster stays balanced without manual intervention.
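Balancing by active request count can be sketched as a least-in-flight policy: each new request goes to the node currently handling the fewest. The code below is an illustration under that assumption, not Orion's implementation. The node names and both functions are hypothetical.

```python
def pick_target(active_requests: dict[str, int]) -> str:
    """Return the node with the fewest in-flight requests (ties by name)."""
    if not active_requests:
        raise ValueError("no nodes available")
    return min(active_requests, key=lambda node: (active_requests[node], node))


def dispatch(active_requests: dict[str, int], n: int) -> dict[str, int]:
    """Assign n new requests one at a time and return the updated counts."""
    counts = dict(active_requests)  # copy so the caller's dict is untouched
    for _ in range(n):
        counts[pick_target(counts)] += 1
    return counts


print(dispatch({"node-a": 5, "node-b": 1, "node-c": 3}, 6))
```

Starting from an uneven 5/1/3 split, six new requests bring every node to five in-flight requests.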

Provisioning
60-Second Provisioning
No tickets. No JIRA queues. No waiting for IT. Orion provisions containerized and virtualized workloads on demand — researchers and artists get their environment before they've finished their coffee.

Multi-Cloud Orchestration
Crossplane: no drift, no lock-in.
Via Crossplane (available as a Terra plugin), Orion enforces your infrastructure configuration continuously, unlike Terraform or Pulumi, which apply a configuration once and can drift between runs. Mount AWS, GCP, Azure, or on-prem as a single abstraction. Deploy the same way everywhere. If a cloud raises prices or goes down, you have a path out.
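The difference between applying once and enforcing continuously comes down to a control loop: compare desired state with actual state on every tick and repair whatever has changed. This is a toy sketch of that pattern with in-memory dictionaries, not Crossplane's API. The setting names and values are illustrative assumptions.

```python
def diff(desired: dict[str, str], actual: dict[str, str]) -> dict[str, str]:
    """Return the desired values for settings that `actual` no longer matches."""
    return {k: v for k, v in desired.items() if actual.get(k) != v}


def reconcile(desired: dict[str, str], actual: dict[str, str]) -> list[str]:
    """Correct drift in place and return the keys that were repaired."""
    drift = diff(desired, actual)
    actual.update(drift)
    return sorted(drift)


desired = {"instance_type": "g5.xlarge", "replicas": "3"}
actual = dict(desired)

# Someone changes a setting out of band.
actual["replicas"] = "1"

# A control loop calls reconcile() on every tick, so the drift is repaired
# on the first pass and later passes find nothing to do.
for tick in range(3):
    print(tick, reconcile(desired, actual))
```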

Your entire fleet, one view.
Orion sits between your infrastructure and your workloads, scheduling containers, VMs, and bare metal jobs across GPU and CPU resources with unified visibility across the whole fleet.

Your DevOps team defines the rules. Your users click a button. Everything between those two moments is Orion.
Customer-hosted
Your data never leaves your perimeter.
Orion deploys in your environment: on-prem, air-gapped, or hybrid. There is no cloud management plane calling home, no vendor access to your cluster, and no egress fees. For life sciences, defense, and enterprise teams where data sovereignty is non-negotiable, this is the architecture that makes Orion viable where others aren't.
No cloud management plane
Orion runs entirely within your network. No AWS account required, no Azure backbone, no external orchestration layer. Your cluster operates independently of any vendor's cloud.
Zero vendor telemetry
No phone-home. No licensing server that needs internet access. Licensing, updates, and orchestration all operate inside your perimeter. Fully air-gapped deployments are production-supported.
No egress surprises
Data stays where you put it. No egress fees, no cross-region transfer, no hidden bandwidth costs. R3D Studios cut its AWS compute bill by about 40%, in part because its data stopped moving.
Terra App Store
The infrastructure app store. Powered by GitOps.
Once engineers start using Terra, Orion stops being infrastructure — it becomes the developer experience. Terra is Orion's infrastructure app store. Three plugin types cover everything your team needs.
Operators install GPU drivers, AI runtimes, and cluster tooling with opinionated defaults that cover the vast majority of deployments.
Template Engines define full environments — Helios containerized desktops, JupyterLab, VS Code Server, custom pipelines — delivered to any user with one click.
Network and Services plugins drop in Tailscale exit nodes, NFS provisioners, and connection brokers without touching cluster config. Fork any chart if you need to go deeper.
Alex Hatfield
CEO & Co-Founder, Juno Innovations
"The idea was always Lego bricks. You pick the tools your team needs: GPU operators, runtimes, workload templates. Click to install, and they just work. All the hard stuff stays on our side. You just build."

Terra App Store
One-click app installs
VS Code Server, JupyterLab, DCC tools, custom pipelines — deploy production-ready environments in seconds, not hours.

Templating
Reusable Templates
Build once, deploy everywhere. Create golden-image templates for dev, staging, and production workflows that work identically across your whole team.

Versioning
Full Version History
Track every environment change. Roll back to any previous configuration instantly. No more 'it worked last week' debugging sessions.

Helios Workstations
A workstation for every user. Launched in 60 seconds.
Containerized desktop environments provisioned on demand. Users request the resources they need: GPU, RAM, applications. Helios delivers a full workstation in under 60 seconds. When the session ends, resources return to the pool. No idle machines. No assigned hardware. No IT queue.
Full desktop, zero footprint
Every Helios workstation is a containerized environment with full GPU access, persistent storage, and the tools your team already uses. VS Code, JupyterLab, Nuke, Houdini, Blender — launched from a browser, destroyed on logout.
Capacity that comes back
Traditional VDI pre-provisions fixed VMs that sit idle 70% of the time. Helios provisions on demand and releases resources when sessions end. R3D Studios doubled artist capacity on the same GPU hardware with this model.
Browser-based, anywhere
Artists and researchers connect through a browser: no VPN client, no fat installer, no IT ticket. Selkies delivers color-accurate streaming with sub-frame latency via WebRTC. Works from the office, from home, or from a hotel lobby.
KubeVirt
Windows and Linux VMs, orchestrated like containers.
Orion orchestrates Windows and Linux VMs through KubeVirt on the same Kubernetes cluster as your containers. Windows Server 2019 and 2022, GPU pass-through and vGPU slicing, live migration between nodes without downtime: all managed from the same compute plane. No separate hypervisor stack. No infrastructure consolidation project. Run Adobe Creative Suite on Windows while rendering on Linux, and deploy both with the same 60-second provisioning your containerized workloads get.
Windows app support
Adobe Creative Suite, Autodesk, DaVinci Resolve, and more.
GPU pass-through and vGPU slicing
Run GPU-accelerated Windows workloads via KubeVirt.
Helm-compatible by default
Import existing Kubernetes workload definitions, templatize them, and make them requestable by end users. No rewriting required.
See everything. Operate with confidence.
Orion surfaces utilization, CPU saturation, memory pressure, job throughput, and cost-per-workload across your entire fleet in real time. Provision in minutes with reusable templates, enforce resource quotas across teams, and run on any storage layer you already own. See where you're getting value and where capacity is sitting idle.
Up and running fast
Provision GPU capacity in minutes with sane defaults and reusable templates. No spreadsheet archaeology, no tribal knowledge required.
Built for shared teams
Researchers, engineers, and platform teams all in one place — with resource quotas, queue management, and RBAC so nobody steps on each other.
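A per-team resource quota can be pictured as a simple admission check: a request is granted only if it keeps the team under its limit, and capacity is handed back when a session ends. This is a minimal sketch under that assumption, not Orion's quota model. The `TeamQuota` class and the numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TeamQuota:
    gpu_limit: int
    gpus_in_use: int = 0

    def try_allocate(self, gpus: int) -> bool:
        """Grant the request only if it keeps the team within its limit."""
        if gpus <= 0:
            raise ValueError("gpus must be positive")
        if self.gpus_in_use + gpus > self.gpu_limit:
            return False  # over quota: the request would wait in a queue
        self.gpus_in_use += gpus
        return True

    def release(self, gpus: int) -> None:
        """Return GPUs to the team's budget when a session ends."""
        self.gpus_in_use = max(0, self.gpus_in_use - gpus)


research = TeamQuota(gpu_limit=4)
print(research.try_allocate(3))  # fits within the limit
print(research.try_allocate(2))  # would exceed the limit
research.release(3)
print(research.try_allocate(2))  # fits again after the release
```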
No lock-in. Your infrastructure, your choice.
Orion connects through standard Kubernetes primitives. No proprietary plugins, no forced migration, no rearchitecture required. Deploy on what you already have.
Works with the storage you already have
NFS and iSCSI connect natively. Qumulo, Weka, Vast, S3, and any other CSI-compatible provider connect via a standard Kubernetes CSI driver. No migration, no rearchitecture, no new storage vendor required.
Why infrastructure teams choose Orion
Three capabilities no funded competitor offers simultaneously. Here's how the platforms compare.
Traditional infrastructure management
Manual GPU provisioning — hours of wait time per deployment.
10-15% average GPU utilization — paying for capacity you never use.
Siloed clusters with no unified view across your fleet.
Kubernetes complexity that requires a dedicated platform team.
VMs and containers managed by completely separate tools.
Vendor lock-in with proprietary orchestration layers.
Juno
Purpose-built for compute orchestration — containers, VMs, GPUs, and bare metal.
Typically 2–4× workload density via native GPU operator time slicing — no new hardware.
One compute plane — containers, VMs, and bare metal unified.
Fast provisioning via the Orion dashboard — no complex setup required.
Production-ready in days, not months. Kubernetes-native.
Open standards — no vendor lock-in, ever.
Frequently asked questions

Orion is a containerized workload platform for on-prem, cloud, or hybrid deployments with GPU time-slicing and auto-scaling.

Yes, Orion mounts your existing file shares and toolchain locations into containers.

2–5 seconds with cached images, 1–3 minutes for new nodes depending on workload type.

Yes, from Raspberry Pis to enterprise-grade hardware: anything that can run containers.

No license is needed for default time slicing. NVIDIA vGPU (formerly GRID) licensing is required only for vGPU mode; MIG itself needs no additional license.

Any Kubernetes cluster - EKS, AKS, GKE, or on-premises. We're 100% cloud-agnostic.

Per user, per month, with volume discounts for larger teams. We will be moving to a node- and consumption-based model this year.
No long-term contract required · Deploy in your environment · Up and running in under two minutes
See Orion in your environment.
Most teams are getting 10–15% GPU utilization out of hardware they've already paid for. Orion changes that without a rip-and-replace. Talk to us about your workload profile.
