Skip to main content

Deployment Options — on-premises, private cloud, or Layerup-hosted cloud.

Layerup AI Agents can be deployed in three distinct hosting models. The right choice depends on your organisation’s IT capacity, data sovereignty requirements, regulatory posture, and — critically — how quickly you want to go live. All three models satisfy HIPAA requirements when the appropriate contractual and technical controls are in place.
dimensionOn-PremisesYour Private CloudLayerup’s Cloud
Infrastructure ownerYou (your data centre)You (your AWS / Azure account)Layerup
Who provisions serversYour IT teamYour cloud teamLayerup
Data residencyFully on-siteYour cloud VPC / VNetLayerup’s dedicated isolated cluster
AI compute (GPUs)Your hardwareYour cloud instancesLayerup-managed
Time to go liveLongestModerateFastest
HIPAA instrumentBAA requiredBAA requiredBAA required
IT burden on your teamHighestHighLowest
Recommended forStrict on-site mandatesExisting cloud maturity + data sovereigntyFastest path to production

Option 1 — On-premises deployment

In an on-premises deployment, the Layerup agent container runs entirely within your own physical data centre. No cloud provider is involved. Your infrastructure team provisions the servers, networking, storage, and GPU compute required to run the agent and its LLM inference layer.

Data stays on-site

All application data, inference calls, and audit logs remain within your physical facility. Nothing leaves your network perimeter — not to Layerup, not to a cloud provider.

Full infrastructure control

Your team configures every layer: compute, networking, identity, encryption, and logging. You are not dependent on any third-party cloud service being available.

AI compute requirement

LLM inference requires GPU hardware. Many on-premises environments are optimised for standard database and application workloads, not high-performance AI compute. Provisioning the right GPU infrastructure is typically the longest lead-time item in an on-premises deployment.

Timeline reality

Hardware procurement, rack installation, network configuration, security audits, and internal IT approvals all run sequentially. This is the slowest path to production.
When to choose on-premises: Your organisation has a hard regulatory or policy mandate that prohibits any data from being processed in a cloud environment. This is rare in 2026 — most regulated industries, including healthcare, have accepted cloud-based processing under appropriate contractual safeguards — but it remains a valid choice for organisations with strict air-gap requirements.
On-premises deployments require your team to own the full operational lifecycle: OS patching, container image updates, GPU driver management, and availability SLAs. Layerup provides the agent software and deployment guidance, but your infrastructure team is the operator. Factor this into your resourcing plan.

Option 2 — Your private cloud (customer-hosted VPC)

In this model, the Layerup agent runs entirely within your own cloud account — your AWS VPC or your Azure VNet. Your organisation owns and manages the cloud infrastructure; Layerup delivers the agent container image through the cloud marketplace into your private registry, and the agent runs exclusively inside your network boundary. This is the model described in full detail throughout the remainder of this documentation. See Native VPC Deployment Paradigm for the complete architectural specification.

Sovereign tenancy

All compute, storage, inference calls, and audit logs live inside your own cloud account. Your IAM, your encryption keys, your CloudWatch or Azure Monitor — the agent operates entirely within the controls you already own.

Zero-egress data residency

Application data never leaves your cloud boundary. LLM inference is routed through Amazon Bedrock or Azure OpenAI within your cloud tenant — not through any Layerup-controlled endpoint.

IT provisioning required

Your cloud team must provision the execution environment (Bedrock AgentCore, ECS Fargate, AKS, or Azure Container Apps), configure VPC networking, IAM roles, and any required GPU-capable instance types. This takes real coordination with your internal platform team.

Timeline reality

Go-live depends on your cloud team’s availability, internal security review processes, and the maturity of your existing cloud-native infrastructure. Healthcare organisations with slower IT approval cycles typically take longer.
When to choose your private cloud: Your organisation has existing AWS or Azure infrastructure, a capable cloud platform team, and a data sovereignty requirement that data must reside within your own cloud account — but you do not need full on-premises operation.

Option 3 — Layerup’s cloud (Layerup-hosted dedicated cluster)

In this model, Layerup provisions and operates a fully isolated, single-tenant cluster in Layerup’s own HIPAA-compliant AWS environment — dedicated exclusively to your organisation. Your data does not share compute, storage, networking, or any infrastructure with other Layerup customers. You access the agent through a secure, authenticated API endpoint; Layerup owns the infrastructure footprint.

Fastest path to production

Because Layerup controls the infrastructure, provisioning is not blocked by your IT team’s backlog. There is no hardware procurement, no internal firewall request queue, no GPU provisioning delay. This is the fastest path to production.

Single-tenant isolation

Your dedicated cluster is physically and logically isolated from every other customer. Separate VPC, separate compute, separate storage buckets, separate encryption keys. Your PHI does not mix with any other customer’s data at any layer.

HIPAA-compliant by default

Layerup’s cloud environment is HIPAA-compliant. We sign a Business Associate Agreement (BAA) with your organisation, legally binding us to HIPAA’s Security and Privacy Rules — encryption at rest and in transit, audit logging, breach notification within 60 days, and minimum necessary access. Your obligation is the BAA; the infrastructure compliance is Layerup’s responsibility.

Lowest IT burden

Your IT team is not responsible for provisioning, patching, scaling, or maintaining any infrastructure. Layerup handles all updates, monitoring, and incident response. Your team integrates with the agent via a stable API, not an infrastructure deployment.
When to choose Layerup’s cloud: Your primary goal is the fastest possible go-live, and your data sovereignty requirement is satisfied by a dedicated single-tenant cluster with a BAA in place. This is the recommended choice for most healthcare customers who want a SaaS-like experience without the AI infrastructure burden.

Business Associate Agreement (BAA)

A BAA is required before any Layerup AI Agent deployment can process Protected Health Information. This applies to all three deployment options — including Option 2, where the agent runs entirely within your own AWS account.

What a BAA is

A Business Associate Agreement is a legally binding contract under HIPAA § 164.504(e) between a Covered Entity (your organisation, as a health insurer) and a Business Associate (Layerup, as a service provider that handles PHI on your behalf). The BAA is not optional — HIPAA requires it any time PHI is disclosed to a service provider, regardless of where that service provider’s infrastructure is located.

What the BAA commits Layerup to

obligationspecifics
EncryptionPHI encrypted at rest (AES-256) and in transit (TLS 1.3) at all times
Access controlsPHI accessible only to Layerup personnel with a documented need — and only during the deployment engagement; no persistent access in production
Audit loggingAll access to PHI logged and retained for a minimum of 6 years
Breach notificationWritten notification to your organisation within 60 days of discovery of a breach involving your PHI
Minimum necessary usePHI used only to perform the contracted agent services — never for model training, product improvement, or any purpose outside the SOW
Subcontractor BAAsLayerup executes BAAs with all subcontractors (including AWS) who may have access to PHI in connection with your deployment
Return or destructionUpon contract termination, all PHI in Layerup’s possession is returned to you or securely destroyed within 30 days

What the BAA commits your organisation to

  • Providing only the minimum necessary PHI required for the agent to perform its underwriting function
  • Notifying Layerup promptly if you become aware of a breach or potential breach involving data processed by the agent
  • Ensuring your team uses the agent only in accordance with the permitted purposes in the BAA and SOW

Why the BAA is required even for Option 2

In Option 2, the agent container runs in your AWS account. However, Layerup is involved in the deployment engagement — our implementation engineers assist with configuration, troubleshooting, and AOP development. During this engagement, Layerup personnel may have access to systems that process PHI. The BAA governs this access and provides the legal framework for that involvement. After go-live, Layerup’s access is revoked entirely — but the BAA remains in place as a standing instrument for ongoing support interactions.

Execution timeline

BAA typetypical timeline
Layerup standard BAA (Layerup’s template)5–10 business days from delivery to countersignature
Customer-form BAA (your organisation’s template)Dependent on your legal team’s review cycle — Layerup will review and respond within 10 business days of receipt

Who signs

Layerup’s BAA is executed by Layerup’s VP of Legal and countersigned by your organisation’s designated BAA signatory (typically a Privacy Officer, Legal Counsel, or VP-level signatory with authority to bind your organisation to HIPAA business associate obligations).
BAA execution can begin in parallel with the technical deployment scoping — you do not need to wait for the technical engagement to start before initiating the BAA. Contact your Layerup account team to receive the standard BAA template.

Choosing the right option — decision guide

Fig. D0.1 — Deployment option decision tree. All three paths are HIPAA-permissible when a BAA is in place.

Comparative go-live timeline

milestoneOn-PremisesYour Private CloudLayerup’s Cloud
Hardware / account provisioningLongestModerateAlready done
Security review and firewall approvalsLongestModerateLayerup handles
GPU / AI compute provisioningLongestModerateAlready done
BAA executionRequiredRequiredRequired
Agent deployment and configurationModerateModerateFastest
Integration and UATModerateModerateModerate
Overall path to go liveSlowestModerateFastest
Healthcare IT deployments in a customer’s private cloud environment frequently take longer than expected due to internal IT backlogs, security review cycles, and the challenge of provisioning GPU infrastructure that most healthcare organisations do not have in their existing cloud footprint. Layerup’s cloud option bypasses the entire provisioning phase by eliminating the dependency on your IT department.