Azure Advanced Licensing

Azure Spot VMs: Licensing Strategy and Cost Savings Framework

Microsoft Negotiations · Est. 2016 · 500+ Engagements · $2.1B Managed

Azure Spot VMs provide 60-90% cost savings compared to PAYG pricing by allowing Microsoft to reclaim the capacity when needed. This is the single largest percentage discount available in the Azure catalogue — deeper than even 3-year reservations (49%). The catch: your VM can be evicted with 30 seconds' notice. For organisations with the right workloads, this is an extraordinary cost lever. For organisations that misapply it to production workloads, it's a reliability catastrophe. This guide gives you the framework to identify suitable workloads, design eviction-tolerant architectures, and extract maximum Spot savings without reliability risk.

The Scale of the Opportunity: A financial services firm running 500 batch processing VMs (D8_v5, 8 vCPUs) on PAYG at $0.384/hour: $1,382,400/year. The same workload on Spot at 75% discount: $345,600/year. Annual saving: $1,036,800. If the workload can tolerate eviction (most batch workloads can with checkpointing), this is a straightforward optimisation.

Independent Advisory. Zero Vendor Bias.

500+ Microsoft EA engagements. $2.1B in managed spend. 32% average cost reduction. We identify Spot VM opportunities and design eviction-tolerant architectures.

View Advisory Services →

Spot VM Pricing Mechanics

Spot VM pricing fluctuates based on supply and demand for Azure capacity in each region and availability zone. You set a maximum price (price cap) — if the Spot price rises above your cap, your VM is evicted. Setting the cap at or near PAYG price means you run whenever Spot is below PAYG (which is the majority of the time) and are evicted only during peak demand spikes.

VM SKUPAYG Price/HourTypical Spot Price/HourTypical Discount3-Year RI Price/HourSpot vs 3-Year RI
D4_v5 (4 vCPU, 16 GB)$0.192$0.038-$0.07760-80%$0.08756-66% cheaper
D8_v5 (8 vCPU, 32 GB)$0.384$0.077-$0.15460-80%$0.17556-56% cheaper
D16_v5 (16 vCPU, 64 GB)$0.768$0.154-$0.30760-80%$0.35056-66% cheaper
F8s_v2 (8 vCPU, 16 GB)$0.338$0.034-$0.10170-90%$0.15478-88% cheaper
E16_v5 (16 vCPU, 128 GB)$1.008$0.201-$0.40360-80%$0.45956-66% cheaper

Spot prices are approximate and fluctuate. East US region, Linux OS. Add Windows Hybrid Benefit where applicable — this reduces the OS component from Spot price too.

The key insight from the table: Spot VMs are typically 56-88% cheaper than even 3-year reserved instances. For workloads that can tolerate eviction, this is the most powerful Azure cost optimisation available — more impactful than any reservation strategy.

Workload Suitability Framework

Workload TypeSpot Suitable?Eviction HandlingExpected Saving
Batch data processing (ETL, analytics)✓ ExcellentCheckpoint state; restart from last checkpoint65-85%
CI/CD build agents✓ ExcellentRe-queue failed jobs; retry logic in pipeline70-80%
ML model training (non-real-time)✓ ExcellentCheckpoint model weights; resume training65-85%
Rendering / video encoding✓ ExcellentSave render progress; resume from last frame70-90%
Genomics / scientific computing✓ ExcellentCheckpoint simulation state70-85%
Dev/test environments✓ GoodRebuild from infrastructure-as-code; or Deallocate65-80%
Stateless web/API tier (with load balancer)✓ GoodLoad balancer removes evicted instance; traffic reroutes60-75%
Kubernetes worker nodes (non-critical pods)✓ GoodKubernetes reschedules pods on remaining nodes65-80%
Message queue consumers✓ GoodMessages reprocess; exactly-once delivery required65-80%
Production stateful databases✗ Not suitableCannot tolerate eviction without data loss riskN/A
Real-time streaming (sub-second SLA)✗ Not suitable30-second eviction notice insufficientN/A
Primary web servers (no redundancy)✗ Not suitableEviction causes outageN/A

Eviction Handling Architecture

The 30-second eviction notice is sufficient for graceful shutdown only if applications are pre-designed for it. Three eviction handling patterns:

Pattern 1: Checkpoint and Resume (Batch/ML)

Applications write a checkpoint to persistent storage (Azure Blob, Azure Files, or Managed Disk not attached to the Spot VM) at regular intervals — every 5-15 minutes for most batch jobs. On eviction notice, the application completes the current checkpoint write and exits cleanly. On VM restart, the job resumes from the last checkpoint. Cost of checkpointing: typically 2-5% of job duration in overhead. The 30-second eviction notice is sufficient to write most final checkpoints if the checkpoint write is in progress and is under 30 seconds.

Pattern 2: Stateless Replacement (Web/API)

Stateless application tiers behind a load balancer can use Spot in VMSS with mixed-instance pools. The recommended VMSS configuration: set a minimum of 2 on-demand (reserved) instances as the HA baseline, with Spot instances handling overflow capacity. Azure Load Balancer health probes detect evicted instances within 30-60 seconds and route traffic to remaining instances. For applications that can tolerate a brief (30-60 second) reduction in capacity, this architecture delivers 60-70% savings on overflow capacity.

Pattern 3: Eviction-Aware Kubernetes (AKS)

Azure Kubernetes Service supports Spot node pools. Eviction-tolerant pod workloads (batch jobs, non-critical background processing) are scheduled on Spot node pools via taints/tolerations. When a Spot node is evicted, AKS automatically reschedules pods on remaining nodes or scales up on-demand nodes to absorb the workload. The AKS Spot integration handles the complexity of Spot lifecycle management — it is the most operationally mature pattern for large-scale Spot adoption.

Get an Independent Second Opinion

Before deploying Spot VMs at scale, validate your eviction handling architecture with an adviser who has implemented these patterns across multiple enterprises.

Request a Consultation →

Hybrid Benefit + Spot: Maximum Discount Stacking

Azure Hybrid Benefit for Windows Server can be applied to Spot VMs, reducing the Windows licence component of the already-discounted Spot price. The combined effect for Windows workloads:

This makes Windows Server Spot VMs with Hybrid Benefit price-competitive with Linux VMs in many scenarios — a significant finding for enterprises with large Windows batch processing estates. The Hybrid Benefit application to Spot VMs uses the same SA rules as standard VMs: Datacenter covers unlimited VMs, Standard covers 2 VMs per licence.

Regional and SKU Strategy for Spot Stability

Not all Azure regions and VM families have equal Spot availability and eviction rates. Strategic considerations:

Use multiple regions: Distribute Spot workloads across East US, West US 2, and North Europe for batch processing. If one region experiences capacity pressure and evicts, the workload continues in another. Azure Batch and AKS both support multi-region Spot pools natively.

Diversify VM families: Configure VMSS or Batch pools to accept multiple compatible VM families (D4_v5, D4s_v4, D4s_v3). When capacity is constrained on D4_v5, Azure falls back to D4s_v4. This multi-family configuration can reduce eviction rates by 50-60% compared to single-SKU configurations.

Avoid GPU Spot during peak demand: GPU Spot VMs (NCsv3, NC A100 v4) are in high demand for AI training workloads. Eviction rates for GPU Spot are consistently higher than general-purpose Spot. Plan for 15-20% eviction rates on GPU Spot versus 3-8% for general-purpose Spot in standard regions.

📄 Free Guide: Azure Licensing Advanced Guide 2026

Spot VMs, Reserved Instances, Dedicated Host, Arc — complete cost optimisation frameworks for Azure's most impactful licensing decisions.

Download Free Guide →

Common Spot VM Mistakes

1. Deploying Spot without eviction handling. Applications that crash or lose data on eviction — without checkpointing or retry logic — negate the cost benefit. A failed batch job that must rerun from scratch wastes the Spot savings and adds rerun costs.

2. Single-region, single-SKU Spot pools. Concentration in one region and one VM family maximises eviction risk. Always diversify across SKU families and consider multi-region for mission-critical batch workloads.

3. Using Spot for production databases. This seems obvious, but we have seen production SQL Server workloads deployed on Spot VMs "to save money." A 30-second eviction on a primary SQL Server instance causes data inconsistency, corruption risk, and extended recovery time. Do not use Spot for any stateful production database.

4. Not monitoring Spot pricing history before deployment. Azure exposes historical Spot pricing per SKU per region via the portal and API. Review the last 30 days of pricing history before selecting a SKU to confirm the typical discount and eviction rate range. SKUs with "Medium" or "High" eviction rates will produce frequent interruptions for most workloads.

Microsoft Licensing Intelligence — Weekly

Negotiation tactics, Azure pricing alerts, and licensing analysis. Read by 4,000+ enterprise buyers.

Subscribe Free →

Frequently Asked Questions

What discount do Azure Spot VMs provide?

Azure Spot VMs provide 60-90% discount compared to PAYG pricing, varying by region, VM family, and current capacity demand. In practice, popular general-purpose VM families in East US sustain 70-80% discounts during most hours — deeper savings than even 3-year reserved instances (49%).

What happens when a Spot VM is evicted?

You receive a 30-second eviction notice before termination. You can configure Stop/Deallocate policy (VM deallocated, restartable when capacity returns) or Delete policy (VM and disk deleted). Applications must checkpoint state before eviction to avoid losing work.

Which workloads are suitable for Azure Spot VMs?

Excellent candidates: batch processing, CI/CD pipelines, ML training, rendering, ETL, dev/test environments. Not suitable: production databases, real-time streaming, or any workload where an unexpected 30-second shutdown causes user-facing or data-integrity issues.

Do Azure Hybrid Benefit or reservations apply to Spot VMs?

Azure Hybrid Benefit can be applied to Windows Server Spot VMs. Reserved Instances do NOT apply to Spot VMs. Combining Hybrid Benefit with Spot pricing achieves 86-90% total discount on Windows workloads versus PAYG — the maximum discount available in Azure.

How should I set the Spot price cap?

Best practice: set your cap at or slightly above the on-demand PAYG price. This maximises running time (you run whenever Spot is below PAYG, which is most of the time) while protecting against rare price spikes. Caps set significantly below PAYG cause frequent evictions without proportional cost benefit.

Related Azure Licensing Guides