Skip to main content

Azure PaaS Backup & Recovery β€” Consolidated Enterprise Guidance

Prepared by: Microsoft Cloud Solution Architecture
Date: April 2026
Audience: Enterprise Infrastructure & BCDR Teams
Context: Enterprise Scale Landing Zone, Multi-Region BCDR Strategy
Regions of Interest: West Europe, Sweden Central, Germany West Central


Table of Contents​

  1. Executive Summary
  2. Question 1 β€” Consolidated Guidance for Recovering Azure PaaS Services Across Multiple Regions
  3. Question 2 β€” Recommended Backup Mechanisms by Azure Resource Type
  4. Question 3 β€” Comprehensive List of Azure Resources with RSV Applicability
  5. Question 4 β€” Where Third-Party Solutions Are Needed
  6. Architecture β€” Multi-Region BCDR Reference Design
  7. Decision Matrix β€” Recovery Strategy Selection
  8. Comparison Table β€” Paired vs Restricted-Pair Region BCDR
  9. Scenario Analysis
  10. Recommended Next Steps
  11. Microsoft Learn Reference Links

1. Executive Summary​

Azure provides strong business continuity and disaster recovery (BCDR) capabilities, but recovery mechanisms vary significantly by service. Unlike traditional infrastructure backup models where a single agent or vault protects everything, Azure PaaS services use a combination of:

Recovery ModelDescriptionExamples
Native geo-replication / cross-region failoverBuilt-in data replication to secondary regionsAzure SQL (failover groups), Cosmos DB (multi-region writes), Storage (GRS/GZRS)
Service-managed backupsAutomatic backups managed by the serviceAzure SQL (PITR), Cosmos DB (continuous backup), PostgreSQL Flexible Server
Azure Backup / Recovery Services VaultCentralized backup for selected workloadsAzure VMs, SQL in VM, SAP HANA in VM, Azure Files, Azure Blobs, AKS
Infrastructure-as-Code redeploymentRecreate + restore patternApp Service, Functions, Logic Apps, API Management
Third-party backup/orchestrationEnterprise backup tools for gapsCross-cloud, air-gapped, advanced Kubernetes, compliance-driven

Key Insight: There is no single universal backup model for all PaaS services. A consolidated workload-based recovery framework is the correct enterprise approach.

Enterprise Context β€” European Multi-Region Deployment​

For organizations operating primarily in West Europe with plans to expand to Sweden Central and Germany West Central, the following considerations are critical:

  • Sweden Central is paired with Sweden South (restricted-access region) β€” passive replication (GRS, Key Vault, Backup CRR) works to Sweden South, but you cannot deploy active workloads in Sweden South
  • Germany West Central is paired with Germany North (restricted-access region) β€” same dynamic: passive replication works, active deployments are restricted
  • West Europe is paired with North Europe (full bi-directional pairing)
  • For active DR (deploying workloads in a secondary region), you must choose an unrestricted region such as West Europe, Sweden Central, or Germany West Central
  • Many Azure services now support geo-replication to any region, not just paired regions

Key Nuance: Having a restricted-access paired region means services like Storage GRS, Key Vault auto-replication, and Azure Backup CRR still function for passive data protection. However, for active failover (deploying applications, creating resources), you need a different unrestricted region.

Reference: Azure region pairs and nonpaired regions
Reference: Azure regions list
Reference: Multi-region solutions in nonpaired regions


2. Question 1 β€” Consolidated Guidance for Recovering Azure PaaS Services Across Multiple Regions​

Customer Question​

"Provide consolidated guidelines for recovering PaaS services across multiple regions, including recommended backup mechanisms and third-party solutions where Microsoft support is limited."

Microsoft recommends workload-based resilience design anchored on:

ConceptDescription
Recovery Time Objective (RTO)Maximum acceptable downtime
Recovery Point Objective (RPO)Maximum acceptable data loss
Regional redundancyMulti-region or multi-zone deployment
Paired/non-paired region strategyChoose approach based on region capabilities
Regular failover testingValidate DR plans through drills
Automated redeployment (IaC)Use Bicep/ARM/Terraform for rapid recovery

Recovery Models by Service Category​

Category A β€” Native Geo-Failover Services (Automatic or Near-Automatic)​

These services have built-in cross-region replication and failover capabilities:

ServiceRecovery MechanismRPORTONon-Paired Region Support
Azure SQL DatabaseActive geo-replication, Auto-failover groups< 5 sec< 30 sec (planned)Yes β€” any region
Azure Cosmos DBMulti-region writes, Automatic failover~0 (multi-write)SecondsYes β€” any region
Azure Storage (Blob/Files)GRS/GZRS/RA-GRS~15 minHours (failover)GRS uses paired; Object Replication for non-paired
Azure Event HubsGeo-replication (Premium/Dedicated)Near real-timeMinutesYes β€” configurable
Azure Service BusGeo-DR (metadata + optional data)Metadata only or near real-timeMinutesYes β€” flexible
Azure Cache for RedisPassive geo-replication (Premium), Active geo-replication (Enterprise)Seconds–MinutesMinutesEnterprise: any region
Azure Key VaultMicrosoft-managed cross-region replication (paired regions)Near real-timeMinutesPaired regions (incl. restricted-access pairs like Sweden South): auto-replication works; truly non-paired regions: manual backup/restore

Category B β€” Backup + Restore Services​

These services provide automated backups with restore capabilities:

ServiceBackup TypeRetentionCross-RegionRPO
Azure SQL DatabaseAutomated PITR + LTR7-35 days (PITR), up to 10 years (LTR)Geo-restore from GRS backupMinutes–Hours
Azure Cosmos DBContinuous (PITR) or Periodic7-30 days (continuous)Backup stored per-region100 seconds
Azure PostgreSQL FlexibleAutomated backups, geo-redundant backupUp to 35 daysYes (geo-redundant backup option)Minutes
Azure MySQL FlexibleAutomated backupsUp to 35 daysGeo-redundant backup optionMinutes
Azure FilesAzure Backup (snapshots)ConfigurableCross-region restore with GRS vaultDaily/Hourly
Azure Blob StorageAzure Backup (operational/vaulted)ConfigurableVault tier supports CRRConfigurable

Category C β€” Recreate + Restore Data (IaC-Driven Recovery)​

These services require redeployment in the secondary region with data restore:

ServiceRecovery StrategyData ProtectionKey Consideration
Azure App ServiceRedeploy via IaC + CI/CDBuilt-in backup to Storage AccountMulti-region with Front Door recommended
Azure FunctionsRedeploy via IaC + CI/CDStorage dependency protectionSource-controlled deployment
Azure Logic AppsActive-passive or active-active multi-regionIntegration account DRTrigger-type-dependent strategy
Azure API ManagementBackup/restore to storage + multi-region deploymentPowerShell backup (30-day expiry)Premium tier supports multi-region natively
Azure Container RegistryGeo-replication (Premium tier)Images replicated cross-regionNo backup needed β€” replication is sufficient

Architecture Diagram β€” PaaS Recovery Categories​


Customer Question​

"Provide a comprehensive list of Azure resources with their supported backup mechanisms."

Complete Azure Resource Backup & Recovery Matrix​

Databases​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure SQL DatabasePITR (7-35 days), LTR (up to 10 years)No (not needed)Active geo-replication, Auto-failover groups, Geo-restoreYes β€” failover groups to any region; Geo-restore not available in regions without pairsAutomated backups
Azure SQL Managed InstancePITR (7-35 days), LTRNoFailover groupsYes β€” any regionBusiness continuity
Azure Cosmos DBContinuous (PITR 7-30 days), Periodic (configurable)NoMulti-region writes, automatic failoverYes β€” any regionDisaster recovery
Azure PostgreSQL Flexible ServerAutomated (up to 35 days)NoGeo-redundant backup, Read replicasYes β€” geo-redundant to paired; Read replicas to anyBackup & restore
Azure MySQL Flexible ServerAutomated (up to 35 days)NoGeo-redundant backup, Read replicasLimited β€” geo backup uses paired regionBackup & restore
Azure Database for MariaDBAutomated (up to 35 days)NoGeo-restoreLimitedBackup concepts

Storage​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Storage AccountsSoft delete, VersioningYes (Blob backup via Backup Vault)LRS/ZRS/GRS/GZRS/RA-GRS/RA-GZRSGRS uses paired region; Object Replication for non-pairedStorage redundancy
Azure FilesShare snapshotsYes (RSV)GRS/GZRS via storage accountCross-region restore if GRS vaultFile share backup
Azure Managed DisksIncremental snapshotsYes (Backup Vault)Cross-region copy via snapshotYes β€” snapshot to any regionDisk backup
Azure Data Lake Storage Gen2Soft delete, VersioningYes (via Blob backup)GRS/GZRSSame as Blob StorageADLS reliability

Application Platform​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure App ServiceBuilt-in backup to StorageNoNo native geo-replicationMulti-region deploy via IaC + Front DoorApp Service backup
Azure FunctionsSource-controlledNoNoRedeploy via CI/CDFunctions best practices
Azure Logic AppsNo built-in backupNoNo native geo-replicationActive-passive multi-region deployLogic Apps DR
Azure API ManagementBackup/restore via PowerShell (30-day expiry)NoMulti-region deployment (Premium tier)Yes β€” deploy gateways to any regionAPIM DR

Containers & Kubernetes​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Kubernetes Service (AKS)No built-inYes (AKS Backup extension β€” Backup Vault)No native replicationCRR to paired region (Vault tier); Multi-cluster via Fleet ManagerAKS Backup
Azure Container RegistryNo backup neededNoGeo-replication (Premium tier)Yes β€” any regionACR geo-replication
Azure Container AppsNo built-inNoNo native replicationRedeploy via IaCContainer Apps DR

Messaging & Integration​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Event HubsNo backupNoGeo-replication (Premium/Dedicated β€” data + metadata), Geo-DR (metadata only)Yes β€” configurable regionsEvent Hubs geo-replication
Azure Service BusNo backupNoGeo-DR (metadata), Application-level replicationYes β€” flexible regionsService Bus Geo-DR
Azure Event GridNo backupNoNo native geo-replicationMulti-region deployEvent Grid reliability

Security & Identity​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Key VaultIndividual secret/key/certificate backup (encrypted blob)NoMicrosoft-managed cross-region replication (to paired region, including restricted-access pairs)For truly non-paired regions only: custom multi-vault solution required. Sweden Central and Germany West Central ARE paired (restricted) β€” auto-replication works.Key Vault reliability
Azure Managed HSMBackup/restoreNoMulti-master replication via Cosmos DB backendLimited β€” contact MicrosoftManaged HSM BCDR

Monitoring & Analytics​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Monitor / Log AnalyticsData exportNoWorkspace replication (preview)Yes β€” configurable secondary regions within geographyWorkspace replication
Application InsightsData exportNoNo native replicationDeploy separate instances per regionApp Insights data retention

Networking​

ResourceNative BackupAzure Backup / RSVGeo-ReplicationNon-Paired Region SupportKey Docs
Azure Front DoorN/A (global service)N/AGlobal β€” inherently multi-regionYesFront Door overview
Azure Traffic ManagerN/A (global DNS)N/AGlobal β€” inherently multi-regionYesTraffic Manager overview
Azure DNSN/A (global service)N/AGlobalYesAzure DNS reliability
VNet / NSG / UDRNo backupNoNoRedeploy via IaCUse ARM/Bicep/Terraform

4. Question 3 β€” Comprehensive List of Azure Resources with RSV Applicability​

Customer Question​

"Provide a list specifying which resources are supported by Recovery Services Vault and where third-party solutions are required."

Recovery Services Vault (RSV) β€” Supported Workloads​

WorkloadVault TypeBackup FrequencyCross-Region RestoreNotes
Azure Virtual MachinesRecovery Services Vault1x/dayYes (CRR with GRS)Full VM snapshots, app-consistent
SQL Server in Azure VMRecovery Services VaultEvery 15 min (log), daily (full)Yes (CRR with GRS)Full + differential + log backups
SAP HANA in Azure VMRecovery Services VaultConfigurableYes (CRR with GRS)Enterprise SAP support
Azure FilesRecovery Services VaultMultiple/dayYes (if GRS vault)Share-level snapshots
On-premises (MARS Agent)Recovery Services Vault3x/dayNoFiles, folders, system state
DPM / MABS WorkloadsRecovery Services Vault2x/dayNoApp-aware, broad workload support

Backup Vault β€” Supported Workloads​

WorkloadVault TypeKey FeatureCross-Region Restore
Azure BlobsBackup VaultOperational (continuous) + Vaulted (scheduled)Yes (Vault tier)
Azure Managed DisksBackup VaultIncremental snapshotsLimited
Azure Kubernetes Service (AKS)Backup VaultCluster resources + PVsYes (Vault tier β€” CRR to paired region)
Azure PostgreSQL ServerBackup VaultLong-term retentionYes (Vault tier)
Azure Data Lake StorageBackup VaultVia Blob backupYes (Vault tier)
Azure Elastic SANBackup VaultVolume snapshotsLimited

Services Where RSV Does NOT Apply​

These services use native backup models and RSV is NOT the protection mechanism:

ServiceWhy RSV Doesn't ApplyWhat to Use Instead
Azure SQL DatabaseHas native PITR + LTR + geo-replicationBuilt-in automated backups + failover groups
Azure Cosmos DBHas native continuous/periodic backupBuilt-in PITR + multi-region replication
Azure App ServiceCompute is stateless; data is in storage/DBIaC redeployment + built-in backup to Storage
Azure FunctionsCompute is statelessIaC + CI/CD redeployment
Azure Logic AppsCompute + orchestrationMulti-region deployment
Azure API ManagementConfiguration-based serviceBackup/restore PowerShell + multi-region deploy
Azure Key VaultHas native replication (paired)Microsoft-managed replication or manual backup/restore
Azure Event HubsStreaming platformGeo-replication or application-level replication
Azure Service BusMessaging platformGeo-DR or application-level replication
Azure Container RegistryImage registryGeo-replication (Premium)

Visual Decision Tree β€” RSV vs Native vs Third-Party​


5. Question 4 β€” Where Third-Party Solutions Are Needed​

Customer Question​

"Specify where third-party solutions are required, to better inform application teams operating in a decentralized model."

When Third-Party Tools Are Commonly Used​

ScenarioWhy Third-PartyRecommended Tools
Single backup console across Azure + AWS + On-premAzure Backup is Azure-onlyVeeam, Commvault, Rubrik
Long retention compliance (> 10 years)Some Azure services have limited retentionCommvault, Rubrik, Cohesity
Air-gapped / immutable backupsRegulatory requirement for offline copiesVeeam (hardened repository), Commvault
Granular Kubernetes workload restoreAKS Backup has limitations for complex stateful appsKasten by Veeam, Velero
Advanced reporting & governanceCentralized backup compliance dashboardVeeam, Rubrik, Commvault
Cross-platform orchestrationDR orchestration across heterogeneous environmentsZerto, Commvault
Key Vault in non-paired regionsNo Microsoft-managed replication (only applies to truly non-paired regions like Italy North, Poland Central)Custom solution + scripts
Complex database-level restore orchestrationMulti-step recovery with pre/post scriptsCommvault, Rubrik

Third-Party Integration Points by Service​

Azure ServiceMicrosoft-Native ProtectionThird-Party Gap / Addition
Azure VMsAzure Backup (RSV) β€” ExcellentThird-party adds: cross-cloud, long retention, air-gap
Azure SQL DatabaseNative PITR + LTR + Failover Groups β€” ExcellentRarely needed; only for cross-cloud compliance
Azure Cosmos DBContinuous + Periodic backup β€” ExcellentRarely needed
Azure FilesAzure Backup β€” GoodThird-party for advanced file-level restore, cross-platform
AKSAKS Backup β€” Good (improving)Kasten/Velero for complex stateful workloads, Helm-aware restore
Key Vault (truly non-paired regions)Manual backup/restore β€” LimitedCustom scripts; no mainstream third-party tool for KV backup. Note: Sweden Central and Germany West Central ARE paired and have auto-replication.
App Service / FunctionsBuilt-in backup β€” BasicUsually not needed; IaC covers compute
Event Hubs / Service BusGeo-replication β€” GoodApplication-level capture (Event Hubs Capture to Storage)

Practical Recommendation for Enterprise Customers​

Given a decentralized operating model:

  1. For IaaS workloads (VMs, SQL in VM, SAP HANA): Azure Backup (RSV) is fully sufficient
  2. For PaaS databases: Native backup mechanisms are best-in-class β€” no third-party needed
  3. For AKS: Start with AKS Backup extension; evaluate Kasten/Velero if complex stateful workloads exist
  4. For Key Vault in Sweden Central / Germany West Central: Implement custom multi-vault solution with automated backup/restore scripts
  5. For compliance-driven long retention: Evaluate Commvault or Rubrik if Azure LTR doesn't meet regulatory requirements
  6. For cross-cloud scenarios: Only if the organization has multi-cloud workloads requiring unified backup

6. Architecture β€” Multi-Region BCDR Reference Design​

Enterprise Multi-Region Architecture (West Europe + Sweden Central)​

Key Architecture Notes for Regions with Restricted-Access Pairs​

ConcernWest Europe (Paired: North Europe)Sweden Central (Paired: Sweden South β€” restricted)Germany West Central (Paired: Germany North β€” restricted)
Storage GRSGRS replicates to North EuropeGRS replicates to Sweden South (passive replication works)GRS replicates to Germany North
Key VaultAuto-replication to North EuropeAuto-replication to Sweden South (read-only failover works)Auto-replication to Germany North
Azure SQL geo-restoreAvailable to paired regionAvailable to Sweden South (but cannot create resources there β€” use failover groups to another region instead)Available to Germany North
Azure Backup CRRRestores to North EuropeCRR to Sweden South should work for GRS vaultsRestores to Germany North
Active DR (deploy workloads)Deploy in North Europe or any regionCannot deploy in Sweden South β€” use West Europe or Germany West Central for active DRCan deploy in Germany North (restricted β€” request access) or use another region
Cosmos DBAny regionAny regionAny region

Important: Sweden South is restricted-access, meaning you cannot create new resources there without special access. For passive data protection (GRS, Key Vault replication, Backup CRR), the pairing works. For active disaster recovery (deploying applications, AKS clusters, App Services), you need to use a different unrestricted region as your secondary.


7. Decision Matrix β€” Recovery Strategy Selection​

Workload Tiering Model​

TierClassificationRTO TargetRPO TargetRecommended StrategyCost Impact
Tier 0Mission Critical< 1 minNear-zeroActive-active multi-region, multi-write databases, global load balancing$$$$$
Tier 1Business Critical< 15 min< 5 minActive-passive warm standby, auto-failover groups, geo-replication$$$$
Tier 2Important Internal< 4 hours< 1 hourActive-passive cold standby, scheduled backups, IaC redeployment$$
Tier 3Dev/Test / Low Priority< 24 hours< 24 hoursBackup + restore only, redeploy from IaC$

Strategy Selection per Service​

ServiceTier 0 StrategyTier 1 StrategyTier 2 StrategyTier 3 Strategy
Azure SQL DBAuto-failover groups (active-active read)Auto-failover groupsActive geo-replicationGeo-restore from backup
Cosmos DBMulti-region multi-writeMulti-region single-write + auto-failoverSingle-region + continuous backup PITRPeriodic backup
StorageRA-GZRS + application-level routingGRS/GZRSGRSLRS + scheduled backup
App ServiceActive-active + Front DoorActive-passive + Front DoorPassive-cold + IaCRedeploy manually
AKSMulti-cluster Fleet ManagerActive-passive AKS + GitOpsAKS Backup + redeployAKS Backup only
Key VaultMulti-vault active-active with app routingPrimary + manual sync secondaryPrimary + backup scriptsSingle vault
Event HubsGeo-replication activeGeo-replicationGeo-DR (metadata)Single region

8. Comparison Table β€” Paired vs Restricted-Pair Region BCDR​

CapabilityFull Pair (e.g., West Europe ↔ North Europe)Restricted-Access Pair (e.g., Sweden Central ↔ Sweden South)
Storage GRS/GZRSAutomatic replication to pair βœ…Automatic replication to restricted pair βœ… (passive protection works)
Key Vault auto-failoverMicrosoft-managed replication + failover βœ…Microsoft-managed replication to restricted pair βœ… (read-only failover)
Azure Backup CRRCross-Region Restore to paired region βœ…CRR to restricted paired region βœ… (passive restore)
Azure SQL geo-restoreGeo-redundant backup storage available βœ…Geo-restore to restricted pair βœ… (but cannot create active resources there)
Active DR β€” deploy workloads in pairβœ… Can deploy in paired region❌ Cannot deploy in restricted pair β€” use another unrestricted region
Azure Site RecoveryFull support between paired regions βœ…Full support β€” ASR works between any regions (global DR) βœ…
Cosmos DBWorks with any regionWorks with any region β€” no pairing dependency
Event Hubs Geo-replicationConfigurable to any regionConfigurable to any region
Service Bus Geo-DRConfigurableConfigurable
Sequential updatesPaired regions get staggered updatesBoth regions in same geography β€” staggering applies
Data residencyBoth regions in same geographyBoth regions in same geography βœ…

Implications for European Enterprise Deployments​

When using Sweden Central as a secondary region:

  1. Azure SQL Database: Use auto-failover groups (works with any region) β€” βœ… Fully supported
  2. Cosmos DB: Multi-region replication to any region β€” βœ… Fully supported
  3. Storage GRS: Replicates to Sweden South (restricted) for passive protection β€” βœ… Works; for active cross-region access use Object Replication to an unrestricted secondary
  4. Key Vault: Auto-replication to Sweden South works for passive failover β€” βœ… Supported; for active multi-region access, maintain a secondary Key Vault in an unrestricted region
  5. Azure Backup CRR: CRR to Sweden South should work for GRS vaults β€” βœ… Verify per workload type
  6. App Service / Functions: Multi-region deploy via IaC β€” βœ… Region-agnostic
  7. Active DR workloads: Cannot deploy in Sweden South (restricted) β€” ⚠️ Use West Europe or Germany West Central as active secondary

9. Scenario Analysis​

Scenario 1: Complete West Europe Region Outage​

Impact: All primary services unavailable
Recovery actions by service:

ServiceActionTime to RecoverData Loss Risk
Azure SQL DBAutomatic failover to Sweden Central via failover group< 30 seconds (planned), minutes (forced)< 5 seconds RPO
Cosmos DBAutomatic failover to secondary regionSecondsNear-zero (multi-write)
StorageInitiate failover (GRS to North Europe) or use Object Replication copy in Sweden CentralHours (GRS failover) or immediate (Object Replication)~15 min RPO (GRS)
Key VaultSwitch to Sweden Central Key Vault (manual failover in app config)Minutes (depends on automation)Depends on sync frequency
App ServiceFront Door routes to Sweden Central deploymentSeconds (if pre-deployed)Zero (stateless)
AKSActivate standby cluster + restore from AKS Backup30 min – 2 hoursLast backup point
Event HubsGeo-replication auto-routes to secondaryMinutesNear real-time

Scenario 2: Key Vault Resilience for Sweden Central​

Clarification: Sweden Central IS paired with Sweden South (restricted-access). This means:

  • Microsoft-managed Key Vault replication to Sweden South DOES work β€” your secrets, keys, and certificates are replicated automatically
  • In a prolonged region failure, Microsoft may initiate failover β€” the Key Vault becomes read-only in Sweden South
  • Sweden South is restricted β€” you cannot create new Key Vaults there, but the failover replica is Microsoft-managed

For active multi-region scenarios (where you need writable Key Vaults in multiple regions):

  1. Maintain a secondary Key Vault in Germany West Central or West Europe
  2. Implement automated sync using Azure Functions or Logic Apps that:
    • Periodically export secrets/keys/certificates via backup API
    • Restore to secondary Key Vault
    • Note: Backups can only restore within the same Azure geography and subscription
  3. Application-level configuration to fall back to secondary Key Vault
  4. Use Managed HSM if HSM-level protection is needed β€” it uses multi-master replication

Key Vault Backup Limitations:

  • Backups are encrypted blobs that can only be restored within the same Azure subscription and geography
  • Maximum 500 past versions per key/secret/certificate
  • Backups are point-in-time snapshots (not continuous)
  • Key Vault backup documentation

Scenario 3: AKS Workload Recovery to Secondary Region​

Challenge: Stateful Kubernetes workloads with persistent volumes
Recovery approach:

  1. Cluster configuration: Stored in Git (GitOps) β€” redeploy to any region
  2. Container images: ACR geo-replication β€” immediately available in secondary
  3. Persistent volumes: AKS Backup extension β€” snapshot + vault tier for CRR
  4. Stateful databases: Use external PaaS databases (SQL/Cosmos) with their own DR
  5. Custom hooks: Implement pre/post-snapshot hooks for database consistency

Scenario 4: Decentralized Team Onboarding β€” Providing Self-Service BCDR Guidance​

Challenge: Application teams in a decentralized model need clear self-service guidance
Recommendation:

Create a BCDR Self-Service Guide for application teams containing:

  1. Service classification form β€” teams categorize their workload tier (0-3)
  2. Pre-built Bicep/Terraform modules β€” for each service's DR setup
  3. Runbook templates β€” failover and failback procedures per service
  4. Monitoring dashboards β€” Azure Monitor workbooks for backup health
  5. Policy enforcement β€” Azure Policy to enforce backup configuration
PolicyEffectScope
VMs must have Azure Backup enabledAudit / DeployIfNotExistsAll subscriptions
Storage accounts must use GRS or GZRSAuditProduction subscriptions
SQL databases must have LTR configuredAuditProduction subscriptions
Key Vaults must have soft delete and purge protectionDenyAll subscriptions

Immediate Actions (Next 2 Weeks)​

#ActionOwnerPriority
1Create inventory of all Azure PaaS resources across subscriptionsBCDR Lead + App TeamsP0
2Classify each workload into Tiers 0-3 based on business criticalityBCDR Lead + BusinessP0
3Map each service to its recovery mechanism using the matrix aboveBCDR LeadP0
4Identify Key Vault instances in regions with restricted-access pairs and verify replication; design active multi-region strategy where neededBCDR Lead + Microsoft CSAP1

Short-Term (Next 4-8 Weeks)​

#ActionOwnerPriority
5Implement Azure Policy for backup enforcementBCDR Lead / Platform TeamP1
6Deploy IaC templates for DR infrastructure in Sweden CentralPlatform TeamP1
7Set up AKS Backup for all production clustersApp TeamsP1
8Configure auto-failover groups for all Tier 0/1 Azure SQL databasesDBA TeamP1

Medium-Term (Next Quarter)​

#ActionOwnerPriority
9Conduct first DR drill / failover testBCDR Lead + All TeamsP1
10Evaluate third-party tools (Kasten/Veeam) for complex AKS workloadsPlatform TeamP2
11Create BCDR self-service guide for decentralized app teamsMicrosoft CSA + BCDR LeadP2
12Implement centralized backup monitoring dashboard (Azure Monitor Workbooks)Platform TeamP2

Core BCDR Documentation​

TopicURL
Azure Reliability Overviewhttps://learn.microsoft.com/azure/reliability/overview
Reliability Guides by Servicehttps://learn.microsoft.com/azure/reliability/overview-reliability-guidance
Business Continuity & Disaster Recovery Conceptshttps://learn.microsoft.com/azure/reliability/concept-business-continuity-high-availability-disaster-recovery
Azure Region Pairs and Non-Paired Regionshttps://learn.microsoft.com/azure/reliability/regions-paired
Multi-Region Solutions in Non-Paired Regionshttps://learn.microsoft.com/azure/reliability/regions-multi-region-nonpaired
Well-Architected Framework β€” Reliability Pillarhttps://learn.microsoft.com/azure/well-architected/reliability/
WAF β€” Disaster Recovery Strategieshttps://learn.microsoft.com/azure/well-architected/reliability/disaster-recovery
WAF β€” Develop a DR Planhttps://learn.microsoft.com/azure/well-architected/design-guides/disaster-recovery
CAF β€” Landing Zone BCDR Design Areahttps://learn.microsoft.com/azure/cloud-adoption-framework/ready/landing-zone/design-area/management-business-continuity-disaster-recovery

Azure Backup & Recovery Services​

TopicURL
Azure Backup Overviewhttps://learn.microsoft.com/azure/backup/backup-overview
Azure Backup Support Matrixhttps://learn.microsoft.com/azure/backup/backup-support-matrix
Recovery Services Vaulthttps://learn.microsoft.com/azure/backup/backup-create-recovery-services-vault
Cross-Region Restorehttps://learn.microsoft.com/azure/backup/backup-create-rs-vault#set-cross-region-restore
Reliability in Azure Backuphttps://learn.microsoft.com/azure/reliability/reliability-backup
Backup Center Support Matrixhttps://learn.microsoft.com/azure/backup/backup-center-support-matrix

Database Services​

TopicURL
Azure SQL β€” Automated Backupshttps://learn.microsoft.com/azure/azure-sql/database/automated-backups-overview
Azure SQL β€” Active Geo-Replicationhttps://learn.microsoft.com/azure/azure-sql/database/active-geo-replication-overview
Azure SQL β€” Failover Groupshttps://learn.microsoft.com/azure/azure-sql/database/failover-group-sql-db
Azure SQL β€” HA/DR Checklisthttps://learn.microsoft.com/azure/azure-sql/database/high-availability-disaster-recovery-checklist
Azure SQL β€” Disaster Recovery Guidancehttps://learn.microsoft.com/azure/azure-sql/database/disaster-recovery-guidance
Azure SQL β€” Long-Term Retentionhttps://learn.microsoft.com/azure/azure-sql/database/long-term-retention-overview
Cosmos DB β€” Disaster Recoveryhttps://learn.microsoft.com/azure/cosmos-db/disaster-recovery-guidance
Cosmos DB β€” Continuous Backup & PITRhttps://learn.microsoft.com/azure/cosmos-db/continuous-backup-restore-introduction
Cosmos DB β€” High Availabilityhttps://learn.microsoft.com/azure/cosmos-db/high-availability
PostgreSQL Flexible β€” Backup & Restorehttps://learn.microsoft.com/azure/postgresql/flexible-server/concepts-backup-restore

Storage​

TopicURL
Azure Storage Redundancyhttps://learn.microsoft.com/azure/storage/common/storage-redundancy
Azure Files Backuphttps://learn.microsoft.com/azure/backup/azure-file-share-backup-overview
Blob Storage Object Replicationhttps://learn.microsoft.com/azure/storage/blobs/object-replication-overview
Storage Disaster Recovery Guidancehttps://learn.microsoft.com/azure/storage/common/storage-disaster-recovery-guidance

Application Platform​

TopicURL
App Service Backuphttps://learn.microsoft.com/azure/app-service/manage-backup
App Service Multi-Region DRhttps://learn.microsoft.com/azure/architecture/web-apps/guides/multi-region-app-service/multi-region-app-service
Functions Best Practiceshttps://learn.microsoft.com/azure/azure-functions/functions-best-practices
Logic Apps Multi-Region DRhttps://learn.microsoft.com/azure/logic-apps/multi-region-disaster-recovery
API Management DR (Backup/Restore)https://learn.microsoft.com/azure/api-management/api-management-howto-disaster-recovery-backup-restore

Containers & Kubernetes​

TopicURL
AKS Backup Overviewhttps://learn.microsoft.com/azure/backup/azure-kubernetes-service-backup-overview
AKS Backup & Recovery Architecturehttps://learn.microsoft.com/azure/architecture/operator-guides/aks/aks-backup-and-recovery
AKS Multi-Region Deploymenthttps://learn.microsoft.com/azure/aks/reliability-multi-region-deployment-models
AKS Reliability Guidehttps://learn.microsoft.com/azure/reliability/reliability-aks
ACR Geo-Replicationhttps://learn.microsoft.com/azure/container-registry/container-registry-geo-replication
Container Apps DRhttps://learn.microsoft.com/azure/container-apps/disaster-recovery

Security & Key Management​

TopicURL
Key Vault Reliabilityhttps://learn.microsoft.com/azure/reliability/reliability-key-vault
Key Vault Backuphttps://learn.microsoft.com/azure/key-vault/general/backup
Key Vault Availability & Redundancyhttps://learn.microsoft.com/azure/key-vault/general/disaster-recovery-guidance

Messaging & Integration​

TopicURL
Event Hubs Geo-Replicationhttps://learn.microsoft.com/azure/event-hubs/geo-replication
Event Hubs Geo-DR (Metadata)https://learn.microsoft.com/azure/event-hubs/event-hubs-geo-dr
Service Bus Geo-DRhttps://learn.microsoft.com/azure/service-bus-messaging/service-bus-geo-dr

Monitoring​

TopicURL
Log Analytics Workspace Replicationhttps://learn.microsoft.com/azure/azure-monitor/logs/workspace-replication

Closing Note​

As correctly identified by the customer's infrastructure team: current recovery mechanisms vary by resource, and a consolidated workload-based recovery framework is the right enterprise approach rather than assuming one universal backup model for all PaaS services.

The matrices and decision trees in this document provide a service-by-service mapping that enterprise application teams can use as a self-service reference in their decentralized operating model.

For regions with restricted-access pairs like Sweden Central (paired with Sweden South), passive data protection (GRS, Key Vault replication, Backup CRR) works automatically. However, for active disaster recovery (deploying workloads in a secondary region), an unrestricted secondary region must be used. This distinction should be clearly communicated to application teams.


Document prepared based on Microsoft Learn documentation as of April 2026. Service capabilities evolve β€” always verify against the latest Azure reliability guides.

πŸ“–Learn