Multi-Region Resiliency & BCDR for Monitoring Infrastructure
Document Type: Decision Guide & BCDR Reference
Version: 1.0
Last Updated: April 2026
Audience: Platform Architects, Cloud Operations, Leadership
1. Context & Decision Scope
The customer has been asked to evaluate a Single Log Analytics Workspace (LAW) collecting logs from all regions at the platform level versus a Per-Region LAW model. This document provides a structured comparison with BCDR implications for the following regional topology:
| Region | Pair Type | Paired With | LAW Replication Supported |
|---|---|---|---|
| West Europe | Fixed pair | North Europe | ✅ Yes (same Europe group) |
| Germany West Central | Fixed pair | Germany North | ✅ Yes (same Europe group) |
| Germany West Central | Non-paired | Sweden Central | ✅ Yes (same Europe group) |
Key Fact: Azure LAW Workspace Replication supports any target within the same region group and is not limited to fixed pairs. The full Europe group includes France Central, Germany West Central, North Europe, UK South, West Europe, UK West, and others.
2. Single LAW vs Per-Region LAW: Architecture Comparison
2.1 Architecture Overview
2.2 Head-to-Head Comparison
| Criteria | Single Central LAW | Per-Region LAW | Winner |
|---|---|---|---|
| Operational simplicity | One workspace, one pane of glass | Multiple workspaces to manage | Single |
| Cross-region query | Native (all data in one place) | Requires workspace() cross-queries | Single |
| Data sovereignty / residency | ⚠️ All data flows to one region | ✅ Data stays in region of origin | Per-Region |
| Blast radius | 🔴 Single point of failure: total loss of visibility | 🟢 Failure isolated to one region | Per-Region |
| Ingestion latency | Higher for remote regions (network hop) | Lower (local ingestion) | Per-Region |
| Egress costs | Cross-region egress charges apply | Minimal (local ingestion) | Per-Region |
| Replication cost | Pay once for one workspace | Pay per workspace replicated | Single |
| RBAC granularity | Resource-context RBAC required | Natural workspace-level isolation | Per-Region |
| Regulatory compliance | May violate German data residency requirements | ✅ Compliant by default | Per-Region |
| Sentinel integration | One Sentinel instance | Multiple Sentinel instances or cross-workspace | Single |
| Alert rule management | Centralized | Per-region alert rules needed | Single |
| Scalability (ingestion) | Risk of hitting single LAW limits | Distributed load | Per-Region |
2.3 Recommendation Summary
| Use Case | Recommended Model |
|---|---|
| No data residency requirements, small footprint (<50 GB/day total) | Single LAW |
| German data residency (BDSG/GDPR strict), multi-region production | Per-Region LAW |
| Compliance-driven (financial, healthcare) | Per-Region LAW |
| Enterprise with 400+ landing zones across EU | Per-Region LAW with cross-workspace queries |
3. BCDR: Failover Scenarios by Region
3.1 Azure LAW Resilience Layers
| Layer | Protection Scope | Cost | Activation | RTO |
|---|---|---|---|---|
| Availability Zones | Datacenter failure within region | Free | Automatic | ~0 min |
| Workspace Replication | Full regional outage | Paid (per replicated GB) | Manual switchover | 15–30 min |
| Continuous Data Export | Data backup (no service failover) | Export + Storage costs | Manual restore | 1–4 hours |
⚠️ Critical: Azure does NOT provide automatic geo-failover for LAW. Workspace Replication requires a manual `az monitor log-analytics workspace failover` command.
3.2 Failover Matrix: Three Regional Scenarios
Scenario A: West Europe → North Europe (Fixed Pair)
| Attribute | Value |
|---|---|
| Primary | West Europe |
| Secondary | North Europe |
| Pair type | Fixed Azure region pair |
| Replication | ✅ Supported natively |
| RTO | 15–30 min (DNS propagation + manual trigger) |
| RPO | ~0 (continuous replication from enablement) |
| Switchover | az monitor log-analytics workspace failover --location northeurope |
| Switchback | az monitor log-analytics workspace failback |
| Data residency | Both in EU geography ✅ |
Scenario B: Germany West Central → Germany North (Fixed Pair)
| Attribute | Value |
|---|---|
| Primary | Germany West Central |
| Secondary | Germany North |
| Pair type | Fixed Azure region pair |
| Replication | ✅ Supported natively |
| RTO | 15–30 min |
| RPO | ~0 (continuous replication) |
| Switchover | az monitor log-analytics workspace failover --location germanynorth |
| Switchback | az monitor log-analytics workspace failback |
| Data residency | Both in German geography ✅ (satisfies BDSG) |
Scenario C: Germany West Central → Sweden Central (Non-Paired)
| Attribute | Value |
|---|---|
| Primary | Germany West Central |
| Secondary | Sweden Central |
| Pair type | Non-paired (but same Europe region group) |
| Replication | ✅ Supported: LAW replication works within region groups, not limited to pairs |
| RTO | 15–30 min |
| RPO | ~0 (continuous replication) |
| Switchover | az monitor log-analytics workspace failover --location swedencentral |
| Switchback | az monitor log-analytics workspace failback |
| Data residency | ⚠️ Data leaves German geography; review legal/compliance requirements |
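The switchover and switchback commands in the three scenarios above share one shape and differ only in the target location. The following is a minimal Python sketch that assembles them as argument lists without executing anything. The resource group `rg-monitoring` and workspace name `law-gwc-prod` are hypothetical placeholders, and the `--resource-group` / `--workspace-name` parameters are assumed from the usual Azure CLI conventions for this command group; confirm them with `--help` on your installed CLI version before use.

```python
from typing import List

# Secondary regions taken from Scenarios A-C above, keyed by scenario label.
SECONDARY_REGION = {
    "A": "northeurope",    # West Europe -> North Europe
    "B": "germanynorth",   # Germany West Central -> Germany North
    "C": "swedencentral",  # Germany West Central -> Sweden Central
}


def failover_command(scenario: str, resource_group: str, workspace: str) -> List[str]:
    """Build (but do not run) the switchover command for one scenario."""
    return [
        "az", "monitor", "log-analytics", "workspace", "failover",
        "--resource-group", resource_group,   # assumed parameter name
        "--workspace-name", workspace,        # assumed parameter name
        "--location", SECONDARY_REGION[scenario],
    ]


def failback_command(resource_group: str, workspace: str) -> List[str]:
    """Build (but do not run) the switchback command."""
    return [
        "az", "monitor", "log-analytics", "workspace", "failback",
        "--resource-group", resource_group,
        "--workspace-name", workspace,
    ]


if __name__ == "__main__":
    # Hypothetical names; print the command for review instead of running it.
    print(" ".join(failover_command("B", "rg-monitoring", "law-gwc-prod")))
    print(" ".join(failback_command("rg-monitoring", "law-gwc-prod")))
```

Returning a list rather than a shell string keeps the command safe to pass to `subprocess.run` later, once an operator has reviewed it.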
3.3 Failover Flow: Step by Step
3.4 Key Operational Facts for Failover
| Item | Detail |
|---|---|
| Trigger | Manual only, no auto-failover |
| DNS propagation | Minutes, but some clients with sticky connections take longer |
| Pre-requisite | Replication must be enabled ≥7 days before switchover (data build-up) |
| Logs before enablement | NOT replicated; only new logs post-enablement are copied |
| Alert rules | NOT auto-replicated โ must be manually exported/imported to secondary region |
| Private Links | NOT supported during failover |
| VM Insights / Container Insights | NOT supported during failover |
| Sentinel Watchlists | Up to 12 days to fully replicate |
| Workspace management ops | Blocked during switchover (retention, pricing tier, schema changes) |
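The 7-day pre-requisite in the table above is easy to overlook during an incident, so it can be worth encoding as a pre-flight check. The sketch below assumes you record the replication enablement date yourself (for example in a runbook or tag); the function and parameter names are illustrative and not part of any Azure API.

```python
from datetime import date, timedelta

# Minimum replication age before switchover, from the pre-requisite row above.
MIN_REPLICATION_AGE = timedelta(days=7)


def ready_for_switchover(replication_enabled_on: date, today: date) -> bool:
    """Return True once replication has been enabled for at least 7 days."""
    return today - replication_enabled_on >= MIN_REPLICATION_AGE


if __name__ == "__main__":
    # Hypothetical enablement date, checked against the current date.
    print(ready_for_switchover(date(2026, 4, 1), date.today()))
```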
4. SLA, RTO, RPO Matrix
4.1 Azure Monitor & LAW SLA Reference
| Component | Microsoft SLA | Notes |
|---|---|---|
| Azure Monitor Logs ingestion | 99.9% | Per-region SLA |
| Azure Monitor Metrics | 99.9% | Platform metrics |
| Log Analytics query availability | 99.9% | Does not cover regional outage |
| Workspace Replication | No additional SLA | Feature enhances resilience but is not a separate SLA commitment |
| Availability Zones | 99.99% (with AZ) | Free, automatic in supported regions |
4.2 RTO/RPO by DR Strategy
| Strategy | RPO | RTO | Annual Cost Impact | Best For |
|---|---|---|---|---|
| AZ only (no replication) | 0 | 0 (auto) | Free | Datacenter-level failure |
| Workspace Replication (fixed pair) | ~0 | 15–30 min | ~1× ingestion cost for replication | Regional outage, compliance |
| Workspace Replication (non-paired) | ~0 | 15–30 min | ~1× ingestion cost | Regional outage, flexibility |
| Dual DCR ingestion | 0 | ~0 (both active) | 2× full ingestion cost | Maximum resilience, budget available |
| Continuous Export to GRS Storage | 5–15 min | 1–4 hours | Export + Storage costs | Cost-optimized backup |
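To show how the table above can drive a selection, here is a small Python sketch that filters strategies by target RPO and RTO. It takes the worst-case end of each range in the table, treats "~0" as 0, and merges the two Workspace Replication rows because their figures are identical. All of these are simplifying assumptions, as is counting Continuous Export as regional-outage protection (it restores data only, with no service failover).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Strategy:
    name: str
    rpo_min: float                  # worst-case RPO in minutes
    rto_min: float                  # worst-case RTO in minutes
    covers_regional_outage: bool    # False for AZ-only protection


STRATEGIES = [
    Strategy("AZ only (no replication)", 0, 0, False),
    Strategy("Workspace Replication", 0, 30, True),
    Strategy("Dual DCR ingestion", 0, 0, True),
    # Data restore only, no service failover (assumption: still counted here).
    Strategy("Continuous Export to GRS Storage", 15, 240, True),
]


def candidates(max_rpo_min: float, max_rto_min: float) -> List[str]:
    """Names of strategies that survive a regional outage within the targets."""
    return [
        s.name
        for s in STRATEGIES
        if s.covers_regional_outage
        and s.rpo_min <= max_rpo_min
        and s.rto_min <= max_rto_min
    ]


if __name__ == "__main__":
    # Example targets: at most 5 minutes of data loss, back within an hour.
    print(candidates(max_rpo_min=5, max_rto_min=60))
```

Cost is deliberately left out of the filter; the output is a shortlist to weigh against the "Annual Cost Impact" column.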
4.3 Business Impact Mapping
| Outage Duration | Impact Without DR | Impact With Workspace Replication |
|---|---|---|
| < 15 min | Minor: buffered by AMA agent retry | Not triggered (within AZ tolerance) |
| 15 min – 1 hour | Alert blindness, no new data | Switchover triggered, visibility restored in ~15 min |
| 1 – 4 hours | ⚠️ SLA breach risk, no incident correlation | Full operations on secondary |
| 4+ hours | 🔴 Critical gap in audit trail, compliance exposure | Full operations on secondary |
5. Cost Analysis
5.1 Cost Model: Single LAW vs Per-Region LAW
Assumption: 100 GB/day total ingestion (60 GB West Europe, 40 GB Germany West Central), Pay-As-You-Go pricing, 90-day retention.
| Cost Component | Single LAW | Per-Region LAW (2 workspaces) |
|---|---|---|
| Ingestion (100 GB/day × ~€2.76/GB) | ~€276/day | ~€276/day (same total volume) |
| Cross-region egress | ~€0.02/GB × 40 GB = €0.80/day | €0/day |
| Workspace Replication | 1 workspace × 100 GB = ~€276/day | 2 workspaces (60 GB + 40 GB) = ~€276/day |
| Retention (90d included) | Included | Included |
| Total (without replication) | ~€277/day | ~€276/day |
| Total (with replication) | ~€553/day | ~€552/day |
Note: Costs are illustrative. Commitment Tiers (100 GB/day, 200 GB/day, etc.) reduce per-GB cost significantly. Check the Azure Monitor pricing page for current rates.
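The arithmetic behind the table above can be reproduced with a few lines of Python, which makes it easy to substitute your own volumes. The rates are the illustrative figures from the table, not current prices, and the assumption that replication costs about 1× ingestion is also taken from the table. Function and variable names are hypothetical.

```python
from typing import Dict, Optional

INGESTION_EUR_PER_GB = 2.76  # illustrative Pay-As-You-Go rate from the table
EGRESS_EUR_PER_GB = 0.02     # illustrative cross-region egress rate
REPLICATION_FACTOR = 1.0     # table assumes replication costs ~1x ingestion


def daily_cost(
    regional_gb: Dict[str, float],
    central_region: Optional[str] = None,
    replicate: bool = False,
) -> float:
    """Daily cost in EUR.

    regional_gb maps region name to GB/day produced there. If central_region
    is given, a single LAW in that region is assumed and all other regions
    pay egress; otherwise each region ingests locally (Per-Region model).
    """
    total_gb = sum(regional_gb.values())
    ingestion = total_gb * INGESTION_EUR_PER_GB
    egress_gb = total_gb - regional_gb[central_region] if central_region else 0.0
    egress = egress_gb * EGRESS_EUR_PER_GB
    replication = ingestion * REPLICATION_FACTOR if replicate else 0.0
    return round(ingestion + egress + replication, 2)


if __name__ == "__main__":
    volumes = {"westeurope": 60, "germanywestcentral": 40}
    print("Single LAW:", daily_cost(volumes, central_region="westeurope"))
    print("Per-Region:", daily_cost(volumes))
    print("Single LAW + replication:", daily_cost(volumes, "westeurope", True))
    print("Per-Region + replication:", daily_cost(volumes, replicate=True))
```

With the stated assumptions the four results round to the totals in the table; the difference between the two models is the egress line only.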
5.2 Cost Optimization Tactics
| Tactic | Savings Potential | Effort |
|---|---|---|
| Commitment Tier (100/200/300 GB/day) | 20–35% off ingestion | Low |
| DCR transformations: filter noisy logs before ingestion | 15–40% | Medium |
| Basic Logs plan for verbose/debug tables | ~67% on applicable tables | Low |
| Replicate only critical tables via selective DCR-to-DCE association | 30–60% on replication cost | Medium |
| Archive tier for data >90 days | ~90% vs interactive retention | Low |
6. Decision Framework: Quick Reference
Which model should you choose?
START
│
└─ Is German data residency required (BDSG/EU strict)?
    ├─ YES → Per-Region LAW ✅
    └─ NO
        │
        └─ Total ingestion > 200 GB/day across regions?
            ├─ YES → Per-Region LAW ✅ (distributed load)
            └─ NO
                │
                └─ Need blast radius isolation?
                    ├─ YES → Per-Region LAW ✅
                    └─ NO → Single LAW ✅ (simplicity)
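The decision tree above translates directly into a short function, which can be handy for validating landing-zone intake forms. This is a sketch of the three questions exactly as drawn; the function and parameter names are illustrative.

```python
def recommend_law_model(
    residency_required: bool,
    total_gb_per_day: float,
    need_blast_isolation: bool,
) -> str:
    """Walk the decision tree top to bottom and return the recommended model."""
    if residency_required:          # German data residency (BDSG/EU strict)
        return "Per-Region LAW"
    if total_gb_per_day > 200:      # distribute ingestion load
        return "Per-Region LAW"
    if need_blast_isolation:        # isolate failures per region
        return "Per-Region LAW"
    return "Single LAW"             # simplicity wins when nothing else applies


if __name__ == "__main__":
    # The customer profile in this guide: residency matters, 100 GB/day total.
    print(recommend_law_model(True, 100, False))
```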
Final Architecture Recommendation for This Customer
Given the customer's use of West Europe + Germany West Central with data residency considerations:
| Decision | Recommendation |
|---|---|
| LAW Model | Per-Region LAW, one per primary region |
| Replication | Enable Workspace Replication on each LAW |
| West Europe LAW | Replicate to North Europe (fixed pair) |
| Germany West Central LAW | Replicate to Germany North (fixed pair, stays in German geography) |
| Cross-workspace visibility | Use workspace() function in KQL and Azure Workbooks |
| Alerting | Deploy AMBA per region + manually replicate critical alert rules to secondary |
| Sentinel | Single Sentinel instance with cross-workspace connectors if needed |
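For the cross-workspace visibility row above, a query typically unions the same table from each regional workspace using the KQL `workspace()` expression. The Python sketch below only builds the query text; the workspace names `law-weu-prod` and `law-gwc-prod` and the helper name are hypothetical, and the example assumes the standard `Heartbeat` table is populated in both workspaces.

```python
from typing import Iterable


def cross_workspace_query(table: str, workspaces: Iterable[str], tail: str = "") -> str:
    """Build a KQL query that unions one table across several workspaces."""
    sources = ", ".join(f'workspace("{ws}").{table}' for ws in workspaces)
    query = f"union {sources}"
    # Optional pipeline stages (filters, summarize) go on the following line.
    return f"{query}\n{tail}" if tail else query


if __name__ == "__main__":
    print(
        cross_workspace_query(
            "Heartbeat",
            ["law-weu-prod", "law-gwc-prod"],  # hypothetical workspace names
            "| summarize LastSeen = max(TimeGenerated) by Computer",
        )
    )
```

The generated text can be pasted into Log Analytics or a Workbook query step; the caller needs read access to every workspace referenced.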
7. Microsoft Official References
| Topic | URL |
|---|---|
| LAW Workspace Replication | https://learn.microsoft.com/azure/azure-monitor/logs/workspace-replication |
| Azure Region Pairs | https://learn.microsoft.com/azure/reliability/cross-region-replication-azure |
| LAW Best Practices | https://learn.microsoft.com/azure/azure-monitor/logs/best-practices-logs |
| Azure Monitor Reliability | https://learn.microsoft.com/azure/azure-monitor/best-practices-reliability |
| Workspace Design | https://learn.microsoft.com/azure/azure-monitor/logs/workspace-design |
| Azure Monitor Pricing | https://azure.microsoft.com/pricing/details/monitor/ |
| LAW SLA | https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services |
Related UMS Documents: This guide extends 03-Advanced Topics § Disaster Recovery with customer-specific multi-region scenarios.