
Unified Monitoring Solution (UMS)

Workshop Edition - Cloud Platform Observability
Purpose: Federated monitoring solution for enterprise Azure environments


📚 Solution Documents

| # | Document | Description | Audience |
|---|----------|-------------|----------|
| 1 | Architecture Overview | Federated monitoring architecture, LAW design patterns, RBAC strategy | Architects, Platform Team |
| 2 | Operations Runbook | KQL queries, alert response, troubleshooting, cost optimization, DCR patterns | Operations, SRE, DevOps |
| 3 | Advanced Topics | Audit logs, DR, cost optimization, AIOps roadmap | Architects, Leadership |
| 4 | Platform Observability Scenarios | Platform vs LZ monitoring, visibility at scale, Service Health | Platform Team, Leadership |
| 5 | Multi-Region Resiliency & BCDR | Single vs Per-Region LAW, failover scenarios, SLA/RTO/RPO, cost analysis | Architects, Leadership |

🎯 Key Concepts

Federated Monitoring Model

[Figure: Federated Monitoring Architecture]

Technology Stack

  • Data Collection: Azure Monitor Agent (AMA), Data Collection Rules (DCR), Data Collection Endpoints (DCE)
  • Storage: Log Analytics Workspace (LAW) with resource-context RBAC
  • Alerting: AMBA (Azure Monitor Baseline Alerts), Alert Processing Rules, Action Groups
  • Visualization: Azure Workbooks, Azure Dashboards
  • Infrastructure as Code: Bicep

🚀 Quick Start

The commands below use the PowerShell backtick (`) for line continuation; in Bash, replace each trailing backtick with a backslash (\).

1. Deploy Central Log Analytics Workspace

az deployment group create `
--resource-group <resource-group-name> `
--template-file ./log-analytic-workspace.bicep `
--parameters lawName=law-ums-central-prod
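
The contents of log-analytic-workspace.bicep are not shown here. As a rough idea of what a template accepting the lawName parameter might contain, the following is a minimal sketch. The location and retentionInDays parameters, the SKU, the API version, and the lawId output are assumptions for illustration, not the workshop's actual template.

```bicep
// Hypothetical sketch only: the real log-analytic-workspace.bicep may differ.

@description('Name of the central Log Analytics workspace.')
param lawName string

@description('Azure region. Assumed to default to the resource group location.')
param location string = resourceGroup().location

@description('Interactive retention in days (assumed default; adjust to your policy).')
@minValue(30)
@maxValue(730)
param retentionInDays int = 30

resource law 'Microsoft.OperationalInsights/workspaces@2022-10-01' = {
  name: lawName
  location: location
  properties: {
    sku: {
      name: 'PerGB2018'
    }
    retentionInDays: retentionInDays
    features: {
      // Resource-context RBAC: users can query logs for resources they can read,
      // without needing permissions on the workspace itself.
      enableLogAccessUsingOnlyResourcePermissions: true
    }
  }
}

// Resource ID that a later deployment (such as the baseline DCR) could consume.
output lawId string = law.id
```

If the real template exposes an output like this, its value can be passed as the lawId parameter in the next step.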

2. Deploy Baseline DCR

az deployment group create `
--resource-group <resource-group-name> `
--template-file ./dcr-baseline.bicep `
--parameters lawId=<law-resource-id>
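
dcr-baseline.bicep is likewise not reproduced here. The sketch below shows one plausible shape for a baseline DCR that sends performance counters to the central workspace. Only the lawId parameter comes from the command above; the dcrName default, the Windows kind, the counter list, the sampling interval, and the API version are all assumptions.

```bicep
// Hypothetical sketch only: the real dcr-baseline.bicep may differ.

@description('Resource ID of the central Log Analytics workspace.')
param lawId string

@description('Azure region. Assumed to default to the resource group location.')
param location string = resourceGroup().location

@description('Name of the data collection rule (illustrative default).')
param dcrName string = 'dcr-ums-baseline'

resource dcr 'Microsoft.Insights/dataCollectionRules@2022-06-01' = {
  name: dcrName
  location: location
  kind: 'Windows'
  properties: {
    dataSources: {
      performanceCounters: [
        {
          name: 'perfBaseline'
          streams: [
            'Microsoft-Perf'
          ]
          // Assumed interval; shorter intervals increase ingestion volume.
          samplingFrequencyInSeconds: 60
          counterSpecifiers: [
            '\\Processor Information(_Total)\\% Processor Time'
            '\\Memory\\Available Bytes'
            '\\LogicalDisk(_Total)\\% Free Space'
          ]
        }
      ]
    }
    destinations: {
      logAnalytics: [
        {
          name: 'centralLaw'
          workspaceResourceId: lawId
        }
      ]
    }
    // Route the Perf stream to the workspace destination declared above.
    dataFlows: [
      {
        streams: [
          'Microsoft-Perf'
        ]
        destinations: [
          'centralLaw'
        ]
      }
    ]
  }
}

output dcrId string = dcr.id
```

A DCR collects nothing until it is associated with machines running the Azure Monitor Agent, so a data collection rule association (or a policy that creates one) is still needed after this deployment.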

3. Assign AMBA Policy Initiative

az policy assignment create `
--name 'AMBA-UMS' `
--policy-set-definition '/providers/Microsoft.Management/managementGroups/<mg>/providers/Microsoft.Authorization/policySetDefinitions/Alerting-ServiceHealth'
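
Since the stack lists Bicep for infrastructure as code, the same assignment could also be declared in a template. The sketch below is an assumption-laden equivalent at subscription scope: the policySetDefinitionId parameter and the display name are hypothetical, and the system-assigned identity is included on the assumption that the initiative contains deployIfNotExists policies, which need an identity to remediate.

```bicep
// Hypothetical sketch only: a subscription-scope equivalent of the CLI command above.
targetScope = 'subscription'

@description('Full resource ID of the AMBA policy set definition to assign.')
param policySetDefinitionId string

resource ambaAssignment 'Microsoft.Authorization/policyAssignments@2022-06-01' = {
  name: 'AMBA-UMS'
  // A location is required whenever the assignment carries a managed identity.
  location: deployment().location
  identity: {
    type: 'SystemAssigned'
  }
  properties: {
    displayName: 'AMBA - UMS baseline alerts'
    policyDefinitionId: policySetDefinitionId
    enforcementMode: 'Default'
  }
}

// Principal ID of the managed identity, needed when granting remediation roles.
output assignmentPrincipalId string = ambaAssignment.identity.principalId
```

A template like this would be deployed with az deployment sub create. The managed identity still needs role assignments matching the roles the initiative's policies require before remediation tasks can succeed.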

📚 Microsoft Official Documentation References

These are the authoritative Microsoft sources that informed this solution. Consult them for the latest updates and for deeper detail on each topic.

Azure Monitor Best Practices

| Topic | Official URL |
|-------|--------------|
| Cost Optimization in Azure Monitor | https://learn.microsoft.com/en-us/azure/azure-monitor/best-practices-cost |
| Reliability Best Practices | https://learn.microsoft.com/en-us/azure/azure-monitor/best-practices-reliability |
| Operational Excellence | https://learn.microsoft.com/en-us/azure/azure-monitor/best-practices-operation |
| Performance Efficiency | https://learn.microsoft.com/en-us/azure/azure-monitor/best-practices-performance |
| Enterprise Monitoring Architecture | https://learn.microsoft.com/en-us/azure/azure-monitor/fundamentals/enterprise-monitoring-architecture |

Log Analytics Workspace

| Topic | Official URL |
|-------|--------------|
| LAW Best Practices | https://learn.microsoft.com/en-us/azure/azure-monitor/logs/best-practices-logs |
| Well-Architected LAW Service Guide | https://learn.microsoft.com/en-us/azure/well-architected/service-guides/azure-log-analytics |
| Workspace Design | https://learn.microsoft.com/en-us/azure/azure-monitor/logs/workspace-design |
| Cost Calculations & Options | https://learn.microsoft.com/en-us/azure/azure-monitor/logs/cost-logs |
| Data Retention & Archive | https://learn.microsoft.com/en-us/azure/azure-monitor/logs/data-retention-configure |

Data Collection Rules (DCR)

| Topic | Official URL |
|-------|--------------|
| DCR Overview | https://learn.microsoft.com/en-us/azure/azure-monitor/data-collection/data-collection-rule-overview |
| DCR Best Practices | https://learn.microsoft.com/en-us/azure/azure-monitor/data-collection/data-collection-rule-best-practices |
| Transformations in Azure Monitor | https://learn.microsoft.com/en-us/azure/azure-monitor/data-collection/data-collection-transformations |
| Transformation Samples | https://learn.microsoft.com/en-us/azure/azure-monitor/data-collection/data-collection-transformations-samples |

Alerting & AMBA

| Topic | Official URL |
|-------|--------------|
| Azure Monitor Alerts Overview | https://learn.microsoft.com/en-us/azure/azure-monitor/alerts/alerts-overview |
| Alert Types | https://learn.microsoft.com/en-us/azure/azure-monitor/alerts/alerts-types |
| Azure Monitor Baseline Alerts (AMBA) | https://azure.github.io/azure-monitor-baseline-alerts/ |
| AMBA ALZ Deployment | https://azure.github.io/azure-monitor-baseline-alerts/patterns/alz/HowTo/deploy/Introduction-to-deploying-the-ALZ-Pattern/ |

Well-Architected Framework

| Topic | Official URL |
|-------|--------------|
| Monitoring & Alerting Strategy | https://learn.microsoft.com/en-us/azure/well-architected/reliability/monitoring-alerting-strategy |
| Observability Design | https://learn.microsoft.com/en-us/azure/well-architected/operational-excellence/observability |

📊 Workshop Agenda

| Time | Topic | Document |
|------|-------|----------|
| 09:00-10:30 | Architecture Deep-Dive | 01-architecture-overview.md |
| 10:45-12:30 | Operations & Runbooks | 02-operations-runbook.md |
| 13:30-15:00 | Advanced Topics & Roadmap | 03-advanced-topics.md |

📝 Document Control

| Property | Value |
|----------|-------|
| Maintained by | Central Platform Team |
| Review Cycle | Monthly |
| Classification | Internal |
| Version | 1.0 - Workshop Edition |