Observability Architect (Dynatrace SME)
Purple Drive - McLean, VA
Apply NowJob Description
Title: Observability Architect (Dynatrace SME) Location: McLean, VA - Onsite Only Experience: 8-12 Years Role Overview We are seeking an experienced Observability Architect with deep expertise in Dynatrace to design, implement, and scale end-to-end observability solutions across enterprise applications, infrastructure, and cloud environments. This role will drive observability strategy, enable proactive monitoring, support AIOps/self-healing capabilities, and collaborate with stakeholders to deliver actionable insights that enhance availability, performance, and reliability. Key Responsibilities Define and own the observability strategy using Dynatrace as the core platform, integrating with existing monitoring ecosystems. Architect end-to-end monitoring solutions across applications, microservices, APIs, databases, infrastructure, and cloud workloads. Design and implement service flow mapping, distributed tracing, RUM, and synthetic monitoring frameworks. Enable AI-driven RCA, anomaly detection, and self-healing automation through Dynatrace and its ecosystem. Collaborate with Enterprise Architects, DevOps, SREs, and Application teams to embed observability in CI/CD pipelines. Establish best practices for logs, metrics, and traces instrumentation aligned with OpenTelemetry (OTel) and open standards. Define dashboards, SLIs/SLOs, and KPIs mapped to business outcomes. Provide technical leadership, governance, and roadmap planning for observability adoption across hybrid and multi-cloud environments. Act as a Dynatrace SME, mentoring teams and driving observability maturity across the enterprise. Required Skills & Experience 8-12 years of IT experience with 4+ years in Observability/Monitoring architecture. Strong hands-on expertise in Dynatrace deployment, configuration, and architecture (App Monitoring, Infra Monitoring, RUM, Synthetic, APM). In-depth understanding of application architectures: microservices, containers (Kubernetes, OpenShift, Docker), APIs, and cloud-native apps. Proven experience with cloud platforms (AWS, Azure, GCP) and hybrid/on-premises integrations. Knowledge of ITIL, SRE, and AIOps practices, focusing on proactive incident management and RCA. Familiarity with OpenTelemetry, Prometheus, ELK/EFK, Grafana, ServiceNow integration (preferred). Ability to design dashboards and SLIs/SLOs tied to business KPIs. Strong analytical, problem-solving, and communication skills. Preferred Certifications Dynatrace Certified Professional / Associate / Master Cloud certifications (AWS, Azure, GCP) ITIL Foundation / SRE Foundation
Created: 2026-03-10