Senior Manager, Platform Reliability & Automation
Applied Medical - EUROPE - Rancho Santa Margarita, CA
Apply NowJob Description
Position DescriptionApplied Medical is seeking a Senior Manager, Platform Reliability & Automation to lead a team responsible for infrastructure reliability, automation, and observability. This role ensures our platforms are stable, scalable, and well-governed.The ideal candidate brings a clear, practical leadership style, fosters cross-functional alignment, and builds high-performing teams grounded in accountability, transparency, and continuous improvement.Key ResponsibilitiesLeadership & Team DevelopmentBuild and lead a team that focuses on infrastructure reliability and automation.Foster a culture of ownership, curiosity, and continuous learning.Define goals and metrics tied to system reliability, performance, and improvement.Platform Resiliency StrategyDrive a proactive approach to uptime, performance, and incident response.Define and manage SLOs/SLIs and use post-incident reviews to reduce risk and downtime.Automation & Governance: Promote infrastructure-as-code and automation-first practices (e.g., Terraform, Ansible, CI/CD).Eliminate manual toil and improve consistency through standardized automation.Set and enforce reliability and automation standards across platforms.Observability & PerformanceBuild robust observability practices across logs, metrics, and traces.Use telemetry to guide performance tuning and improve platform efficiency.Provide other teams with clear visibility into system health.Cross-Functional PartnershipCollaborate with infrastructure, security, and application teams to improve system maturity.Drive shared accountability for platform uptime and municate goals and progress clearly across both technical and non-technical audiences.Performance ObjectivesFirst 30 DaysLearn the current landscape of platform reliability, automation, and monitoring.Build relationships with stakeholders and align on priorities.Begin shaping the foundational vision and roadmap for resiliency and automation.Next 60 DaysLaunch pilot projects for alerting, CI/CD automation, and infrastructure tuning.Propose operating models for on-call, incident response, and proactive remediation.Define initial KPIs and success measures.By Day 90Finalize team structure and reliability operating model.Expand automation and observability coverage across services.Present roadmap, accomplishments, and scale strategy to leadership.Position RequirementsTen or more years of engineering experience, including five or more years in platform or infrastructure leadership roles.Demonstrated ability to lead transformation initiatives while mentoring and scaling high-performing technical teams.Passion for growing people and inspiring a culture of excellence.Ability to lead cross-functional collaboration and uphold company culture.Strong knowledge of SRE principles, observability frameworks, and automation tooling.Strategic mindset with a passion for resilient, reliable systems.Strong knowledge of modern systems architectures and design principles.PreferredHands-on familiarity with Terraform, Ansible, CI/CD pipelines, and observability tools.Experience with Microsoft Azure or Hybrid Cloud Infrastructures.BenefitsCompetitive compensation range: $110,000 - $160,000 / year (California).Comprehensive benefits package.Training and mentorship opportunities.On-campus wellness activities.Education reimbursement program.401(k) program with discretionary employer match.Generous vacation accrual and paid holiday schedule.Please note that the compensation range may be adjusted in the future, and bonus and incentive compensation plans may apply.Our total reward package reflects our commitment to employee growth and well-being, as we invest in your development and offer a range of benefits designed to enhance your career and life.All compensation and benefits are subject to plan documents and written agreements.Equal Opportunity EmployerApplied Medical is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, disability (mental and physical), exercising the right to family care and medical leave, gender, gender expression, gender identity, genetic information, marital status, medical condition, military or veteran status, national origin, political affiliation, race, religious creed, sex (including pregnancy, childbirth, breastfeeding and related medical conditions), or sexual orientation, or any other status protected by federal, state or local laws in the locations where Applied Medical operates.Applied Medical is seeking a Senior Manager, Platform Reliability & Automation to lead a team that will be responsible for infrastructure reliability, automation, and observability. This role is essential to ensuring our platforms are stable, scalable, and well-governed.The ideal candidate brings a clear, practical leadership style, fosters cross-functional alignment, and builds high-performing teams grounded in accountability, transparency, and continuous improvement. The ability to lead with clarity and purpose is critical to success in this role. #J-18808-Ljbffr
Created: 2025-09-17