Senior Manager, Reliability Engineering
Okta - Chicago, IL
Job Description
Get to know Okta

Okta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we're looking for lifelong learners and people who can make us better with their unique experiences.

Join our team! We're building a world where Identity belongs to you.

Senior Manager, Reliability Engineering

We are looking for an experienced technical leader to join our Infrastructure and Operations team. The Reliability Engineering Organization (REO) is responsible for providing observability, cloud platform engineering, system engineering and release management capabilities for our corporate Okta ecosystem.

Our ideal candidate is an experienced infrastructure practitioner who has worked in multiple technical realms. They are a strong leader, use Agile to guide their organization to success, and welcome the challenge of building in a dynamic and ever-changing environment. They enjoy seeing their strategic designs run at scale with automation, testing, and an excellent operational mindset. Most importantly, they are interested in bringing a culture of operational excellence and ownership to a highly experienced and successful group of engineers.

If you love removing roadblocks, fostering an environment of continuous growth, believe in the servant leader mentality and strive to learn as much from your employees as they learn from you, then we want to hear from you!

What you'll be doing

- Mentor, manage, and lead multiple teams and managers across a variety of technical and operational spaces
- Be an advocate for SRE, observability and release management best practices, and be a leading voice in initiatives and projects to further advance BT's REO strategic goals and Okta's security maturity
- Create and maintain Agile practices that provide transparency and clarity for engineers and key metrics for leadership stakeholders
- Help expand our BT SRE program and processes, recommending and implementing tooling and services, especially in the realms of observability, automation, and systems engineering
- Work closely with global partners in BT, Security, GTM, and Marketing to gather requirements and build a strategic vision for delivering capabilities across Okta
- Help align the organization on solutions and be a thought leader within the teams to ensure technical solutions are secure, scalable, and following industry best practices
- Advocate and present strategy and accomplishments to senior leadership using clear communication and metrics
- Manage vendor relationships and budgets for various tools and services leveraged by Business Technology REO
- Maintain Release Management processes for SOX and FedRAMP managed applications and understand release impact to regulated applications and environments

What you'll bring to the role

- 10+ years of experience in SRE, Observability, or DevOps Platform Engineering, with at least 7 years in a leadership or management capacity
- Good communication skills, with the ability to communicate complex technical concepts and strategic roadmaps to different audiences
- Proficient with Agile methodology and understanding how Agile practices can inform project timelines and provide organizational transparency
- Proficient with reliability engineering concepts and security best practices on public cloud platforms, especially AWS
- Demonstrated ability to drive complex applications for cloud infrastructure at scale and deliver projects on schedule and within budget
- Familiarity with observability tools including Splunk, New Relic, CloudWatch, Prometheus/Grafana
- Experience with systems engineering practices, including fleet management, configuration management, OS patching and hardening, and performance monitoring
- Experience with infrastructure as code concepts and operational engineering practices
- Experience with developing tooling and automation in Bash, Python, Go, etc.
- Proficient with building and maintaining engineering enablement functions, especially developer tools that leverage Git, GitHub, and CI/CD pipelines
- Familiar with Linux administration, containerization, Kubernetes (especially EKS), and networking concepts
- Familiarity with working in FedRAMP High/IL4 or Moderate infrastructure environments, especially managing FIPS, STIG, and data boundary implementations

Additional requirements:

This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g., a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee; 22 CFR 120.15) upon hire.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws.
Created: 2025-10-07