Sr Engineer, Site Reliability - Network Supply Chain
T-Mobile USA, Inc - Bellevue, WA
Apply NowJob Description
At T-Mobile, we invest in YOU Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That's how we're UNSTOPPABLE for our employeesJob OverviewThe Senior Site Reliability Engineer at T-Mobile plays a crucial role in enhancing system reliability and resilience, facilitating faster and more efficient software development and deployment. They utilize their strong problem-solving and analytical skills to automate processes, reducing manual effort and preventing operational incidents. Their expertise in programming and scripting languages, incident response management, and various tech tools contributes to the robustness and efficiency of our systems. By continuously learning new skills and technologies, they adapt to changing circumstances and drive innovation. Their work and expertise contribute significantly to the stability and performance of T-Mobile's digital infrastructure.The Sr Site Reliability Engineer is responsible for ensuring stability, security, and performance of Network Supply Chain ecosystem including SC Digital, o9 Order Management & Planning Systems, Network Asset Lifecycle Management system, SAP ERP system, and 3PL Systems. While this position includes hands-on pro-active maintenance, it also focuses majority of the time on technical leadership, which includes driving the vision to ensure stability and security, proposing the optimal solution for bringing efficiencies, coaching engineers and junior team members to follow best practices and, participating in advanced troubleshooting of production and pre-production systems. They own production as well as non-production environments and actively collaborate on architectural, technological, and infrastructural discussions for both current ecosystem and future strategy.Job Responsibilities:Utilizes fluent knowledge and skill in emerging DevOps-centric automation tools and technologies for CI/CD, configuration management, etc. for non-prod environments.Manages Network Supply Chain production and non-production environments for SC Digital Layer, o9 Planning and Order Management systems, Network Asset Lifecycle Management system (CATS and SiteHound), SAP ERP systems, and 3PL Systems.Performs environment management, automated server provisioning, pipeline configuration (VMs).Delivers software to improve the availability, scalability, latency, and efficiency of T-Mobile's services.Creates, manages, and uses dashboard for continuous monitoring and health check of applications, and the underlying infrastructure, improves the quality of services using the monitoring feedback for non-production environment.Contributes to future improvements of software delivery processes and operations, e.g., cloud enablement, and use of microservices with containerization.Relationship and People Management: Mentors/guides other Systems Reliability Engineers and vendor resources as needed.Also responsible for other Duties/Projects as assigned by business management as needed.Education and Work Experience:Bachelor's Degree Computer Science, Engineering or related field (Preferred)Master's/Advanced Degree Computer Science, Engineering or related field (Preferred)4-7 years - Working in operations or develops environments4-7 years - Troubleshooting customer related issues and managing customer relationships4-7 years - Developing software solutions using Python or similar programming languagesKnowledge, Skills and Abilities:4-7 of progressive experience in software engineering/maintenance across multiple products, systems and/or platforms coupled with strong business acumen.
Created: 2025-12-05