Site Reliability Engineer
Tata Consultancy Services - Phoenix, AZ
Job Description
Site Reliability Engineering (SRE)

Must-Have Technical/Functional Skills
• Core Java, Splunk, Kibana, Grafana
• Databases: Postgres, MongoDB
• Experience in production support engineering or SRE roles, preferably within the banking industry.
• Skilled in L1/L2 support, debugging, performance monitoring, and working in Agile/Scrum environments.
• Hands-on experience with ServiceNow, Spring Boot, REST APIs, and CI/CD pipelines.
• Strong knowledge of cloud services.

Roles & Responsibilities
• Excellent problem-solving skills and the ability to work under pressure in a fast-paced environment.
• Monitor and maintain the health, availability, and performance of production systems and applications.
• Troubleshoot and resolve production incidents, ensuring minimal downtime and service disruption.
• Identify defects and work with development teams to get them fixed based on priority.
• Handle the implementation of RFCs.
• Perform pre- and post-validation of servers during traffic diversion.
• Collaborate with engineering teams to implement reliability best practices and improve system performance.
• Develop and maintain monitoring alerts and dashboards to ensure visibility into system metrics.
• Participate in the on-call rotation and provide timely support for high-impact incidents.
• Implement automation tools and processes to streamline operations and reduce manual workloads.
• Document incidents and solutions for knowledge management and continuous improvement.

Salary Range: $90,000-$100,000 a year
Created: 2026-03-04