StaffAttract
  • Login
  • Create Account
  • Products
    • Private Ad Placement
    • Reports Management
    • Publisher Monetization
    • Search Jobs
  • About Us
  • Contact Us
  • Unsubscribe

Login

Forgot Password?

Create Account

Job title, industry, keywords, etc.
City, State or Postcode

InformationTechnology - Senior Site Reliability ...

Pacer Staffing - St Louis, MO

Apply Now

Job Description

Senior Site Reliability Engineer Remote Position Purpose: Helps lead projects that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes and continuous delivery(CI/CD) tools, processes, and designs. Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents. Understands and advocates for standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams through out the deployment process. Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability. Education/Experience: A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science) and Requires 4 - 6 years of related experience. Or equivalent experience acquired through accomplishments of applicable knowledge, duties, scope and skill reflective of the level of this position. Technical Skills: One or more of the following skills are desired. Experience with Linux Operating System; Operating Systems; Unix Operating System; Windows Operating System Experience with Other: Experience with observability/monitoring tools such as Splunk, Dynatrace, Elastic, New Relic, Prometheus, Grafana Experience with Other: enterprise level CICD Tools such as Ansible, Jenkins, Cloudbees, OpenShift Experience with Other: working in public cloud platforms like AWS and Azure Experience with Programming Tools Experience with Other: building and operating highly scaled applications Experience with MongoDB; MySQL; Oracle Database Management System (DBMS); PL SQL; SQL (Programming Language) Experience with Other: varying code repositories, auto deployments, branching with tools such as Gitlab, Bitbucket, Subversion Experience with Other: IT service management tools such as Service Now, Atlassian, BMC Soft Skills: Intermediate - Seeks to acquire knowledge in area of specialty Intermediate - Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions Intermediate - Ability to work independently Intermediate - Demonstrated analytical skills Intermediate - Demonstrated project management skills Intermediate - Demonstrates a high level of accuracy, even under pressure Intermediate - Demonstrates excellent judgment and decision making skills Responsibilities: Troubleshoots and resolves more complex problems with systems and services and initiates regular deployment of new versions of the systems and their subcomponents Leads more complex projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility. Helps make decisions around periodic system validation and testing, service monitoring, and standing up new services/tools Uses knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization Identifies and implements necessary manual and automated procedures for improved collaborative response in real-time Leads lower level Engineers in stress, security, and performance testing Resolves issues that come up through support escalation Keeps documentation and runbooks up to date to effectively deal with new incidents that might arise Leads post incident reviews and documents findings for future informed decision making Reviews proposals to optimize Software Development Life Cycle (SDLC) to boost service reliability and makes decisions around which proposals should move forward. Communicates complex topics with development teams to investigate and document issues and leads internal team to develop solutions to mitigate them Performs other duties as assigned Complies with all policies and standards

Created: 2026-03-04

➤
Footer Logo
Privacy Policy | Terms & Conditions | Contact Us | About Us
Designed, Developed and Maintained by: NextGen TechEdge Solutions Pvt. Ltd.