StaffAttract
  • Login
  • Create Account
  • Products
    • Private Ad Placement
    • Reports Management
    • Publisher Monetization
    • Search Jobs
  • About Us
  • Contact Us
  • Unsubscribe

Login

Forgot Password?

Create Account

Job title, industry, keywords, etc.
City, State or Postcode

SRE Engineer

InterSources - Austin, TX

Apply Now

Job Description

Job Title: SRE Engineer: Location: Austin [Hybrid] Job Description: We are currently seeking a highly skilled SRE hands-on Lead Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate/help designing and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach. Responsibilities: • Participate in design, architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance. • Primary skillset to be expertise in Observability as service, Telemetry data collection using Dynatrace APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk) and AIOPS Tools • Configure application performance monitoring (APM), infrastructure monitoring, synthetic monitoring, RUM, and log monitoring. • Integrate Dynatrace with CI/CD pipelines, alerting tools, ITSM systems, and incident automation frameworks. • Tune alert thresholds, baselines, and AI-driven anomaly detection to reduce noise and improve actionable insights. • Deeper understanding of Login authentication mechanisms using Ping, ForgeRock and SiteMinder technologies (session management and cookie management) • Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications. • Evangelize SRE evolution within IT operations and promoting a culture of engineering excellence and best practices. • Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation. • Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind. • Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identifying areas for optimization. • Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents. • Analyze resource utilization patterns and forecasting future capacity needs to ensure optimal performance and cost-efficiency. • Ensure that SRE practices align with security and compliance requirements and implementing measures to protect systems and data. • Operational excellence with focus on automation and developing tools to streamline operational tasks and increase efficiency. • Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice.

Created: 2026-03-04

➤
Footer Logo
Privacy Policy | Terms & Conditions | Contact Us | About Us
Designed, Developed and Maintained by: NextGen TechEdge Solutions Pvt. Ltd.