SRE Lead / Architect
Cigniti Technologies Inc - Chicago, IL
Apply NowJob Description
Over all IT experience of 18 + years• Relevant experience on Chaos engineering / Resilience / High availability testing of 4 years is must• Implement and lead execution of the chaos engineering Lifecycle - Chaos Test Planning, Chaos Test Designing, and Reporting• Ensure recovery and resilience testing is scheduled, staffed, executed, and documented, including remediation and closure of issues• Ability to analyse the architecture & recommend weak areas that are likely to failure / outages• Ability to work with Business & technology teams to identify and report on resilience / High availability requirements• Ability to work with enterprise architecture and development teams to architect applications for high availability and resiliency• Design, develop and execute automated / continuous Chaos Engineering experiments,• Ability to troubleshoot the failures in CI/CD pipeline• Automate Chaos experiments through chaos engineering tools (Gremlin / Chaos Native / Litmus etc) to run continuously • Hands on experience in Unix/Linux OS environments and operating system internals, file systems, disk/storage and networking protocols.• Strong knowledge on Public cloud platforms - AWS, GCP, Azure• Knowledge on Monitoring, Alerting, Logging• Knowledge on VPC''s, proxy''s, load balancers, availability zones• Ability in diagnosing and debugging complex distributed systems• Tools (any of these) - Gremlin, Chaos Native, Litmus• Strong leadership skills and ability to work in a cross-functional environment• Strong interpersonal, oral, and written communication skills• Strong analytical, organizational, and decision-making skills - provided by Dice
Created: 2025-11-15