Site Reliability Production Engineer (AI/ML Ops)
Kforce - Smithfield, RI
Apply NowJob Description
Kforce has a client that is seeking a Site Reliability Production Engineer (AI/ML Ops) in Smithfield, RI.nnKey Tasks:n Analyze system and application metrics to improve performance, reliability, and fault detectionn Partner closely with engineering teams to design, build, deploy, and support resilient servicesn Contribute to system design reviews, platform management, and capacity planningn Build sustainable automation to reduce manual effort and operational overheadn Develop and refine SLI/SLO/SLA frameworks to balance speed, reliability, and customer experiencen Improve observability across environments using modern tools and practicesn Identify, prototype, and implement automation using scripting, infrastructure tooling, and AI/LLM-based solutionsn Diagnose and tackle complex issues across distributed systems and end-user computing environmentsn Evaluate new technologies, patterns, and tools to drive continuous improvementn Create and deliver high-quality technical content, remote actions, and workflows to enable self-service and operational efficiency
Created: 2026-03-24