Director of Cloud & AI Infrastructure
Rogue Fitness - Columbus, OH
Apply NowJob Description
Director of Cloud & AI Infrastructure page is loadedDirector of Cloud & AI InfrastructureApply remote type On-Site locations Columbus, Ohio time type Full time posted on Posted Yesterday job requisition id R-102114Job Description:OverviewRogue Fitness is accelerating AI across manufacturing, warehousing, and e‑commerce and needs a hands-on leader to build the MLOps platform, enhance SRE, and keep our hybrid infrastructure state‑of‑the‑art.You’ll own everything from GPU clusters and data pipelines to uptime, security, and factory networks, partnering with Technology and Solutions Directors to deploy AI at speed.Your mandate: deliver measurable gains in throughput, cost savings, and customer experience to keep Rogue ahead of the pack.ResponsibilitiesAI Infrastructure:Build and lead the cross‑functional team that designs, scales, and operates Rogue’s production MLOps platform—covering data pipelines, model versioning, automated deployments, and real‑time monitoring across on‑prem and cloud GPU clustersOwn reliability, performance, and cost management for all AI compute and storage—capacity planning, incident response, and continuous optimization to meet SLA/SLO targetsSite Reliability:Direct the SRE organization that safeguards and all internal apps—defining SLIs/SLOs, automating CI/CD pipelines, and ensuring release velocity without sacrificing stabilityDrive proactive reliability engineering: establish unified observability, conduct capacity and chaos testing, and lead rapid incident response to keep MTTR low and uptime above targetsOwn continuous improvement of performance, scalability, and cost efficiency—partnering with product and infrastructure teams to embed reliability best practices from design through deploymentTraditional Infrastructure:Oversee end‑to‑end operations of on‑premises and cloud infrastructure—Windows/Linux servers, storage, backups, DR, networks, and collaboration platforms—managed through infrastructure‑as‑code and real‑time dashboardsLead lifecycle planning and execution for upgrades, migrations, and capacity expansions, enforcing disciplined change control, budget stewardship, and clear communication to stakeholdersEstablish and monitor service performance and security standards (availability, latency, compliance) while mentoring engineering staff and aligning roadmaps with business objectivesRequired QualificationsMaster’s degree in Computer Science, Electrical Engineering, or a related technical field10+ yrs experience in hybrid infrastructure with 3+ yrs as a manager4+ years running production MLOps pipelines4+ years leading SRE/DevOps practices: CI/CD, metrics, rapid rollbacksHands-on with Azure/GCP, Windows/AD, Google Workspace, virtualization, Terraform/Bicep, modern observabilityNetworking & security team lead: VLANs, firewalls, Zero‑Trust, incident responsePreferred QualificationsManaged ML Ops and SRE teams for highly dynamic companiesExperience in online retail, manufacturing, and/or warehousing companiesOT experience: Modbus, segmented VLANs, factory networksTuned edge‑security services like Cloudflare, Google Cloud Armor, or AWS WAFBy applying to Rogue, regardless of the platform you choose to use, you are agreeing to Rogue's preferred methods of communication (i.e. text message). Submitting an application, through whatever online forum is ultimately used, constitutes a knowing and voluntary agreement to send and receive text messages during the recruitment process.Similar Jobs (1)Director of Engineeringremote type On-Site locations Columbus, Ohio time type Full time posted on Posted 30+ Days AgoRogue Fitness is the leading manufacturer of strength and conditioning equipment, including barbells, power racks, sleds, and accessories. Founded in a garage in 2006, the company has grown to over 1400 team members globally. Rogue is the official equipment supplier of the CrossFit Games, USA Weightlifting, the Arnold Strongman Classic, and the World’s Strongest Man competition. The company remains dedicated to serving the needs of serious athletes at every level, from the garage to the arena. #J-18808-Ljbffr
Created: 2025-09-28