Compute Infrastructure Deployment Lead
OpenAI - San Francisco, CA
Job Description
About the Team

The Industrial Compute team builds and operates the infrastructure behind OpenAI’s research and products. We design for scale, performance, and adaptability, bridging physical and logical layers so frontier workloads run efficiently across a global […]. In recent months, this team enabled large-scale compute systems, built foundational network infrastructure, partnered with engineering to unlock major compute expansions, and worked with compute, inference, and storage systems engineering to materially lower serving cost while improving performance.

Our mandate spans power, compute, network, manufacturing and assembly, operations, scheduling, orchestration, and the broader ecosystem needed to enable OpenAI’s next generation of systems.

About the Role

You have deep technical experience (infrastructure / systems engineering, TPM, or product) and move comfortably between system detail and program execution. Your charter will be to deliver step-function improvements in cost, capability, capacity, reliability, and time-to-ready across OpenAI’s infrastructure and compute platform.

You’ll turn ambitious goals into tightly scoped plans, run through blockers, and drive projects to production, owning all aspects of the stack: from strategy, to technical problem definition, to vendor engagement, through to a clean handoff to execution teams. You’ll collaborate with partners across the org (engineering, capacity planning, research, infra, product, finance, and business development) to produce the technical, operational, and commercial outcomes needed to make these bets real.

Near-term focus may span things such as driving object-storage direction and roll-out; building out our backbone and shipping several interconnect PoPs to run at multi-Tbps scale across the world; and collaborating with our partners to deliver usable FLOPs as fast as possible.

In this role, you will:

- Ship cross-stack, highly technical infrastructure programs end-to-end: frame the problem, define requirements, run fast validations, and deliver to production with clear success metrics.
- Own outcomes end-to-end, and move fast to validate approaches while balancing total cost, performance, and operability.
- Maintain precise technical intuition while shipping pragmatic solutions to complex, ambiguous infrastructure problems.
- Manage external vendors and partners and make sound calls on cost, performance, and operability to deliver usable FLOPs faster.
- Operate both in the weeds and at a system level to drive decisions: own the plan of record, force clear trade-offs, and move decisively to unblock execution.

You may be a good fit if you:

- Have led ambiguous, cross-functional infrastructure work to tangible outcomes without waiting for perfect information.
- Default to action, can get things done, and enjoy switching between strategy and hands-on work to fully own projects end-to-end.
- Sweat the details and are outcomes-driven, owning technical nuances while also zooming out for the full picture.
- Have a humble attitude and a desire to pick up whatever knowledge you're missing to successfully deliver infrastructure systems.
- Operate with high horsepower, have strong problem-solving skills, are adept at frequent context switching, manage multiple projects at once with ownership, and ruthlessly prioritize.
Created: 2025-11-07