Technical Program Manager ( {{city}})
BrickRed Systems - Mountain View, CA
Apply NowJob Description
We are seeking a Senior Technical Program Manager (TPM) with a background in distributed AI, resource management, forecasting, capacity and strategic planning to join our team. This role involves supporting platform operations, handling customer escalations, and monitoring cluster health. Additionally, you'll ensure optimal compute resource allocation aligned with product, sales, and research priorities and drive decisions with data, analysis, and reporting in GenA I. This job is fast-paced, cross-functional, and requires strong communication, prioritization, and organization skills. It involves a deep understanding of stakeholder management, and the ability to navigate an environment with passionate people intent on delivering valuable products. Key Responsibilities: Act as a single point of contact for escalations from sales and global support teams and help with various billing and support issues Drive innovation and deliver high-quality products by ensuring that AI teams have the necessary GPU and resources. Improve product margins by leading strategic initiatives to optimize GPU utilization and procurement. Establish and maintain effective communication with technical and non-technical stakeholders and customers, including regular project updates, status reports, and presentations. Deliver step-level improvements with compute management, efficiency and scalability by identifying and implementing process improvements. Ensure strategic alignment across Sales, Global Support, and Engineering Qualifications & Experience: 6+ years of professional experience with a Bachelors degree and related experience in technical program management, distributed platforms, resource management, execution and strategic planning. Proven track record of driving cross-functional teams to deliver complex technical projects on time and with high quality. Excellent communication, negotiation and analytical skills, with the ability to document standard operating procedures and processes Advanced working SQL Knowledge, Ability to build and maintain analytics to track, forecast, and visualize consumption through ad-hoc SQL, reports, and dashboards Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. Self-motivated and able to work independently, as well as in a team environment. Preferred good working knowledge of GPU technology and its applications in generative AI and machine learning. Familiarity with big data technologies such as Apache Spark, Delta Lake, and MLflow is a plus. Experience with compute capacity management, as well as financial analysis or sales/deal desk quoting, is a plus. Location is Mountain View (preferred) first choice, San Francisco & Seattle second choice About BrickRed Systems: BrickRed Systems is a global leader in next-generation technology, consulting, and business process service companies. We enable clients to navigate their digital transformation. BrickRed Systems delivers a range of consulting services to our clients across multiple industries around the world. Our practices employ highly skilled and experienced individuals with a client-centric passion for innovation and delivery excellence. With ISO 27001 and ISO 9001 certification and over a decade of experience in managing the systems and workings of global enterprises, we harness the power of cognitive computing hyper-automation, robotics, cloud, analytics, and emerging technologies to help our clients adapt to the digital world and make them successful. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem.
Created: 2025-09-11