GPU / CUDA Engineers (Multiple Openings)
Greylock Partners - San Francisco, CA
Apply NowJob Description
Overview Several growth-stage investments in San Francisco, CA are looking for experts in GPU Optimization / Inference Acceleration. Responsibilities Primarily focused on GPGPU programming to increase the performance of the product -- writing, debugging, and optimizing CUDA code from GPU kernel-level on upward to improve the holistic performance of new AI models Play a key role creating all of the tooling and associated infrastructure to increase the performance of the company -- from fairly straight-forward projects (profilers) to incredibly complex (new inference engines) Qualifications Proven background in CPU acceleration and/or GPU optimization (latter preferred) with a strong preference toward candidates who have expertise in CUDA Kernel hacking Experience working in deep learning environments and/or on products targeting high-performance ML systems Strong coding skills in high-performance environments (C/C++) About Us About Us: Greylock is an early-stage investor in hundreds of remarkable companies including Airbnb, LinkedIn, Dropbox, Workday, Cloudera, Facebook, Instagram, Roblox, Coinbase, Palo Alto Networks, among others. More can be found about us here: How We Work How We Work: We are full-time, salaried employees of Greylock and provide free candidate referrals/introductions to our active investments. We will contact anyone who looks like a potential match--requesting to schedule a call with you immediately. Seniority level Mid-Senior level Employment type Full-time Job function Engineering and Information Technology Industries Technology, Information and Internet and Software Development #J-18808-Ljbffr
Created: 2025-09-17