Infrastructure Capacity Engineer Palo Alto; San ...
Perplexity AI Inc. - Palo Alto, CA
Job Description
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Perplexity has raised over $1B in venture investment from some of the world’s most visionary and successful leaders, including Elad Gil, Daniel Gross, Jeff Bezos, Accel, IVP, NEA, NVIDIA, Samsung, and many more. Our objective is to build accurate, trustworthy AI that powers decision-making for people and assistive AI wherever decisions are being made. Throughout human history, change and innovation have always been driven by curious people. Today, curious people use Perplexity to answer more than 780 million queries every month, a number that’s growing rapidly for one simple reason: everyone can be curious.

Perplexity is seeking an experienced Infrastructure Capacity Engineer to own our infrastructure scaling, capacity planning, and resource optimization across our AI/ML infrastructure. The ideal candidate will have deep experience in large-scale distributed systems, capacity modeling, and infrastructure efficiency optimization to support our rapidly growing AI products and user base.

Responsibilities
- Design and implement comprehensive capacity planning models and forecasting systems that predict infrastructure needs across compute, storage, and network resources for our AI/ML workloads
- Build and maintain automated capacity management systems that dynamically scale our infrastructure based on real-time demand patterns and usage forecasts
- Lead cross-functional capacity planning initiatives including hardware procurement, data center expansion, and cloud resource optimization
- Develop sophisticated monitoring and alerting systems that provide early warning indicators for capacity constraints and performance degradation
- Create and maintain detailed infrastructure capacity models that account for seasonal patterns, product launches, and scaling efficiency across different workload types
- Optimize resource utilization and cost efficiency through advanced placement algorithms, load balancing strategies, and infrastructure rightsizing
- Design and implement disaster recovery and business continuity plans that ensure service availability during infrastructure failures or capacity emergencies
- Collaborate with Site Reliability Engineering and Platform teams to establish capacity-aware deployment strategies and infrastructure automation
- Play a leading role in defining the capacity engineering discipline within Perplexity’s engineering organization

Qualifications
- 4+ years of experience in infrastructure capacity planning, systems engineering, or related technical roles at scale
- Proven experience managing infrastructure capacity for high-growth technology companies, preferably with AI/ML workloads or real-time systems
- Strong background in distributed systems architecture, cloud infrastructure (AWS/GCP/Azure), and container orchestration (Kubernetes)
- Experience with capacity modeling tools, forecasting methodologies, and statistical analysis for infrastructure planning
- Proficiency in programming languages such as Python, Go, or similar for automation and tooling development
- Deep understanding of infrastructure monitoring, observability, and performance optimization techniques
- Experience with infrastructure-as-code tools (Terraform, Ansible) and CI/CD pipelines for infrastructure management
- Strong analytical and problem-solving skills with the ability to make data-driven decisions under uncertainty
- Excellent cross-functional collaboration skills and experience working with engineering, product, and business stakeholders
- Experience with large-scale database systems, caching layers, and content delivery networks preferred
- Background in AI/ML infrastructure, LLM inference, GPU cluster management, or high-performance computing is a plus

Compensation: Our cash compensation range for this role is $225,000 - $300,000. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.

Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

Work model: Perplexity has an office-centric work model with 4 days per week in the office at one of the following locations: San Francisco, Palo Alto, or New York.
Created: 2025-09-17