Principal Engineer - AI GPU Innovation
Oracle - St Paul, MN
Apply NowJob Description
Job Description Join Oracle Cloud Infrastructure's (OCI) architecture development engineering team as a Principal Engineer focused on GPU platform software and system development. We are leading the charge in AI innovation, paving the way for the next generation of AI accelerators and advanced hardware solutions. As a Senior Principal Software Engineer, you will engage in evaluating, prototyping, and optimizing cutting-edge AI hardware and AI accelerators, including custom-designed AI chips and systems, to bolster next-gen Cloud AI Infrastructure platforms. Your contributions will shape platform definition and oversee platform development, including design reviews, system integration, and performance testing. You will work closely with third-party GPU IC suppliers, internal hardware teams, and software developers to drive Oracle’s AI Cloud solutions forward. Your efforts will be pivotal in developing Oracle's evolving Cloud AI infrastructure. You will explore the latest AI hardware architectures, benchmark their performance, and collaborate with software engineers for tight integration with AI workloads. Your role will significantly influence the future of AI hardware for machine learning and deep learning applications. Responsibilities Analyze system architecture and evaluate proposed implementation pathways. Collaborate with hardware design and development teams on architecture and troubleshooting of AI hardware platforms. Engage with Oracle's broader engineering and operations groups, as well as external partners. Conduct thorough benchmarking and performance analysis of AI accelerators from emerging hardware vendors. Assess new AI accelerators against industry-standard hardware for training and inference workloads. Create tools and processes for evaluating hardware performance in real-world AI applications. Contribute to designing and optimizing performance algorithms for running AI models on hardware. Basic Qualifications Bachelor's or Master's degree in Computer Science or a related technical field with coding experience. 10+ years of software development experience. Ability to write proficient code in Java, GoLang, C#, or similar object-oriented languages. Strong understanding of AI and GPU platform architecture. Experience with high-scale distributed services infrastructure. Practical experience with GPU supplier test code and open-source AI tools. Experience with modern server platform architecture, including x86 and ARM platforms. Proven ability to debug and root-cause complex hardware and software issues. Strong problem-solving abilities, excellent communication skills, and a profound sense of ownership. Preferred Qualifications Technical lead experience on large-scale cloud services. Hands-on experience with services on public cloud platforms. Familiarity with AI accelerator chips. Experience with AI benchmarks and performance evaluation tools. Understanding of AI model optimization techniques for hardware acceleration. Skills in running firmware and system diagnostics using relevant tools and scripting for test customization. Disclaimer: Certain US customer or client-facing roles may be subject to immunization and occupational health requirements. Compensation: The hiring range in the US is $96,800 to $223,400 per year, with potential eligibility for bonuses and equity. Benefits: Oracle offers comprehensive benefits, including medical, dental, and vision insurance, short and long-term disability, life insurance, flexible spending accounts, a 401(k) plan with company match, paid time off, and more. Join Oracle to harness the power of innovation and contribute to a collaborative workforce that values diversity and encourages community involvement.
Created: 2026-03-04