Senior Technical Consultant - AI Networking and RDMA
Oracle - Olympia, WA
Apply NowJob Description
Job Overview We are seeking a dynamic and experienced Senior Technical Consultant to spearhead the design and implementation of cutting-edge networking solutions for large-scale AI systems. In this hands-on role, you will be a key player in architecting and developing production-quality features across SmartNICs, programmable switches, and end-host networking stacks, emphasizing real-world deployment, performance, and robustness. Key Responsibilities Data Plane & Offload Development: Innovate and implement multi-path-aware routing algorithms for SmartNICs and switches. Create high-performance data plane logic utilizing low-level systems programming and hardware SDKs for routing, load-balancing, and real-time telemetry offloads. Enhance packet processing pipelines to ensure deterministic low-latency and high-throughput performance. Congestion Control & Transport Algorithms: Develop and implement sophisticated congestion control mechanisms using comprehensive network telemetry (e.g., queue depth, loss, latency, link health). Design transport-level solutions that adaptively select paths and adjust rates to optimize large-scale AI fabrics. ML Collective & RDMA Systems: Create collective communication transports tailored for ML workloads, ensuring high performance amid congestion and system failures. Scale and implement RDMA across multipath environments while ensuring correctness and respecting ordering guarantees. Extend RDMA-based architectures to integrate with storage and data services for improved efficiency and reduced latency. Validation & Performance Engineering: Establish frameworks, simulators, and proof-of-concept models to validate functionalities, scalability, and resilience. Analyze and enhance performance across hardware (NIC/switch), kernel/firmware, and user space. Troubleshoot and resolve complex issues in distributed and low-level systems swiftly. Qualifications Over 10 years of experience in systems, networking, or kernel-level engineering, demonstrating deep technical expertise. Proven knowledge in RDMA, congestion control, transport protocols, and high-performance packet processing. Strong skills in executing complex, hardware-near algorithms with a focus on precision and performance. Expertise in debugging cross-layer, distributed, and low-level challenges. Preferred Qualifications Experience with large-scale AI training/inference infrastructure. Familiarity with programmable networking environments and hardware SDKs. History of delivering complex, production-quality networking solutions at scale. Additional Information Certain customer-facing roles may require compliance with applicable requirements, such as immunization and occupational health mandates. The salary range for this role is from $96,800 to $251,600 per annum, with potential eligibility for bonuses, equity, and deferred compensation. Oracle provides a comprehensive benefits package, including medical, dental, and vision insurance, life insurance, retirement plans, paid time off, and more. Oracle is committed to fostering a diverse workforce and inclusive environment. We encourage all qualified candidates to apply, regardless of race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or veteran status.
Created: 2026-03-07