
Senior Principal Engineer - AI Networking
Job Description
Oracle is seeking a highly experienced Lead Principal Software Engineer (IC5) to help define and build the next generation of AI networking infrastructure powering large-scale GPU clusters and distributed AI training platforms. This role focuses on high-performance networking, RDMA-based services, collective communication libraries, congestion management, resiliency, and software platforms that enable efficient operation of large-scale AI workloads.
In this role, you will help build the networking and communication foundation for Oracle's AI infrastructure, enabling efficient scaling of cutting-edge AI workloads across large GPU clusters. Your work will directly influence the performance, reliability, and scalability of systems powering some of the industry's largest AI deployments.