• AI / HPC Network

    Meta (Menlo Park, CA)
    …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI / HPC Networks at scale 2. Provide continual… more
    Meta (10/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Architect, AI

    NVIDIA (Santa Clara, CA)
    …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... runtime designs, and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX,… more
    NVIDIA (10/30/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC System Performance Engineer

    Meta (Menlo Park, CA)
    …5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques **Minimum ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead… more
    Meta (11/06/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active member of ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
    Meta (11/18/25)
    - Save Job - Related Jobs - Block Source
  • HPC Technical Account Manager

    NVIDIA (Santa Clara, CA)
    …to stand out from the crowd: + Experience in solving problems in large-scale HPC network environments with overlay technologies (BGP, OSPF, VXLAN, EVPN), RoCE ... part of the role is also to interact with Engineering , Marketing, and Support teams regularly. What you will...installing our products with a focus on Infiniband, next-generation AI , and HPC server technologies. + Own… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer - Genomics…

    NVIDIA (Santa Clara, CA)
    …challenges and provide outstanding HPC solutions. + Collaborate closely with hardware engineering , CUDA engineering , and AI research groups to apply the ... healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as...integrating genomic solutions into mainstream healthcare. As a healthcare HPC engineer, you will join a dynamic development team… more
    NVIDIA (11/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior GPU and HPC Infrastructure Engineer…

    NVIDIA (Santa Clara, CA)
    NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more
    NVIDIA (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer - High-Performance…

    NVIDIA (Santa Clara, CA)
    …Familiarity with datacenter automation, advanced network protocols, and supporting large HPC or AI clusters in production environments. + Understanding of ... , or related field, or equivalent experience. + 8+ years of proven experience in AI / HPC Infrastructure. + Familiarity with AI / HPC job schedulers and… more
    NVIDIA (11/11/25)
    - Save Job - Related Jobs - Block Source
  • Technical Program Manager, AI

    Meta (Menlo Park, CA)
    …operate in a multi-organization landscape. **Required Skills:** Technical Program Manager, AI Network Infra Responsibilities: 1. Lead technical program ... and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps...matrix organization covering a range of areas (Data Center, Network , Hardware Systems, Infrastructure Engineering , Software … more
    Meta (11/19/25)
    - Save Job - Related Jobs - Block Source
  • Principal Network Engineer - DC…

    NVIDIA (Santa Clara, CA)
    …networking problems for scalable AI clusters. This is a hands-on network engineering position focused on the architecture, design, development and deployment ... We are seeking a highly skilled Principal Network Engineer to join our dynamic team to...and deployment of global-scale DCs inter-connects and fabric for HPC , AI , and GPU computing clusters. +… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer, Networking…

    Oracle (Sacramento, CA)
    AI Infrastructure Innovation team is pioneering the creation of next-generation AI / HPC networking for GPU superclusters at massive scale. Our mission is ... system design, and implementation for high-performance RDMA solutions across OCI's AI / HPC platforms, including frontend and backend fabrics. + Innovate… more
    Oracle (12/10/25)
    - Save Job - Related Jobs - Block Source
  • AI Factory Digital Twin Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA's AI Factories are built to accelerate AI and HPC workloads. At their core the Digital Twin (physics-based model used to design, validate, and operate ... be shaping the digital and physical foundation of NVIDIA's AI Factories, engineering virtual replicas that not...to stand out from the crowd + Background in AI / HPC data center cooling, including immersion and… more
    NVIDIA (10/22/25)
    - Save Job - Related Jobs - Block Source
  • Architect, AI Compute, OCI, NA

    Oracle (Sacramento, CA)
    …at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... automation, and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging technologies like… more
    Oracle (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Research Scientist, AI & Systems Co-design…

    Meta (Menlo Park, CA)
    …many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network and storage. The team invests significantly ... develop and help productionize high performance software & hardware technologies for AI at datacenter scale. We achieve this via concurrent design and optimization… more
    Meta (11/07/25)
    - Save Job - Related Jobs - Block Source
  • Deloitte - Infrastructure Engineering

    Deloitte (Los Angeles, CA)
    …Maintain up-to-date knowledge of advancements in AI , cloud computing, network technologies, infrastructure automation, and trends in HPC infrastructure, ... modern data centers , to enabling the adoption of AI or high-performance computing ( HPC ), you'll gain...path that can evolve toward ML engineer ing , AI architect ure , cloud adoption, network more
    Deloitte (12/05/25)
    - Save Job - Related Jobs - Block Source
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …bridge between development engineering and hands-on users in the field. The AI Application Engineer will [1] integrate Quadric product and software stack into ... co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of...+ Bachelor's or Master's in Computer Science and/or Electronics Engineering field. + 5+ years experience with AI more
    quadric.io, Inc (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Developer - AI /ML

    Oracle (Sacramento, CA)
    …We are seeking a Principal Software Developer (IC4) with deep expertise in AI /ML system design, large-scale data engineering , and applied intelligence to help ... drive reliability, performance, and automation across Oracle's global cloud network and compute platforms. In this role, you will design and deliver AI -powered… more
    Oracle (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Sr. System Development Engineer, High-Performance…

    Amazon (Cupertino, CA)
    …for the entire AI industry. You'll join a diverse AWS Hardware Engineering team of software, hardware, and network engineers, supply chain specialists, ... design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing… more
    Amazon (10/25/25)
    - Save Job - Related Jobs - Block Source
  • Sr Hardware Development Engineer, High Performance…

    Amazon (Cupertino, CA)
    …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Cloud Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. Utility Computing (UC) AWS Utility Computing (UC) ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (12/10/25)
    - Save Job - Related Jobs - Block Source