• AI / HPC Network

    Meta (Menlo Park, CA)
    … fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of GPUs together. In… more
    Meta (10/17/24)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect - HPC AI

    NVIDIA (CA)
    …can make a lasting impact on the world. NVIDIA Infrastructure Specialists team seeks an HPC / AI Infiniband Network Engineer to help customers realize ... to the world's largest and most sophisticated data centers and supercomputers. HPC Network Engineers deliver the technologies, solutions and services customers… more
    NVIDIA (10/10/24)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
    Meta (08/01/24)
    - Save Job - Related Jobs - Block Source
  • HPC Systems Engineer III

    University Corporation for Atmospheric Research (WY)
    Job Description Summary: UCAR is excited to announce the job opening for a HPC Systems Engineer III role. This position is responsible for providing system ... stronger workforce at the intersection of High Performance Computing ( HPC ) and geoscience problems, CISL engages in education and...in your work. + Please explain how you see AI tools, like large language models, potentially enhancing your… more
    University Corporation for Atmospheric Research (10/19/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Systems Engineer

    General Dynamics Information Technology (Fairfax, VA)
    …Description:** At GDIT, people are our differentiator. Our work depends on a Senior HPC Systems Engineer joining our team to support the National Oceanic and ... Obtain:** None **Job Family:** Systems Engineering **Skills:** High-Performance Computing ( HPC ) Systems,Linux System Administration,Systems Management **Certifications:** None - N/A… more
    General Dynamics Information Technology (10/12/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing… more
    Amazon (10/06/24)
    - Save Job - Related Jobs - Block Source
  • HPC Systems Administrator

    The MITRE Corporation (Mclean, VA)
    …Technology Division provides multiple corporate-wide services including High Performance Computing ( HPC ), Enterprise PC and Mobile Solutions, Network Services, ... organizations. Job Description: We are seeking an experienced Linux HPC Systems engineer to join our team!...intelligence ( AI ), and advanced computing to deliver AI and HPC services to MITRE organization… more
    The MITRE Corporation (10/02/24)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect - InfiniBand and HPC

    NVIDIA (CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Professional Services team. Academic, commercial and government groups around the world are using NVIDIA ... and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is looking for someone with the ability to… more
    NVIDIA (08/20/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Enabling reliable ... this role, you will be a member of the AI Networking Software team and part of the bigger...Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and… more
    Meta (10/18/24)
    - Save Job - Related Jobs - Block Source
  • Technical Lead/Manager - AI /ML…

    Cisco (San Jose, CA)
    …agile team engaged in the design, development and execution of tests to qualify network performance for AI .ML capability. In this role you'll have opportunity ... to build the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application. We are uniquely… more
    Cisco (09/12/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. AWS Infrastructure Services owns the design, planning, ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (10/12/24)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer , AI

    General Motors (Columbus, OH)
    …This role will involve working across various areas, from enhancing underlying HPC infrastructure to optimizing Kubernetes and Kubeflow setups, as well as refining ... teams to understand requirements and implement solutions. + Troubleshoot complex HPC infrastructure issues and implement effective resolutions with partner team. +… more
    General Motors (08/14/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer , Sustaining

    Meta (Menlo Park, CA)
    …hardware requirements and specifications (eg, configuring hardware components, GPU, memory, network for AI / HPC workloads) **Public Compensation:** ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP)...Responsibilities: 1. Develop robust, industry leading practices for supporting AI / HPC infrastructure at scale 2. Interface with… more
    Meta (10/18/24)
    - Save Job - Related Jobs - Block Source
  • Systems Development Eng (AWS Generative AI

    Amazon (Seattle, WA)
    …delivering and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (10/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior Cloud Services Software Engineer

    NVIDIA (Santa Clara, CA)
    …As a Senior engineer , you'll be instrumental in developing and optimizing AI infrastructure services to bring peak AI performance and high resiliency for ... engineers. What You Will Be Doing: As a software engineer specializing in backend development, you'll work in a...well as container technologies like Docker and Kubernetes, and HPC / AI platforms such as Slurm. Ways to… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Liquid Cooling Design Engineer

    Dell Technologies (Austin, TX)
    …and compliant mechanical design solutions, as well as cross-functional interfaces for AI , Cloud, High Performance Computing ( HPC ), Edge computing devices and ... **Liquid Cooling Design Engineer ** Mechanical Engineering leads and delivers the development...be instrumental in delivering advanced liquid cooling solutions for AI , HPC , and enterprise server markets. Your… more
    Dell Technologies (10/18/24)
    - Save Job - Related Jobs - Block Source
  • Technical Marketing Engineer

    Cisco (San Jose, CA)
    …and driven Technical Marketing Engineer to define, validate, and drive compute & AI performance on UCS. Who you are: You have over 4 years of expertise in ... and PowerTool, and have experience in benchmarking for compute, network , and AI /ML. You are flexible, enthusiastic,...range of products and solutions * Understand high-performance computing ( HPC ), GPU workloads, and other AI infrastructure… more
    Cisco (10/07/24)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Engineer , TPU Machine…

    Google (Sunnyvale, CA)
    AI accelerators, which are optimized for training and inference of large AI models. As a Software Engineer in the TPU Accelerator Software team, ... and hardware operations. + 5 years of experience building Network Architecture along with routing algorithms and topologies. +...+ 5 years of experience with High Performance Computing ( HPC ). + 3 years of experience working in a… more
    Google (10/11/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …Ways to stand out from the crowd: + Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system ... We are seeking a Sr System Software Engineer to help us build out our scientific...computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical… more
    NVIDIA (09/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... lasting impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied Systems Engineering group. The work environment is… more
    NVIDIA (08/24/24)
    - Save Job - Related Jobs - Block Source