• Senior AI - HPC Storage…

    NVIDIA (Santa Clara, CA)
    …in any of the leading Cloud environment [ AWS, Azure or GCP] + Experience with AI / HPC cluster job schedulers such as SLURM, LSF + In depth understating ... InfiniBand with IBOP and RDMA + Background with Software Defined Networking and AI / HPC cluster networking + Familiarity with deep learning frameworks like… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer, HPC /ML…

    Amazon (Cupertino, CA)
    …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... and TPUs. This role is on the forefront of AI /ML, we spend a good deal of the day...like solving hard infrastructure problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
    Amazon (07/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer - Internal…

    NVIDIA (Santa Clara, CA)
    …& external partners + Finding and fixing problems before they occur + Building automation for AI - HPC GPU Cluster bring up and scaled up operation + Improving ... diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...including workflows that uses MPI + Working knowledge of cluster configuration management tools such as BCM, Ansible, Puppet,… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, NPN

    NVIDIA (Santa Clara, CA)
    …a hardworking Solution Architect with experience in designing, building, and maintaining large scale HPC and AI hybrid computing solutions to join our team at ... or equivalent experience. + Established track record working with AI and HPC clusters, both on-premises and...cloud based. + 12+ years of proven experience with cluster management and related tools, including Docker Containers, Slurm,… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior Cloud Services Software Engineer

    NVIDIA (Santa Clara, CA)
    …as AWS, Azure, and GCP, as well as container technologies like Docker and Kubernetes, and HPC / AI platforms such as Slurm. Ways to stand out from the crowd: + ... Joining NVIDIA's AI Efficiency Team means contributing to the infrastructure...distributed software engineer to join our team! As a Senior engineer, you'll be instrumental in developing and optimizing… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied...as our employees are currently working on innovative, next-generation AI Factory hardware and software infrastructure at the forefront… more
    NVIDIA (08/24/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer, ML Infrastructure…

    Amazon (Cupertino, CA)
    …involve exposure to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio. ... top performance of AWS ML and High Performance Computing ( HPC ) technologies developed by our organization. Bring your exceptional...Join us as we expand the AWS offerings for AI , including Trainium, Neuron and the Elastic Fabric Adapter… more
    Amazon (09/19/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer…

    NVIDIA (Santa Clara, CA)
    …Ways to stand out from the crowd: + Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system ... scientific computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical science… more
    NVIDIA (09/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional OEM business. ... NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains...OS experience, reliability testing with various telemetries, scale out cluster , test plan development, CI/CD and DevOps experience to… more
    NVIDIA (09/05/24)
    - Save Job - Related Jobs - Block Source
  • Research Associate - Neutrino

    SLAC National Accelerator Laboratory (Menlo Park, CA)
    …GPUs (28 A100, 280 RTX 2080Ti), current allocation at the NERSC Perlmutter (A100 cluster ), and other potential HPC centers where we apply for future allocations. ... experimental neutrino physics, and Artificial Intelligence and Machine Learning ( AI /ML) research. **About us:** **The Machine Learning Initiatives (MLI)** is… more
    SLAC National Accelerator Laboratory (08/26/24)
    - Save Job - Related Jobs - Block Source