• Senior AI - HPC

    NVIDIA (Santa Clara, CA)
    …in Computer Science or related field, with 8+ years of proven experience in AI /ML and HPC workloads and infrastructure. + Hands-on experience in using or ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...operating High Performance Computing ( HPC ) grade infrastructure as well as in-depth knowledge of… more
    NVIDIA (01/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Developer, HPC

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... integration and bare-metal provisioning related functionality in our Linux-based cluster management software environment. NVIDIA's Bright Cluster Manager… more
    NVIDIA (01/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Storage…

    NVIDIA (Santa Clara, CA)
    …in any of the leading Cloud environment [ AWS, Azure or GCP] + Experience with AI / HPC cluster job schedulers such as SLURM, LSF + In depth understating ... InfiniBand with IBOIP and RDMA + Background with Software Defined Networking and AI / HPC cluster networking + Familiarity with deep learning frameworks like… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, HPC

    NVIDIA (Santa Clara, CA)
    …Primary responsibilities will be to validate and debug customer cluster performance issues, functional bottlenecks and drive customer technical engagements ... ecosystems. You'll be called on to help architect and scale high-performance, distributed AI infrastructure on-prem or in the cloud built with the latest NVIDIA GPU… more
    NVIDIA (12/11/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer, HPC /ML…

    Amazon (Cupertino, CA)
    …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... and TPUs. This role is on the forefront of AI /ML, we spend a good deal of the day...like solving hard infrastructure problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
    Amazon (12/20/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ... You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting...cluster . + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges… more
    NVIDIA (12/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior SRE Engineering Leader - AI

    NVIDIA (Santa Clara, CA)
    …(Perl, Python, Go). + Expertise in managing large-scale distributed systems and AI / HPC environments. + Leadership experience, mentoring, and coaching skills. + ... NVIDIA is leading the way in the AI revolution, revolutionizing industries with our brand-new GPU...leaders to join us on an exciting journey as Senior SRE Engineering Leader. Lead our globally distributed clusters,… more
    NVIDIA (01/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer, Foundation Model…

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... works on multimodal foundation models, large-scale robot learning, embodied AI , and physics simulation. Our past projects include Eureka… more
    NVIDIA (12/07/24)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Program Manager - GPU…

    NVIDIA (Santa Clara, CA)
    …development and large scale distributed computing + Experience managing large scale HPC and/or AI Infrastructure deployments that stretch across hardware and ... Hardware Infrastructure is seeking a Senior Technical Program Manager to lead the strategy...infrastructure we build and operate enables NVIDIAs most advanced AI and hardware researchers and engineers to create the… more
    NVIDIA (12/03/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied...as our employees are currently working on innovative, next-generation AI Factory hardware and software infrastructure at the forefront… more
    NVIDIA (11/23/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer…

    NVIDIA (Santa Clara, CA)
    …Ways to stand out from the crowd: + Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system ... scientific computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical science… more
    NVIDIA (12/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional OEM business. ... NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains...OS experience, reliability testing with various telemetries, scale out cluster , test plan development, CI/CD and DevOps experience to… more
    NVIDIA (12/17/24)
    - Save Job - Related Jobs - Block Source