• Senior HPC Systems

    NVIDIA (Santa Clara, CA)
    …a lasting impact on the world. We are looking for an outstanding engineer for a Senior HPC Systems Engineer role for at scale AI system performance ... develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems... HPC, OS, CPU and GPU compute, and systems specialist to architect, develop and bring up large… more
    NVIDIA (09/04/24)
  • Senior AI- HPC Storage…

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/HPC infrastructure supplemented with cloud computing to support the growing needs… more
    NVIDIA (06/19/24)
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …you will analyze and run High Performance Computing (HPC) applications on HPC servers and systems to gain insight into the performance characteristics of ... to full applications that utilize all of the resources on distributed-memory systems with heterogeneous compute nodes including CPUs, GPUs and Manycore processors.… more
    NVIDIA (09/04/24)
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
    NVIDIA (07/25/24)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as the… more
    NVIDIA (09/05/24)
  • Senior HPC Architect

    NVIDIA (Santa Clara, CA)
    …the choice, join our diverse team today! We are looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment ... develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (08/24/24)
  • Software Development Engineer, Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in areas such as Linux… more
    Amazon (08/24/24)
  • Senior Software Architect, AI…

    NVIDIA (Santa Clara, CA)
    …environments, and system software to make current and future high-end computer systems more performant, scalable, and usable. As an NVIDIAN, you'll be immersed ... proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new… more
    NVIDIA (07/26/24)
  • Senior Systems Software…

    NVIDIA (Santa Clara, CA)
    …the broader NVIDIA team to operate, design and build infrastructure management systems, Kubernetes operators, and end-to-end HPC integration solutions that ... NVIDIA is looking for outstanding software and systems engineers to help us develop and operate...ecosystem. We are focused on supporting NVIDIA products across HPC, Cloud, and enterprise on both bare metal and… more
    NVIDIA (09/10/24)
  • Senior GPU Supercomputer Scheduler…

    NVIDIA (Santa Clara, CA)
    …human imagination and intelligence. Join us today! As a member of the GPU/HPC Infrastructure team, you will provide leadership in the design and implementation of ... to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads… more
    NVIDIA (08/07/24)
  • Senior Systems Engineer

    NVIDIA (Santa Clara, CA)
    …help us accelerate the next wave of technological innovation. We are seeking Systems Engineers to join our Datacenter System Engineering (DCSE) Team to help deliver ... the next generation of DSCI High Performance Computing (HPC) server products. You will be working with a team of designers, programmers and data center engineers on… more
    NVIDIA (08/14/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for… more
    NVIDIA (08/30/24)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
    NVIDIA (08/14/24)
  • Senior GPU Capacity and Resource Management…

    NVIDIA (Santa Clara, CA)
    …InfiniBand with IBOP and RDMA as well as understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with deep ... join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (07/12/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Familiarity with InfiniBand with IBoIP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with ... diverse team today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the...for our GPU Compute Clusters. As a Site Reliability Engineer, you will help us with the strategic challenges… more
    NVIDIA (09/12/24)
  • Senior GPU Cluster Software Engineer

    NVIDIA (Santa Clara, CA)
    …implement, test, deploy, release, and support large scale distributed systems infrastructure with monitoring, logging, visualization, and alerting capabilities with ... profiling tools for real world ML/DL applications running on HPC GPU clusters for failure and efficiency analysis to...with distributed system software architecture + Basic understanding of HPC GPU cluster, Slurm + Basic understanding of Machine… more
    NVIDIA (08/13/24)
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied Systems Engineering group. The work environment is ... network, storage security, and management software elements in support of HPC environments. + Driving a complete engineering process, including refining… more
    NVIDIA (08/24/24)
  • Senior Offensive Security Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a highly motivated, creative engineer with experience in system software and a background in security to join the Server Platform Software ... focus on offensive security efforts for our Data Center Systems, such as NVIDIA HGX, DGX, and MGX. What...ecosystem. We are focused on supporting NVIDIA products across HPC, cloud, and enterprise on both bare metal and… more
    NVIDIA (07/16/24)
  • Senior BMC Security Firmware…

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a highly motivated, creative engineer with experience in system software security to join the Server Platform Software team. In this role, ... on securing the management plane for NVIDIA's Data Center Systems, such as NVIDIA HGX, DGX, and MGX. NVIDIA...ecosystem. We are focused on supporting NVIDIA products across HPC, cloud, and enterprise on both bare metal and… more
    NVIDIA (07/16/24)
  • Senior Math Libraries Engineer

    NVIDIA (Santa Clara, CA)
    …As a Sr. Engineer specializing on optimizing for single and multi-processor systems you will be responsible for: + Developing scalable HPC math library ... is now looking for a self-motivated and expert software engineer for its Fast Fourier Transform libraries. Around the...bigger and higher fidelity problems faster in AI and HPC has led to the development of the NVIDIA… more
    NVIDIA (07/16/24)