• Senior HPC Systems

    NVIDIA (Santa Clara, CA)
    …a lasting impact on the world. We are looking for an outstanding engineer for a Senior HPC Systems Engineer role for at scale AI system performance ... develop new, leading differentiated solutions. You will interact with HPC , OS, CPU and GPU compute, and systems... HPC , OS, CPU and GPU compute, and systems specialist to architect, develop and bring up large… more
    NVIDIA (09/04/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Cloud Engineer

    Stanford University (Stanford, CA)
    Senior HPC Cloud Engineer **Doerr School of Sustainability, Stanford, California, United States** Information Technology Services Post Date Sep 28, 2024 ... help our school expand and scale our research computing ( HPC ) resource portfolio in the cloud to meet the...including distributed filesystems and an emphasis on object storage systems and data lifecycle management. + **Infrastructure as Code… more
    Stanford University (09/29/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI- HPC Storage…

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/ HPC infrastructure supplemented with cloud computing to support the growing needs… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …you will analyze and run High Performance Computing ( HPC ) applications on HPC servers and systems to gain insight into the performance characteristics of ... to full applications that utilize all of the resources on distributed-memory systems with heterogeneous compute nodes including CPUs, GPUs and Manycore processors.… more
    NVIDIA (09/04/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
    NVIDIA (07/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as the… more
    NVIDIA (09/05/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    HPC network fabric or machine learning accelerator cluster systems . Also applicable is experience high-frequency trading networking, high-speed wireless ... Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing … more
    Amazon (07/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Architect

    NVIDIA (Santa Clara, CA)
    …the choice, join our diverse team today! We are looking for an outstanding hands-on architect/ engineer for a Senior HPC architect role to support deployment ... develop new, leading differentiated solutions. You will interact with HPC , OS, GPU compute, and systems specialist...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (08/24/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Santa Clara, CA)
    …large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... physical or life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying IT systems more
    Amazon (10/01/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer , Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux… more
    Amazon (08/24/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer , Nitro High…

    Amazon (Sunnyvale, CA)
    …performance across our product line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux OS boot sequencing, Kernel, ... High Memory and High performance computing workloads. The Nitro High Memory and HPC team owns the purpose built platform development for the High performance… more
    Amazon (08/23/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Architect, AI…

    NVIDIA (Santa Clara, CA)
    …environments, and system software to make current and future high-end computer systems more performant, scalable, and usable. As an NVIDIAN, you'll be immersed ... proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new… more
    NVIDIA (07/26/24)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Software…

    NVIDIA (Santa Clara, CA)
    …the broader NVIDIA team to operate, design and build infrastructure management systems , Kubernetes operators, and end-to-end HPC integration solutions that ... NVIDIA is looking for outstanding software and systems engineers to help us develop and operate...ecosystem. We are focused on supporting NVIDIA products across HPC , Cloud, and enterprise on both bare metal and… more
    NVIDIA (09/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior GPU Supercomputer Scheduler…

    NVIDIA (Santa Clara, CA)
    …human imagination and intelligence. Join us today! As a member of the GPU/ HPC Infrastructure team, you will provide leadership in the design and implementation of ... to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads… more
    NVIDIA (08/07/24)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer

    NVIDIA (Santa Clara, CA)
    …help us accelerate the next wave of technological innovation. We are seeking Systems Engineers to join our Datacenter System Engineering (DCSE) Team to help deliver ... the next generation of DSCI High Performance Computing ( HPC ) server products. You will be working with a team of designers, programmers and data center engineers on… more
    NVIDIA (08/14/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Familiarity with InfiniBand with IBoIP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC workloads + Familiarity with ... diverse team today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the...automation to improve researchers productivity. As a Site Reliability Engineer , you will help us with the strategic challenges… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... implementing, and optimizing on-prem High-Performance Computing ( HPC ) storage solutions while harnessing the power of cloud computing. You will be responsible for… more
    NVIDIA (08/30/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
    NVIDIA (08/14/24)
    - Save Job - Related Jobs - Block Source
  • Senior GPU Cluster Software Engineer

    NVIDIA (Santa Clara, CA)
    …implement, test, deploy, release, and support large scale distributed systems infrastructure with monitoring, logging, visualization, and alerting capabilities with ... profiling tools for real world ML/DL applications running on HPC GPU clusters for failure and efficiency analysis to...with distributed system software architecture + Basic understanding of HPC GPU cluster, slurm + Basic understanding of Machine… more
    NVIDIA (08/13/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied Systems Engineering group. The work environment is ... network, storage security, and management software elements in support of HPC environments. + Driving a complete engineering process, including refining… more
    NVIDIA (08/24/24)
    - Save Job - Related Jobs - Block Source