• Senior HPC AI Cluster…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an experienced HPC Engineer to join the E2E...We are looking for an outstanding architect for a senior HPC, be a key player to ... develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop and bring up large scale… more
    NVIDIA (01/27/25)
  • Senior AI-HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC ...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (03/11/25)
  • Senior AI-HPC Storage…

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/HPC infrastructure supplemented with cloud computing to support the growing needs… more
    NVIDIA (02/05/25)
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
    NVIDIA (01/22/25)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as the… more
    NVIDIA (03/06/25)
  • Senior Observability Engineer, AI…

    NVIDIA (Santa Clara, CA)
    NVIDIA's Hardware Infrastructure organization is seeking a Senior Observability Engineer to help architect and implement our distributed observability systems ... You will be working with a team of dedicated engineers on systems for data collection, aggregation, enrichment, storage, retrieval, and visualization to… more
    NVIDIA (01/31/25)
  • Senior HPC and AI Networking…

    NVIDIA (Santa Clara, CA)
    …like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis ... types of hardware and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies to dive deeply into… more
    NVIDIA (03/11/25)
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or... is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
    Amazon (02/12/25)
  • Software Development Engineer, Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux… more
    Amazon (01/28/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
    NVIDIA (01/21/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Experience with Cloud Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads. + Familiarity ... diverse team today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the...automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of… more
    NVIDIA (12/25/24)
  • Senior Technical Marketing Engineer

    NVIDIA (Santa Clara, CA)
    …team and see how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for AI Infrastructure, you will join a dedicated team that ... advancement of datacenter GPUs and large scale GPU computing systems. What you will be doing: + Evaluate and...+ Proficiency in Python and C++ for AI and HPC applications. + Experience using large scale multi node… more
    NVIDIA (01/29/25)
  • Senior Software Engineer, GPU…

    NVIDIA (Santa Clara, CA)
    …the next wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and ... crew that develops and maintains software for complex heterogeneous computing systems that power disruptive products in High Performance Computing and Deep… more
    NVIDIA (01/13/25)
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    …testing/launching software products. + 5 years of experience with Distributed File Systems, or Storage systems. Preferred qualifications: + Master's degree or ... field. + 8 years of experience with Distributed File Systems, or Storage systems. + 5 years...on and is growing every day. As a software engineer, you will work on a specific project critical… more
    Google (03/07/25)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …supports many key products spanning the gamut of data analytics, deep learning, HPC and professional graphics running on hardware ranging from supercomputers to the ... and creativity to develop and improve the functionality and performance of runtime systems that underlay the foundation of distributed GPU computing at NVIDIA. +… more
    NVIDIA (03/05/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    We are now hiring a Senior System Software Engineer to join the NVIDIA's System Software group focusing on Data Center Server Platform Diagnostics. You will join ... dynamic crew that builds and maintains software for complex heterogeneous computing systems that power sophisticated server products used in ground breaking of… more
    NVIDIA (03/04/25)
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... What you will be doing: + Design and maintain large-scale distributed training systems to support multi-modal foundation models for robotics. + Optimize GPU and… more
    NVIDIA (03/08/25)
  • Senior Developer Technology Engineer

    NVIDIA (Santa Clara, CA)
    We're currently seeking a Senior Developer Technology Engineer, Artificial Intelligence Would you enjoy researching parallel algorithms to accelerate AI ... (industry and academia) to perform in-depth analysis and optimization of complex AI and HPC algorithms to ensure the best possible AI solutions on modern CPU and GPU… more
    NVIDIA (01/13/25)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    …optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a ... NVIDIA data center systems, such as DGX and HGX, have become core...significant role in building scalable systems at Speed of Light! You will work with world… more
    NVIDIA (03/06/25)
  • Senior Platform Software Engineer

    NVIDIA (Santa Clara, CA)
    …NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to design of this ... I/O bus (PCIe, etc.) and CPU. + Architecting complex systems, I/O error handling from PCIe & other I/O...datacenter requirements and improve resiliency of a GPU based systems + Identify gaps in platform debuggability and drive… more
    NVIDIA (03/05/25)