• Senior AI - HPC

    NVIDIA (Santa Clara, CA)
    …+ Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Working knowledge of cluster configuration managements tools such ... Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ground… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Developer, HPC

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... integration and bare-metal provisioning related functionality in our Linux-based cluster management software environment. NVIDIA's Bright Cluster Manager… more
    NVIDIA (10/15/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Storage…

    NVIDIA (Santa Clara, CA)
    …in any of the leading Cloud environment [ AWS, Azure or GCP] + Experience with AI / HPC cluster job schedulers such as SLURM, LSF + In depth understating ... InfiniBand with IBOP and RDMA + Background with Software Defined Networking and AI / HPC cluster networking + Familiarity with deep learning frameworks like… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Principal Observability Architect, AI

    NVIDIA (Santa Clara, CA)
    …, HW, and SW engineering and research teams to define a vision and roadmap for AI / HPC cluster observability. + Architect and lead teams to d evelop, test, ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and… more
    NVIDIA (11/02/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solution Architect - AI

    Lenovo (Morrisville, NC)
    Senior Solution Architect - AI cluster direction **General Information** Req # WD00074559 Career area: Software Engineering Country/Region: United States of ... and qualify the solution defined. **Location:** Morrisville, NC **What You'll Do** + Design AI / HPC cluster topology + Design and develop test software… more
    Lenovo (11/21/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Specialist

    NYU Rory Meyers College of Nursing (New York, NY)
    Position Summary The Senior HPC Specialist supports New York University's High-Performance Computing ( HPC ) and Research Cloud services by partnering with ... on HPC topics, and maintaining documentation for a rapidly changing HPC environment. Fine-tune cluster and cloud configuration which involves managing… more
    NYU Rory Meyers College of Nursing (09/06/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Systems Engineer

    General Dynamics Information Technology (Fairfax, VA)
    …**Job Description:** At GDIT, people are our differentiator. Our work depends on a Senior HPC Systems Engineer joining our team to support the National Oceanic ... Obtain:** None **Job Family:** Systems Engineering **Skills:** High-Performance Computing ( HPC ) Systems,Linux System Administration,Systems Management **Certifications:** None - N/A… more
    General Dynamics Information Technology (10/12/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer, HPC /ML…

    Amazon (Cupertino, CA)
    …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... and TPUs. This role is on the forefront of AI /ML, we spend a good deal of the day...like solving hard infrastructure problems, want to work with HPC and ML customers, iterate fast and deliver meaningful… more
    Amazon (11/12/24)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect - InfiniBand and HPC

    NVIDIA (CA)
    NVIDIA is looking for a Senior HPC Engineer to join its...the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is ... customers, partners and internal teams to analyze, define, and implement large-scale AI / HPC projects. These efforts include a combination of networking, system… more
    NVIDIA (11/19/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ... You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting...cluster . + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior SRE Engineering Leader - AI

    NVIDIA (Santa Clara, CA)
    …(Perl, Python, Go). + Expertise in managing large-scale distributed systems and AI / HPC environments. + Leadership experience, mentoring, and coaching skills. + ... NVIDIA is leading the way in the AI revolution, revolutionizing industries with our brand-new GPU...leaders to join us on an exciting journey as Senior SRE Engineering Leader. Lead our globally distributed clusters,… more
    NVIDIA (10/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior MLOps Engineer, GenAI Framework

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a dedicated and motivated senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (NeMo, ... working on Large Language Models (LLM), Multimodal (MM), and Speech AI . NeMo provides end-to-end model training, including data curation, alignment, customization,… more
    NVIDIA (10/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, NPN

    NVIDIA (Santa Clara, CA)
    …a hardworking Solution Architect with experience in designing, building, and maintaining large scale HPC and AI hybrid computing solutions to join our team at ... or equivalent experience. + Established track record working with AI and HPC clusters, both on-premises and...cloud based. + 12+ years of proven experience with cluster management and related tools, including Docker Containers, Slurm,… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior Architect, NVIDIA Infrastructure…

    NVIDIA (TX)
    …and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! We are looking for someone with the ability to ... solve the worlds hardest problems. NVIDIA is looking for Senior Hybrid Cloud Infrastructure Solutions Architect to join its...be doing: + Design, implement and maintain large scale HPC / AI clusters with monitoring, logging and alerting… more
    NVIDIA (11/21/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, Infiniband…

    NVIDIA (CA)
    …and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! We are looking for someone with the ability to ... the customer! What you'll be doing: + Primary responsibilities will include building AI / HPC infrastructure for new and existing customers. + Support operational… more
    NVIDIA (10/05/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, Healthcare…

    NVIDIA (CA)
    …and/or CUDA. + Proficient in the Linux/GNU toolchain and operating as a user in HPC cluster environments. + Background in molecular modeling and life sciences. + ... NVIDIA is seeking a Senior Solutions Architect to join our team, focused...industry leader with vision on integrating NVIDIA technology into AI and HPC architectures for advanced applications,… more
    NVIDIA (11/19/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, NIC and DPU…

    NVIDIA (CA)
    …and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! We are looking for someone with the ability to ... validation tools for InfiniBand health and performance including medium to large scale HPC / AI network environments. + Knowledge and experience with Linux system… more
    NVIDIA (10/05/24)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Program Manager - GPU…

    NVIDIA (Santa Clara, CA)
    …development and large scale distributed computing + Experience managing large scale HPC and/or AI Infrastructure deployments that stretch across hardware and ... Hardware Infrastructure is seeking a Senior Technical Program Manager to lead the strategy...infrastructure we build and operate enables NVIDIAs most advanced AI and hardware researchers and engineers to create the… more
    NVIDIA (11/01/24)
    - Save Job - Related Jobs - Block Source
  • Senior Research Fellow - Research Software…

    University of Massachusetts Amherst (Amherst, MA)
    …for research software development, deployment, and workflows to leverage cutting-edge and prototype HPC and AI hardware. + Interface with the larger research ... Senior Research Fellow - Research Software & Research...is theUnity Research Computing Platform (https://unity.rc.umass.edu/) , a collaborative cluster led by UMass Amherst in cooperation with UMass… more
    University of Massachusetts Amherst (10/11/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied...as our employees are currently working on innovative, next-generation AI Factory hardware and software infrastructure at the forefront… more
    NVIDIA (11/23/24)
    - Save Job - Related Jobs - Block Source