• Senior AI - HPC

    NVIDIA (Santa Clara, CA)
    …implementation of distributed storage services. + Design, implement an on-prem AI / HPC infrastructure supplemented with cloud computing to support the growing ... join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...in the design and implementation of ground breaking fast storage solutions to enable runs of demanding deep learning,… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Senior Product Architect, HPC

    NVIDIA (Santa Clara, CA)
    …harness your infrastructure expertise to create reference designs for the world's most powerful AI clusters. As an AI / HPC Product Architect at NVIDIA, you'll ... experience with benchmarking systems and analyzing performance bottlenecks in large-scale AI / HPC infrastructure + Exceptional communication skills, with the… more
    NVIDIA (10/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …Understanding of fast, distributed storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning frameworks like PyTorch and ... join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...the strategic challenges we encounter including: compute, networking, and storage design for large scale, high performance workloads, effective… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI and HPC Clusters…

    NVIDIA (Santa Clara, CA)
    …Understanding of fast, distributed storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning frameworks like PyTorch and ... and parallel computing. Now, GPU deep learning is driving modern AI forward. Join our GPU AI / HPC Infrastructure team and lead the design of groundbreaking… more
    NVIDIA (11/08/24)
    - Save Job - Related Jobs - Block Source
  • Principal Observability Architect, AI

    NVIDIA (Santa Clara, CA)
    …to define a vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and guide implementation towards this ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....visualization to spectacularly improve efficiency, performance, and productivity of AI and HPC workloads. You will lead… more
    NVIDIA (11/02/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer, HPC /ML…

    Amazon (Cupertino, CA)
    …workloads over thousands of CPUs, GPUs, and TPUs. This role is on the forefront of AI /ML, we spend a good deal of the day optimizing the networking for the latest ... AI workload such as LLMs. AWS Utility Computing (UC)...innovations - from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2),… more
    Amazon (11/12/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …Understanding of fast, distributed storage systems like Lustre and GPFS for AI / HPC workloads. + Familiarity with deep learning frameworks like PyTorch and ... the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior SRE Engineering Leader - AI

    NVIDIA (Santa Clara, CA)
    …(Perl, Python, Go). + Expertise in managing large-scale distributed systems and AI / HPC environments. + Leadership experience, mentoring, and coaching skills. + ... NVIDIA is leading the way in the AI revolution, revolutionizing industries with our brand-new GPU...leaders to join us on an exciting journey as Senior SRE Engineering Leader. Lead our globally distributed clusters,… more
    NVIDIA (10/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Systems Software…

    NVIDIA (Santa Clara, CA)
    …benchmarking on large-scale distributed systems + Hands-on experience with NVIDIA GPUs, HPC storage , networking, and cloud computing. + In-depth understanding ... fiction inventions from artificial intelligence to autonomous cars. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Distinguished Engineer, AI Resiliency Lead

    NVIDIA (Santa Clara, CA)
    …At least 5 years of hands-on experience in software development on high-complexity projects involving HPC or AI . Ways to Stand Out from the Crowd: + Proven ... and leadership across organizations, with direct exposure to NVIDIA's senior leadership. What You'll Be Doing: + Define a...A strong passion for designing system architectures tailored for AI , covering CPU, GPU, memory, storage , and… more
    NVIDIA (10/23/24)
    - Save Job - Related Jobs - Block Source
  • Principal Infrastructure SRE - Storage

    NVIDIA (Santa Clara, CA)
    …automation. + Proficient in Python or Go. + Experience with Large-scale data storage and compute clusters ( HPC ) infrastructure and Excellent working knowledge of ... by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts… more
    NVIDIA (10/27/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer, NCCL…

    NVIDIA (Santa Clara, CA)
    …test design + Experience working with engineering or academic research community supporting HPC or AI + Practical experience with high performance networking: ... runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner...to get an end to end understanding of the AI networking stack. Are you ready for to contribute… more
    NVIDIA (10/22/24)
    - Save Job - Related Jobs - Block Source
  • Senior Solutions Architect, NPN

    NVIDIA (Santa Clara, CA)
    …a hardworking Solution Architect with experience in designing, building, and maintaining large scale HPC and AI hybrid computing solutions to join our team at ... Science, or a related field or equivalent experience. + Established track record working with AI and HPC clusters, both on-premises and cloud based. + 12+ years… more
    NVIDIA (09/18/24)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Program Manager…

    NVIDIA (Santa Clara, CA)
    …span across multiple teams and engineers (100+) + Experience managing large scale HPC and/or AI Infrastructure deployments that stretch across hardware and ... Hardware Infrastructure is seeking a Senior Technical Program Manager to lead the strategy...infrastructure we build and operate enables NVIDIA's most advanced AI and hardware researchers and engineers to create the… more
    NVIDIA (11/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior Runtime Software Development…

    Amazon (Cupertino, CA)
    Description At AWS AI our vision is to make deep learning pervasive for everyday developers and to democratize access to cutting edge infrastructure. In order to ... performance and cost-effective inference and training. The Neuron team is hiring senior Runtime Software Development Engineers with a background in machine learning… more
    Amazon (09/04/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer…

    NVIDIA (Santa Clara, CA)
    …Ways to stand out from the crowd: + Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system ... scientific computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical science… more
    NVIDIA (09/10/24)
    - Save Job - Related Jobs - Block Source
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied...as our employees are currently working on innovative, next-generation AI Factory hardware and software infrastructure at the forefront… more
    NVIDIA (08/24/24)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer, Base OS…

    NVIDIA (Santa Clara, CA)
    …you'll be doing: + The NVIDIA Grace Superchips are the foundation for the next generation of AI and HPC platforms. Your role as a Base OS Kernel Engineer is to ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with...concepts such as filesystems, job scheduling, device drivers, and storage , is required. + Experience with complex system-level debugging… more
    NVIDIA (11/08/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Architect, Advanced…

    NVIDIA (Santa Clara, CA)
    AI and deep learning solutions, data analytics, High Performance Computing ( HPC ), Software Defined Networking (SDN), virtualization, storage , and more. What ... technology, and extraordinary people, NVIDIA is looking for a senior architect to join us in shaping the future....to join us in shaping the future. As a senior architect in the Advanced Development team, you will… more
    NVIDIA (11/13/24)
    - Save Job - Related Jobs - Block Source
  • Senior SoC Functional Modeling Engineer,…

    Amazon (Cupertino, CA)
    …these accelerator SoCs for use by internal partner teams. We're looking for a Senior SoC Modeling Engineer to join the team and deliver new functional models, ... UC organization, you'll support the development and management of Compute, Database, Storage , Platform, and Productivity Apps services in AWS, including support for… more
    Amazon (11/12/24)
    - Save Job - Related Jobs - Block Source