• Senior DGX Cloud Performance

    NVIDIA (Santa Clara, CA)
    … analysis, optimization, and modeling to define the architecture and design of NVIDIA's DGX Cloud clusters. The ideal candidate will have a deep understanding of ... the methodology to conduct end to end performance analysis of critical AI applications running on large...will work closely with the multi-functional teams to define DGX Cloud cluster architecture for different CSPs,… more
    NVIDIA (11/21/25)
    - Save Job - Related Jobs - Block Source
  • DGX Cloud Performance

    NVIDIA (Santa Clara, CA)
    … analysis, optimization, and modeling to define the architecture and design of Nvidia's DGX Cloud clusters. The ideal candidate will have a deep understanding of ... the methodology to conduct end to end performance analysis of critical AI applications running on large...work closely with the cross functional teams to define DGX Cloud cluster architecture for different CSPs,… more
    NVIDIA (12/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX AI Cloud

    NVIDIA (Santa Clara, CA)
    Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing ... Engineers to design and develop tools for AI application performance analysis. Your work will enable AI researchers to...to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for… more
    NVIDIA (12/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Datacenter System Software Engineer

    NVIDIA (Santa Clara, CA)
    …creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will lead the architecture, design and implementation ... of our next generation DGX cloud clusters using latest technologies. On...stack deployment including hardware architecture, workload orchestration and application performance tuning. Are you ready to change the next… more
    NVIDIA (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will ... cloud enabling technologies like Kubernetes and OpenStack. DGX Cloud SRE at NVIDIA ensures that...AI training and Inferencing platform built on top of cloud infrastructure + Conduct in-depth performance characterization… more
    NVIDIA (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Principal Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    The NVIDIA DGX Cloud organization is looking for software engineering talent to build NVIDIA's accelerated compute infrastructure. This includes software to ... of compute hardware and networking equipment. As a Principal Systems Software Engineer , you will work with other software engineers, product architects, and product… more
    NVIDIA (10/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …+ Contribute to architecture, integration, and alignment with both on-prem and cloud -native platforms. + Optimize system performance and reliability through ... We are looking for a Sr Storage Services Software engineer to join the block storage group. You will...storage services! Services that will need to meet extreme performance and scalability demands! We have crafted a team… more
    NVIDIA (11/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , DGX

    NVIDIA (Santa Clara, CA)
    NVIDIA is seeking a Senior Systems Software Engineer to build cloud -native platform software harnessing open-source container runtimes and Kubernetes. You will ... + Automate and optimize build, test, integration, and release pipelines for cloud -native services. + Diagnose and improve performance , reliability, and security… more
    NVIDIA (12/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Production Engineer

    NVIDIA (Santa Clara, CA)
    …possess expertise in different domains, such as storage architecture, high- performance distributed storage, data management, systems, networking, coding, database ... planning, continuous delivery and deployment, as well as open-source cloud -enabling technologies like Kubernetes, containers, and virtualization. Their responsibilities… more
    NVIDIA (01/03/26)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …database, capacity management, continuous delivery and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures ... that our internal and external facing GPU cloud services run maximum reliability and uptime as promised...planning while keeping an eye on capacity, latency and performance . SRE is also a mindset and a set… more
    NVIDIA (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Object Storage…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Software Engineer in Object Storage to design, implement, and extend the capabilities of our internal object storage system. This ... - 10k+ nodes, exabytes of data + Analyzing and improving system performance at all levels + Automating storage infrastructure end-to-end including provisioning,… more
    NVIDIA (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer , AI…

    NVIDIA (Santa Clara, CA)
    We are looking for a Senior AI Infrastructure Engineer (AI Tooling) to design and build the backend systems and infrastructure powering our internal AI tools and ... writing clear design documentation + Deep experience with Kubernetes and cloud -native infrastructure + Experience working with building, deploying, and maintaining… more
    NVIDIA (10/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Data Ingestion…

    NVIDIA (Santa Clara, CA)
    NVIDIA is seeking a Senior Software Engineer to build a worldwide network of fast, efficient, and reliable data transfer systems. The goal is to enable NVIDIA AI ... building high-scale distributed systems such as distributed databases, storage systems, or cloud services NVIDIA is leading the way in groundbreaking developments in… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior GPU and HPC Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    …and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high- performance networking (RDMA, ... that automates GPU asset provisioning, configuration, and lifecycle management across cloud providers. You'll contribute to this platform to build end-to-end… more
    NVIDIA (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer , BCM…

    NVIDIA (Santa Clara, CA)
    …in providing excellent, comprehensive support to our customers! ​Sr Site Reliability Engineer in this role will significantly impact and contribute to the overall ... complex cluster configurations including Slurm and Kubernetes orchestrators for performance , scalability and resilience, ensuring they meet real-world customer… more
    NVIDIA (10/28/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Engineer , Regulated…

    NVIDIA (Santa Clara, CA)
    …As a technology leader at NVIDIA, you will lead the development of secure DGX Cloud offerings for regulated and sovereign cloud environments (governments, ... + Various Architectural Work: define and drive the technical implementation for DGX Cloud offerings in regulated and sovereign environments, including FedRAMP,… more
    NVIDIA (12/09/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Engineer , Observability,…

    NVIDIA (Santa Clara, CA)
    …worldwide! As a technology leader at NVIDIA, you will own the development of DGX Cloud strategy for observability, monitoring, and remediation across all layers ... + Various Architectural Work: define and drive the technical implementation for DGX Cloud offerings in the observability, monitoring, and remediation practice.… more
    NVIDIA (11/24/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Engineer , GPU Fleet…

    NVIDIA (Santa Clara, CA)
    …worldwide! As a technology leader at NVIDIA, you will lead the development of DGX Cloud strategy for GPU fleet lifecycle, health, observability and utilization ... Be Doing: + Various Architectural Work: define and drive the technical implementation for DGX Cloud operations practice for GPU fleet lifecycle. + Collaborate on… more
    NVIDIA (01/03/26)
    - Save Job - Related Jobs - Block Source
  • Senior Data Center Performance

    NVIDIA (Santa Clara, CA)
    …NVIDIA AI and HPC software stack. We are searching for a highly motivated engineer to lead performance benchmarking and optimization efforts for our data center ... ecosystem of data center platform designs. From single node HGX/ DGX systems all the way up to large multi-node...have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. Each brings together the full power… more
    NVIDIA (12/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Machine Learning Engineer

    NVIDIA (Santa Clara, CA)
    … at NVIDIA, you will build the machine learning brain that keeps NVIDIA's global DGX Cloud healthy, efficient and ready for the next waves of AI breakthroughs. ... DGX Cloud fuses NVIDIA GPUs, NVLink networking...solving capacity planning problems. + Deep understanding of GPU performance metrics. + Familiarity with prometheus and PromQL. Your… more
    NVIDIA (01/06/26)
    - Save Job - Related Jobs - Block Source