• Senior AI Cluster

    NVIDIA (Santa Clara, CA)
    …be doing: + Build internal perf/power profiling and analysis tools and platform for AI workloads at cluster scale + Build debugging tools for common ... frameworks like Pytorch, TensorFlow and etc + Knowledge of AI cluster job scheduling, storage management and...GPU cluster scale continuous profiling & analysis tools /platforms + Solid experience in large AI more
    NVIDIA (12/19/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI -HPC Cluster

    NVIDIA (Santa Clara, CA)
    …in administering Centos/RHEL and/or Ubuntu Linux distributions + Solid understanding of cluster configuration managements tools such as Ansible, Puppet, Salt + ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...join us today! As a member of the GPU AI /HPC Infrastructure team, you will provide leadership in the… more
    NVIDIA (03/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC AI Cluster

    NVIDIA (Santa Clara, CA)
    …looking for an experienced HPC Engineer to join the E2E software verification HPC/ AI Infrastructure team. We are building supercomputers and HPC clusters based on ... We are looking for an outstanding architect for a senior HPC, be a key player to the most...be doing: + Designing, implementing and maintaining large scale HPC/ AI clusters with monitoring, logging and alerting + Managing… more
    NVIDIA (01/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …5K GPUs cluster . + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges and optimizing system ... NVIDIA is the leader in AI , machine learning and datacenter acceleration. NVIDIA is...Solid experience with GPU clusters, and working knowledge of cluster configuration management tools such as BCM… more
    NVIDIA (12/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior Observability Architect, AI

    NVIDIA (Santa Clara, CA)
    …HW, and SW engineering and research teams to define a vision and roadmap for AI /HPC cluster observability. + Architect and lead teams to develop, test, and ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....We serve and collaborate directly with NVIDIA's rapidly growing AI , HW, and SW engineering and research teams across… more
    NVIDIA (02/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Product Manager, Systems, and Cloud…

    Google (Sunnyvale, CA)
    …storage, compute hardware, databases, file systems, data analytics, cluster management, and other software infrastructure areas. Preferred qualifications: ... We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than… more
    Google (03/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer, Foundation Model…

    NVIDIA (Santa Clara, CA)
    …JAX, or TensorFlow. + Deep understanding of GPU acceleration, CUDA programming, and cluster management tools like Kubernetes. + Strong programming skills in ... NVIDIA is searching for a senior or principal engineer who specializes in building...works on multimodal foundation models, large-scale robot learning, embodied AI , and physics simulation. Our past projects include Eureka… more
    NVIDIA (03/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Cloud Test Developer Architect

    NVIDIA (Santa Clara, CA)
    …cloud architectures, for a dynamic role. What You'll Be Doing: + Leverage AI -powered testing tools to improve test automation, increase coverage, and accelerate ... We are in search of a highly skilled Senior Test Developer Architect to join our dynamic...and optimization techniques. + Improve observability and monitoring through AI -powered anomaly detection, predictive analytics, and intelligent alerting. +… more
    NVIDIA (03/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, Bare Metal…

    NVIDIA (Santa Clara, CA)
    …software engineers with bare metal hardware experience to help scale up its AI Infrastructure. We expect you to have significant software engineering experience with ... to build and deploy leading infrastructure solutions for a broad range of AI -based applications. If you're creative, passionate about GPU hardware, and love having… more
    NVIDIA (01/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior ASIC Physical Design Engineer,…

    NVIDIA (Santa Clara, CA)
    …graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that ... parallel computing! More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...high-frequency and low-power CPUs, GPUs, SoCs at block level, cluster level, and/or full chip level, with a focus… more
    NVIDIA (02/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior NVLink System Software Bringup…

    NVIDIA (Santa Clara, CA)
    …architecture. Expertise with NVIDIA systems is a huge bonus. + Any experience with cluster software automation tools such as Ansible, Jenkins NVIDIA is leading ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...GPU accelerated applications, such as Deep Learning. As a Senior NVLink System Software Bringup Engineer within our Fabric… more
    NVIDIA (02/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …KVM/VMWare/Hyper-V environment. + Good knowledge and hands-on experience in model testing, AI tools /frameworks (TensorFlow, Pytorch, Cursor and etc ), NLP and ... OEM business. NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains...OS experience, reliability testing with various telemetries, scale out cluster , test plan development, CI/CD and DevOps experience to… more
    NVIDIA (03/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, Languages (Rust,…

    LinkedIn (Mountain View, CA)
    …This spans multiple areas including but not limited to: Providing tools and infrastructure that creates delightful development experience; Provide and support ... member facing products; Provide actionable insights that leverages data and metrics; Provide tools and data that help LinkedIn teams listen to their customers; and… more
    LinkedIn (01/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior Product Manager, On-Device Machine…

    Google (Sunnyvale, CA)
    …related field. + 10 years of experience in product management, with a focus on developer tools or AI /ML. + Experience taking a product from 0 to 1 and from ... networking, storage, compute hardware, databases, file systems, data analytics, cluster management, and other software infrastructure areas. Preferred qualifications:… more
    Google (02/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer…

    NVIDIA (Santa Clara, CA)
    …development team to build and develop test content in C++. + Experience of AI -Powered tools that rapidly accelerating automation. Those tools leverage ML, ... of high-quality software by focusing on code coverage and maintaining automation tools and infrastructure. + Contribute to the automation of manual test cases… more
    NVIDIA (03/13/25)
    - Save Job - Related Jobs - Block Source