• AI K8s Infrastructure

    NVIDIA (Santa Clara, CA)
    We are looking for a highly motivated k8s AI infrastructure generalist to join our team in the one of the most expert k8s organization. There is an ... Please apply if you are passionate about Kubernetes, building k8s infrastructure automation and deployment tools and...and craft new ways to improve availability of our Cloud Native AI Platform by automating critical… more
    NVIDIA (10/11/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI Platform Engineer

    NVIDIA (Santa Clara, CA)
    …from the crowd: + Experience with large-scale distributed AI systems on cloud or on-premises infrastructure using NVIDIA hardware. + Extensive experience in ... This role focuses on ensuring optimal efficiency across NVIDIA's AI infrastructure . What you'll be doing: +...Slurm workload manager and K8s . + Understanding of parallel programming and experience with… more
    NVIDIA (11/16/24)
    - Save Job - Related Jobs - Block Source
  • Senior Product Architect, Cloud Native

    NVIDIA (Santa Clara, CA)
    …member of our team, you'll work across all layers of the AI Infrastructure stack, from operating systems to cloud native orchestration, ML platforms, and ... + Stay at the forefront of technology trends in AI Infrastructure , MLOps, and GenAI applications, proposing...of complex systems, networks, and storage + Expertise in Cloud Native Architectures, K8s Operators, Ansible, Go,… more
    NVIDIA (10/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI -HPC Cluster Engineer

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI /HPC Infrastructure team, you will provide leadership in the design and implementation ... graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that… more
    NVIDIA (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Principal Engineer, Distributed Systems - NIM…

    NVIDIA (Santa Clara, CA)
    …are easy to use, highly performant and tested in all deployment scenarios, in the cloud , on customer's self hosted infrastructure and locally on all NVIDIA GPUs. ... and build a software factory that will take an AI model and create deployable services across Cloud...across organizational boundaries + Deep technical expertise in Microservices, K8s , Cloud Endpoints, Temporal, Helm, Prometheus, Kafka… more
    NVIDIA (10/02/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Customer Success Engineer

    Dynatrace (Mountain View, CA)
    …Davis hypermodal AI , to help our customers modernize and automate cloud operations, deliver software faster and more securely, and enable flawless digital ... intelligent automation from data. This enables innovators to modernize and automate cloud operations, deliver software faster and more securely, and ensure flawless… more
    Dynatrace (10/05/24)
    - Save Job - Related Jobs - Block Source
  • Senior MLOps Engineer, GenAI Framework

    NVIDIA (Santa Clara, CA)
    …systems. + Hands on experience with DL frameworks (PyTorch, JAX, Tensorflow) + Cluster/ cloud technologies, eg: SLURM, Lustre, k8s + Experience with HPC hardware ... Megatron Core) team. NVIDIA NeMo is an open-source, scalable and cloud -native framework built for researchers and developers working on Large Language… more
    NVIDIA (10/08/24)
    - Save Job - Related Jobs - Block Source