  • Senior Production SRE

    NVIDIA (Santa Clara, CA)
    …and providing related services to support the overall stability and performance of the production systems. SRE at NVIDIA ensures that our internal and external ... Site Reliability Engineering (SRE) is an engineering discipline that involves designing, building, and maintaining large-scale production systems with high…
    NVIDIA (10/29/24)
  • Senior Network Operations Engineer

    NVIDIA (Santa Clara, CA)
    …Support and SRE team is in search of a seasoned Network Operations Engineer to help actualize the SRE vision for our network infrastructure. This role ... expertise in network operations, engineering, and observability. A proficient Network SRE is dedicated to enhancing network operations, diligently working to…
    NVIDIA (11/03/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external facing GPU cloud…
    NVIDIA (11/17/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …global scale PaaS and SaaS offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization which ... deploy, and run industrial Omniverse applications. Site Reliability Engineering (SRE) focuses on production health to prevent...also make feature development faster and safer. As a Senior Omniverse Cloud SRE, you will architect…
    NVIDIA (11/04/24)
  • Senior DevOps Engineer

    NVIDIA (Santa Clara, CA)
    …and Processes organization where you will be working as a Principal DevOps & SRE Engineer to support the design and implementation of Kubernetes solutions for ... design, implement & maintain Kubernetes environments from planning to production/deployment to support CI/CD pipeline for GitLab & Jenkins....in on-call support and critical issue coverage as an SRE engineer. + Take part in prototyping,…
    NVIDIA (11/13/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …improvements and automation to improve researchers' productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems ... to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and...Be part of an on-call rotation to support production systems + Write and review code, develop documentation…
    NVIDIA (09/25/24)
  • Senior DGX Cloud Software Engineer

    NVIDIA (Santa Clara, CA)
    …Engineers with previous experience building and running private and public clouds at production scale (i.e., beyond labs). In this organization, you will also ensure ... running large scale private or public cloud systems in production. + Experience in one or more of the...or developing multi-cloud infrastructure services. Experience teaching reliability (e.g., SRE) or more general cloud systems good practices to…
    NVIDIA (11/14/24)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    DGXC SRE at NVIDIA ensures that our internal and...running large scale private or public cloud systems in production. + Experience in one or more of the ... following: Python, Go, TypeScript, C/C++, Java + In-depth knowledge in one or more of Linux, Networking, Storage, and Containers. Ways to stand out from the crowd: + Experience building and integrating with incident tooling such as FireHydrant, Rootly,…
    NVIDIA (09/25/24)
  • Sr Staff Site Reliability Engineer (Cortex)

    Palo Alto Networks (Santa Clara, CA)
    …Reliability Engineering + DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for ... our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability...DevOps team to provide follow-the-sun operational coverage in the production of our SaaS product + Collaborate - Work…
    Palo Alto Networks (11/05/24)
  • Principal Site Reliability Engineer (Cloud…

    Palo Alto Networks (Santa Clara, CA)
    …Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the Cloud Manager team, you will be part of ... Bash, Java, Node.js and Go. **Your Impact** + Contribute to the success of SRE and DevOps + Develop expertise in new technologies + Work with developers, researchers,…
    Palo Alto Networks (11/10/24)