• Senior Production SRE

    NVIDIA (Santa Clara, CA)
    …and providing related services to support the overall stability and performance of the production systems. SRE at NVIDIA ensures that our internal and external ... Site Reliability Engineering (SRE) is an engineering discipline that involves designing, building, and maintaining large-scale production systems with high…
    NVIDIA (10/29/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …them with the necessary resources and scale to foster innovation. We are seeking a Senior Site Reliability Engineer (SRE) to join our team. You'll be ... and availability of the platform, as well as applying SRE principles to improve production systems and...and performance is part of the role. As a Senior SRE at NVIDIA, you will have…
    NVIDIA (10/23/24)
  • Senior Network Operations Engineer

    NVIDIA (Santa Clara, CA)
    …Support and SRE team is in search of a seasoned Network Operations Engineer to help actualize the SRE vision for our network infrastructure. This role ... expertise in network operations, engineering, and observability. A proficient Network SRE is dedicated to enhancing network operations, diligently working to…
    NVIDIA (11/03/24)
  • Senior Staff Systems Engineer, Site…

    Google (Sunnyvale, CA)
    …+ Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, ... massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have…
    Google (10/31/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external facing GPU cloud…
    NVIDIA (10/23/24)
  • Senior DevOps Engineer

    NVIDIA (Santa Clara, CA)
    …and Processes organization where you will be working as a Principal DevOps & SRE Engineer to support the design and implementation of Kubernetes solutions for ... design, implement & maintain Kubernetes environments from planning to production/deployment to support CI/CD pipeline for Gitlab & Jenkins....in on-call support and critical issue coverage as a SRE engineer. + Take part in prototyping,…
    NVIDIA (11/01/24)
  • Senior Cloud Platform Software…

    NVIDIA (Santa Clara, CA)
    engineer owns their code - from development to commit to test to production. They will be responsible for supporting SRE teams with development support and ... Engineering (NVCNE) group's backend software team! As a Cloud Platform Software Engineer, you will work alongside architects, designers, frontend engineers, SREs and…
    NVIDIA (11/02/24)
  • Senior Software Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …trouble-shooting of compute hardware and networking equipment. As a software engineer, you will work with other software engineers, product architects, and product ... code - from development to commit to test to production, including operational support. We expect you...communication protocols (mutual-TLS, IPsec, or similar). + Knowledge of SRE principles (observability, SLOs, logging, etc.) Ways to stand…
    NVIDIA (11/01/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems ... to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and...Be part of an on call rotation to support production systems + Write and review code, develop documentation…
    NVIDIA (09/25/24)
  • Senior Engineer - Enterprise Cloud

    American Express (Palo Alto, CA)
    …to write modern scalable cloud native applications. **Job Description:** As a Software Engineer for Quality Engineering team of Enterprise Cloud, you will play a ... requires pro-active problem-solving skill to identify the issue before they impact production environments. You will also contribute to the continuous improvement of…
    American Express (10/17/24)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    DGXC SRE at NVIDIA ensures that our internal and...running large scale private or public cloud systems in production. + Experience in one or more of the ... following: Python, Go, Typescript, C/C++, Java + In depth knowledge in one or more of Linux, Networking, Storage, and Containers. Ways to stand out from the crowd: + Experience building and integrating with incident tooling such as FireHydrant, Rootly,…
    NVIDIA (09/25/24)
  • Senior DGX Cloud Software Engineer

    NVIDIA (Santa Clara, CA)
    …design developing tools for running large scale private or public cloud system in production. + Experience in one or more of the following: Python, Go, Perl or ... with or developing multi-cloud infrastructure services. + Experience teaching reliability (e.g. SRE) or more general cloud systems good practices to peers or to…
    NVIDIA (09/24/24)
  • Sr Staff Site Reliability Engineer (Cortex)

    Palo Alto Networks (Santa Clara, CA)
    …Reliability Engineering. + **DevOps/SRE Expertise:** 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation ... our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability...DevOps team to provide follow-the-sun operational coverage in the production of our SaaS product. + Collaborate: Work with…
    Palo Alto Networks (10/30/24)
  • Site Reliability Engineer

    JPMorgan Chase (Palo Alto, CA)
    …quality of technical engineering documentation. Debug and solve issues in a production environment. Participate in SRE on-call rotations and escalation ... years of experience in the job offered or as Site Reliability Engineer, Systems Engineer, Senior Technical Analyst, or related occupation. Skills Required:…
    JPMorgan Chase (10/18/24)