• Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play ... a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build… more
    NVIDIA (11/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our enterprise security ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
    NVIDIA (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    Google (Mountain View, CA)
    Senior Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Mountain View, CA, USA **Mid** Experience ... , written by Google SREs. + Read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a software engineer more
    Google (11/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Advanced** ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a Software Engineer more
    Google (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer

    Google (Sunnyvale, CA)
    Senior Systems Engineer , Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to… more
    Google (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …behind our data and observability platforms keep the whole engine running! We're hiring Site Reliability Engineers who want to work on the systems that power ... how our AI and data systems are built, set reliability standards, and solve scaling challenges that come with...large-scale production systems in roles such as SRE, Production Engineer , or Platform Engineer and 5+ years… more
    NVIDIA (12/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …We take great pride in providing excellent, comprehensive support to our customers! ​Sr Site Reliability Engineer in this role will significantly impact and ... in Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python + In-depth… more
    NVIDIA (10/28/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
    NVIDIA (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    LinkedIn (Sunnyvale, CA)
    …Streams IO Reliability Team, you would be you will be a reliability -focused software engineer responsible for driving success of our Pubsub ecosystems, ... in Mountain View, CA. Come join the Streams IO Reliability Team responsible for maintaining one of the largest...Streaming ecosystems on the planet. The LinkedIn Streams IO Reliability Team team is responsible for maintaining LinkedIn's pubsub… more
    LinkedIn (11/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior DevOps Service Reliability

    NVIDIA (Santa Clara, CA)
    …Support), you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other ... engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary levels of support for our… more
    NVIDIA (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    General Motors (Sunnyvale, CA)
    …Go, or Groovy + On-call and fire-fighting experience + Experience with modern site reliability practice including but not limited to post mortem, SLO/SLI, ... Tracing, Synthetic monitoring, etc. **What Will Give You A Competitive Edge (Preferred Qualifications)** + You have experience managing Azure cloud platform and are the domain expert + You are well known for being customer focused. + You had demonstrated low… more
    General Motors (11/05/25)
    - Save Job - Related Jobs - Block Source
  • (USA) Senior Director, Site

    Walmart (Sunnyvale, CA)
    …infrastructure management, or related area., SRE certification (for example, IBM Cloud Site Reliability Engineer )., We value candidates with a ... and household necessities **Qualifications** * 16+ years of experience in Site Reliability Engineering, Production Engineering, and Infrastructure Reliability more
    Walmart (11/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Quality & Reliability Engineer

    Amazon (Cupertino, CA)
    …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that… more
    Amazon (12/05/25)
    - Save Job - Related Jobs - Block Source
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... data for reporting, alerting, monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
    NVIDIA (11/20/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …equivalent role at a high-growth or web-scale technology company Suggested Skills + Site Reliability Engineering (SRE) + Leadership + Large scale infrastructure ... in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • Junior Site Reliability

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... that develops and maintains sophisticated internal cloud provisioning products. The team works with various other business units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their… more
    Insight Global (12/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    Oracle (Redwood City, CA)
    …experience Knowledge of Oracle databases would be a plus. **Responsibilities** Work with Site Reliability Engineering (SRE) team on the shared full stack ... characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.… more
    Oracle (12/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Supplier Quality Engineer

    Google (Sunnyvale, CA)
    Senior Supplier Quality Engineer , Battery and UPS _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... practical experience. + 8 years of experience in manufacturing, quality, reliability or product engineering. **Preferred qualifications:** + Master's degree in… more
    Google (12/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …of orders daily through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with ... criteria (for example, probability of failure, frequency of failure) to measure site reliability . Monitors site reliability conditions and new … more
    Walmart (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior Supplier Quality Engineer

    Google (Sunnyvale, CA)
    Senior Supplier Quality Engineer , Optics, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, ... practical experience. + 8 years of experience in manufacturing, quality, reliability or product engineering. **Preferred qualifications:** + Master's degree or PhD… more
    Google (11/28/25)
    - Save Job - Related Jobs - Block Source