• Senior Manager - Storage Production…

    NVIDIA (Santa Clara, CA)
    As a Sr Manager in Site Reliability Engineering ( SRE ), you will lead a team dedicated to the design, construction, and maintenance of expansive ... What We Need To See: + Extensive experience in a senior-level role within Site Reliability Engineering , particularly in managing storage infrastructure. +… more
    NVIDIA (09/12/24)
    - Save Job - Related Jobs - Block Source
  • Senior Production SRE Engineer - Storage

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) is an engineering discipline that involves designing, building, and maintaining large-scale production systems ... and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. SRE professionals… more
    NVIDIA (08/10/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager II,…

    Google (Sunnyvale, CA)
    …in Computer Science or Engineering . + 1 year of people management experience. Site Reliability Engineering ( SRE ) combines software and systems ... to learn and grow. To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/ sre /book.html) or read a career profile… more
    Google (10/05/24)
    - Save Job - Related Jobs - Block Source
  • Technical Program Manager II, Site

    Google (Sunnyvale, CA)
    …projects that are well-defined under supervision. The Technical Program Manager (TPM) role within Site Reliability Engineering ( SRE ) is at the heart ... of fulfilling SRE 's mission: making things faster, more reliable, and preparing for the continued growth of Google's infrastructure. As a TPM, you ensure that systems and services are carefully planned and deployed, taking into account multiple variables… more
    Google (08/31/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high ... efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge… more
    NVIDIA (08/10/24)
    - Save Job - Related Jobs - Block Source
  • Software Developer II, Site

    Google (Mountain View, CA)
    …languages. Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
    Google (09/27/24)
    - Save Job - Related Jobs - Block Source
  • Staff Technical Program Manager

    LinkedIn (Mountain View, CA)
    …LinkedIn is looking for a Staff Technical Program Manager to work within LinkedIn's Site Reliability Engineering ( SRE ) program management team. The ... is expected to perform using best practices for execution of our SRE projects as well as initiating, planning, and executing intermediate-to-large scale, cross… more
    LinkedIn (08/02/24)
    - Save Job - Related Jobs - Block Source
  • Senior Wireless Network SRE

    NVIDIA (Santa Clara, CA)
    …field, or equivalent experience. + 8+ years of industry experience in wireless site reliability engineering , wireless network operations, or related areas. ... This role demands a unique blend of hands-on expertise in network operations, engineering , and observability. A proficient Network SRE is dedicated to enhancing… more
    NVIDIA (10/02/24)
    - Save Job - Related Jobs - Block Source
  • Senior SRE Architect

    EPAM Systems (San Jose, CA)
    As a ** SRE Lead - Toil Analysis** , you will be...+ Previous experience in a leadership role within a Site Reliability Engineering team + Proven ... implementing initiatives to reduce toil and improve overall system reliability . You will work closely with cross-functional teams to...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience… more
    EPAM Systems (09/19/24)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    Abbott (Pleasanton, CA)
    …**The Opportunity** **Staff Site Reliability Engineer** As senior member of Site Reliability Engineering , you will play a critical role in ... medicines. Our 114,000 colleagues serve people in more than 160 countries. **Staff Site Reliability Engineer** **Working at Abbott** At Abbott, you can do… more
    Abbott (08/18/24)
    - Save Job - Related Jobs - Block Source
  • Staff Engineer, Applications, Site

    General Motors (Mountain View, CA)
    …BS/MS/PhD in Computer Science/ Engineering + 8+ years of experience engineering reliability + Experience building and operating enterprise cloud applications ... experiences to life. As a key member of our SRE team, you'll have the opportunity to shape the...and ensuring secure and compliant infrastructure as part of reliability engineering + Ability to lead technical… more
    General Motors (09/28/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer, AI…

    NVIDIA (Santa Clara, CA)
    …with the necessary resources and scale to foster innovation. We are seeking a Senior Site Reliability Engineer ( SRE ) to join our team. You'll be instrumental ... training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the… more
    NVIDIA (09/24/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer, Site

    LinkedIn (Mountain View, CA)
    …expectation of participating in an oncall ~1x per month. Come join the Software Engineering SRE team responsible for maintaining one of the largest Streaming ... is a combination development and operational role ensuring the reliability for centralized Pubsub systems at LinkedIn. There will...working closely with our customers across all of LinkedIn Engineering . Additionally, as an embedded SRE , you… more
    LinkedIn (10/02/24)
    - Save Job - Related Jobs - Block Source
  • Principal Site Reliability Engineer…

    Palo Alto Networks (Santa Clara, CA)
    …in infrastructure automation tasks using Python and shell scripting. + Experience in Site Reliability Engineering , Production Engineering , or DevOps ... one of the largest GCP customers. As a Principal Site Reliability Engineer, you will be part...Go. **Your Impact** + Contribute to the success of SRE and DevOps + Develop expertise in new technologies… more
    Palo Alto Networks (10/03/24)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    We are seeking a highly motivated Site Reliability Engineer ( SRE ) to join our Applications Infrastructure organization. This team is responsible for ... Metropolis, ACE, and Riva hosted in the cloud. The SRE role focuses on ensuring production health to prevent...to prevent outages by defining and developing robust software engineering solutions and practices. These efforts simplify the operating… more
    NVIDIA (10/04/24)
    - Save Job - Related Jobs - Block Source
  • Principal Site Reliability Engineer…

    Palo Alto Networks (Santa Clara, CA)
    …is the market leader in this space. We are seeking development heavy Site Reliability Engineers to design, build, maintain, and scale production services ... architecture to improve scalability in networking like BGP, OSPF, service reliability , capacity, and performance + Collaborate with development teams to ensure… more
    Palo Alto Networks (10/04/24)
    - Save Job - Related Jobs - Block Source
  • Sr Staff Site Reliability Engineer…

    Palo Alto Networks (Santa Clara, CA)
    …automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, ... Go. **Your Impact** + Contribute to the success of SRE and DevOps + Develop expertise in new technologies...as an engineer in Infrastructure, Operations, DevOps, or System Engineering + 3+ years building high availability, scalable cloud-native… more
    Palo Alto Networks (09/28/24)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer L5 - Live…

    Netflix (Los Gatos, CA)
    …pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE , you will be responsible for the reliability of our live ... the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About… more
    Netflix (07/25/24)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    Job Description Insight Global is looking for a seasoned SRE to join one of our largest technology clients' multifaceted and fast-paced Infrastructure, Planning and ... organization where you will be working as a Senior SRE Engineer. The position will be part of a...working in conjunction with various teams such as software engineering to deploy these new products and manage our… more
    Insight Global (09/26/24)
    - Save Job - Related Jobs - Block Source
  • Lead Site Reliability Engineer…

    EPAM Systems (San Jose, CA)
    We are looking for a candidate to join a multi-functional SRE team with the focus on Google Cloud Platform. You should have cloud engineering experience in such ... Program **About EPAM** + EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on… more
    EPAM Systems (08/29/24)
    - Save Job - Related Jobs - Block Source