- NVIDIA (Santa Clara, CA)
- As a Sr Manager in Site Reliability Engineering ( SRE ), you will lead a team dedicated to the design, construction, and maintenance of expansive ... What We Need To See: + Extensive experience in a senior-level role within Site Reliability Engineering , particularly in managing storage infrastructure. +… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering ( SRE ) is an engineering discipline that involves designing, building, and maintaining large-scale production systems ... and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. SRE professionals… more
- Google (Sunnyvale, CA)
- …in Computer Science or Engineering . + 1 year of people management experience. Site Reliability Engineering ( SRE ) combines software and systems ... to learn and grow. To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/ sre /book.html) or read a career profile… more
- Google (Sunnyvale, CA)
- …systems. Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
- Google (Sunnyvale, CA)
- …SDN). Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high ... efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge… more
- Palo Alto Networks (Santa Clara, CA)
- …Management:** Clear understanding of incident and alerts management in Site Reliability Engineering . + **DevOps/ SRE Expertise:** 5+ years of experience ... of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear...performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: +… more
- NVIDIA (Santa Clara, CA)
- …field, or equivalent experience. + 8+ years of industry experience in wireless site reliability engineering , wireless network operations, or related areas. ... This role demands a unique blend of hands-on expertise in network operations, engineering , and observability. A proficient Network SRE is dedicated to enhancing… more
- EPAM Systems (San Jose, CA)
- As a ** SRE Lead - Toil Analysis** , you will be...+ Previous experience in a leadership role within a Site Reliability Engineering team + Proven ... implementing initiatives to reduce toil and improve overall system reliability . You will work closely with cross-functional teams to...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience… more
- Abbott (Pleasanton, CA)
- …**The Opportunity** **Staff Site Reliability Engineer** As senior member of Site Reliability Engineering , you will play a critical role in ... medicines. Our 114,000 colleagues serve people in more than 160 countries. **Staff Site Reliability Engineer** **Working at Abbott** At Abbott, you can do… more
- NVIDIA (Santa Clara, CA)
- …with the necessary resources and scale to foster innovation. We are seeking a Senior Site Reliability Engineer ( SRE ) to join our team. You'll be instrumental ... training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the… more
- LinkedIn (Mountain View, CA)
- …expectation of participating in an oncall ~1x per month. Come join the Software Engineering SRE team responsible for maintaining one of the largest Streaming ... is a combination development and operational role ensuring the reliability for centralized Pubsub systems at LinkedIn. There will...working closely with our customers across all of LinkedIn Engineering . Additionally, as an embedded SRE , you… more
- NVIDIA (Santa Clara, CA)
- …designing and operating large scale compute infrastructure + Proven experience in site reliability engineering for high-performance computing environments ... and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture… more
- JPMorgan Chase (Palo Alto, CA)
- …of study plus 5 years of experience in the job offered or as Site Reliability Engineer, Systems Engineer, Senior Technical Analyst, or related occupation. Skills ... and drive efficiencies in systems and processes. Develop new cloud engineering strategies and implementations for the firm. Responsible for ensuring the… more
- Palo Alto Networks (Santa Clara, CA)
- …we all win with precision. **Your Career** We are looking for an exceptional Principal Site Reliability Engineer to enhance our ATP Infra team. This role will ... that will ensure the highest levels of availability and reliability of all our applications. We need creative and...SRE in design reviews and work cross-functionally with Engineering teams on operational readiness **Your Experience** + BS… more