- Cisco (Milpitas, CA)
- …and multi-cloud environments. You will be actively a part of an engineering focused team that cultivates innovation, collaboration and diversity. Read more at: ... be working on operating the infrastructure and applications as part of Nexus Cloud engineering team. Nexus Cloud is a SaaS service that provides the easiest way to… more
- NVIDIA (Santa Clara, CA)
- As a Sr Manager in Site Reliability Engineering ( SRE ), you will lead a team dedicated to the design, construction, and maintenance of expansive ... What We Need To See: + Extensive experience in a senior-level role within Site Reliability Engineering , particularly in managing storage infrastructure. +… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering ( SRE ) is an engineering discipline that involves designing, building, and maintaining large-scale production systems ... and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. SRE professionals… more
- Google (Sunnyvale, CA)
- …in Computer Science or Engineering . + 1 year of people management experience. Site Reliability Engineering ( SRE ) combines software and systems ... to learn and grow. To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/ sre /book.html) or read a career profile… more
- Google (San Bruno, CA)
- …leadership. Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
- ServiceNow, Inc. (Santa Clara, CA)
- …Development, Systems Engineering , and Networking. We provide leadership to a strong Site Reliability Engineering ( SRE ) team. We attack problems ... skills to provide technical leadership for a team of on- site engineers who are responsible for the availability and...6+ years of experience in Linux enterprise service operations, SRE , or Systems Engineering . + An in-depth… more
- Google (Sunnyvale, CA)
- …projects that are well-defined under supervision. The Technical Program Manager (TPM) role within Site Reliability Engineering ( SRE ) is at the heart ... of fulfilling SRE 's mission: making things faster, more reliable, and preparing for the continued growth of Google's infrastructure. As a TPM, you ensure that systems and services are carefully planned and deployed, taking into account multiple variables… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high ... efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demands knowledge… more
- Google (Sunnyvale, CA)
- …languages. Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
- Google (Sunnyvale, CA)
- …of GCS components. + Work cross-functionally within the broader engineering community and Site Reliability Engineering ( SRE ) to deliver solutions for ... Minimum qualifications: + Bachelor's degree in Engineering , Computer Science or related technical field or...major platform changes which scaled GCS foundations - durability, reliability and performance - by orders of magnitude. As… more
- LinkedIn (Mountain View, CA)
- …LinkedIn is looking for a Staff Technical Program Manager to work within LinkedIn's Site Reliability Engineering ( SRE ) program management team. The ... is expected to perform using best practices for execution of our SRE projects as well as initiating, planning, and executing intermediate-to-large scale, cross… more
- EPAM Systems (San Jose, CA)
- As a ** SRE Lead - Toil Analysis** , you will be...+ Previous experience in a leadership role within a Site Reliability Engineering team + Proven ... implementing initiatives to reduce toil and improve overall system reliability . You will work closely with cross-functional teams to...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience… more
- NVIDIA (Santa Clara, CA)
- …What you'll be doing: + Work alongside our first-class Network Automation, Engineering , and Operations teams to understand current manual processes + Build services ... troubleshoots, and analyzes system disruptions and develop solutions for improved reliability + Owning and driving integrations with various service APIs such… more
- Abbott (Pleasanton, CA)
- …**The Opportunity** **Staff Site Reliability Engineer** As senior member of Site Reliability Engineering , you will play a critical role in ... medicines. Our 114,000 colleagues serve people in more than 160 countries. **Staff Site Reliability Engineer** **Working at Abbott** At Abbott, you can do… more
- NVIDIA (Santa Clara, CA)
- …culture? If so, we have a great opportunity for you! NVIDIA is seeking a Senior Site Reliability Engineer ( SRE ) for the Data Science & ML Platform(s) team. ... training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the… more
- NVIDIA (Santa Clara, CA)
- …where everyone is inspired to do their best work. We are looking for a Manager for Site Reliability Engineering to help build and lead its cloud service team ... SLOs. We partner with Service Owners to drive the reliability of the service. What you will be doing:...availability and performance of critically meaningful services in a live- site production environment, either as an SRE … more
- Palo Alto Networks (Santa Clara, CA)
- …is the market leader in this space. We are seeking development heavy Site Reliability Engineers to design, build, maintain, and scale production services ... architecture to improve scalability in networking like BGP, OSPF, service reliability , capacity, and performance + Collaborate with development teams to ensure… more
- JPMorgan Chase (Palo Alto, CA)
- …study plus 5 years of experience in the job offered or as Site Reliability Engineer, Systems Administrator, Application Engineer, Technical Services Specialist, ... DESCRIPTION: Duties: Responsible for engineering and operating JPMC's cloud infrastructure and platforms ensuring reliability , resiliency, security,… more
- Netflix (Los Gatos, CA)
- …pipeline team and day-to-day live-streaming operations for Netflix. As a Live Streaming Pipeline SRE , you will be responsible for the reliability of our live ... the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About… more
- EPAM Systems (San Jose, CA)
- We are looking for a candidate to join a multi-functional SRE team with the focus on Google Cloud Platform. You should have cloud engineering experience in such ... Program **About EPAM** + EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on… more