- NVIDIA (Santa Clara, CA)
- …and providing related services to support the overall stability and performance of the production systems. SRE at NVIDIA ensures that our internal and external ... Site Reliability Engineering (SRE) is an engineering discipline that involves designing, building, and maintaining large-scale production systems with high…
- NVIDIA (Santa Clara, CA)
- …them with the necessary resources and scale to foster innovation. We are seeking a Senior Site Reliability Engineer (SRE) to join our team. You'll be ... and availability of the platform, as well as applying SRE principles to improve production systems and...and performance is part of the role. As a Senior SRE at NVIDIA, you will have…
- Microsoft Corporation (Mountain View, CA)
- Microsoft is looking for a Senior Site Reliability Engineer (SRE) to support and expand Viva Engage. Viva Engage (formerly Yammer) is the industry-defining ... will be verified via a valid passport. **Preferred Qualifications:** + Experience applying SRE principles in a large production environment. + Proficiency in…
- NVIDIA (Santa Clara, CA)
- …Support and SRE team is in search of a seasoned Network Operations Engineer to help actualize the SRE vision for our network infrastructure. This role ... expertise in network operations, engineering, and observability. A proficient Network SRE is dedicated to enhancing network operations, diligently working to…
- Google (Sunnyvale, CA)
- …+ Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, ... massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have…
- NVIDIA (Santa Clara, CA)
- …global scale PaaS and SaaS offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization which ... deploy, and run industrial Omniverse applications. Site Reliability Engineering (SRE) focuses on production health to prevent...also make feature development faster and safer. As a Senior Omniverse Cloud SRE, you will architect…
- NVIDIA (Santa Clara, CA)
- …and Processes organization where you will be working as a Principal DevOps & SRE Engineer to support the design and implementation of Kubernetes solutions for ... design, implement & maintain Kubernetes environments from planning to production/deployment to support CI/CD pipeline for GitLab & Jenkins....in on-call support and critical issue coverage as an SRE engineer. + Take part in prototyping,…
- NVIDIA (Santa Clara, CA)
- …improvements and automation to improve researchers' productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems ... to both product quality and interesting dynamic day-to-day work. SRE's culture of diversity, intellectual curiosity, problem solving and...Be part of an on-call rotation to support production systems + Write and review code, develop documentation…
- American Express (Palo Alto, CA)
- …to write modern scalable cloud native applications. **Job Description:** As a Software Engineer for the Quality Engineering team of Enterprise Cloud, you will play a ... requires proactive problem-solving skills to identify issues before they impact production environments. You will also contribute to the continuous improvement of…
- NVIDIA (Santa Clara, CA)
- …Engineers with previous experience building and running private and public clouds at production scale (i.e., beyond labs). In this organization, you will also ensure ... running large scale private or public cloud systems in production. + Experience in one or more of the...or developing multi-cloud infrastructure services. Experience teaching reliability (e.g., SRE) or more general cloud systems good practices to…
- NVIDIA (Santa Clara, CA)
- DGXC SRE at NVIDIA ensures that our internal and...running large scale private or public cloud systems in production. + Experience in one or more of the ... following: Python, Go, TypeScript, C/C++, Java + In depth knowledge in one or more of Linux, Networking, Storage, and Containers. Ways to stand out from the crowd: + Experience building and integrating with incident tooling such as FireHydrant, Rootly,…
- Palo Alto Networks (Santa Clara, CA)
- …Reliability Engineering + DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for ... our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability...DevOps team to provide follow-the-sun operational coverage in the production of our SaaS product + Collaborate - Work…
- Palo Alto Networks (Santa Clara, CA)
- …Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the Cloud Manager team, you will be part of ... Bash, Java, NodeJS and Go. **Your Impact** + Contribute to the success of SRE and DevOps + Develop expertise in new technologies + Work with developers, researchers,…