- NVIDIA (Santa Clara, CA)
- …and providing related services to support the overall stability and performance of the production systems. SRE at NVIDIA ensures that our internal and external ... Site Reliability Engineering ( SRE ) is an engineering discipline that involves designing, building, and maintaining large-scale production systems with high… more
- General Motors (Mountain View, CA)
- …communities where we live and deliver a better future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements of the ... innovate! **What You'll Do** + Implement scalable, reliable, secure SRE and Observability platform to monitor health of our... and Observability platform to monitor health of our production system and provide a holistic view of the… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... reviews, assist in root cause identification, and write RCA reports. + Deliver SRE solutions in a globally distributed, multi-cloud hybrid environment - AWS, GCP,… more
- JPMorgan Chase (Palo Alto, CA)
- …pushing the envelope to enhance, build, and deliver top-notch technology products. As a Senior Lead Software Engineer at JPMorgan Chase within the (insert LOB or ... teams, contractors, and vendors + Develops secure and high-quality production code, and reviews and debugs code written by...with Apache Kafka, MQ, or Kinesis. + Collaborate with SRE /DevOps teams to deploy and monitor OLTP databases in… more
- NVIDIA (Santa Clara, CA)
- …improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of how our systems ... to both product quality and interesting dynamic day-to-day work. SRE 's culture of diversity, intellectual curiosity, problem solving and...Be part of an on call rotation to support production systems + Write and review code, develop documentation… more
- Walmart (Sunnyvale, CA)
- …bring:** + 6+ years in a software development, DevOps role, or SRE role. + Experience in designing, investigating, analyzing, and troubleshooting large-scale ... client-server protocols along the way. Experience administering Linux systems in a production environment. + Programming experience in one or more of the following… more
- LinkedIn (Mountain View, CA)
- …automation is a key component to operating large-scale systems. We are looking for a Senior Site Reliability Engineer to make a real impact on our production ... productivity and Business Applications at LinkedIn. Productivity Engineering Site Reliability Engineers ( SRE ) at LinkedIn fill the critical role of ensuring that our… more