- Google (Kirkland, WA)
- Senior Staff Software Engineer, SRE , ML Fleet Systems _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA; +2 more; +1 more **Advanced** Experience ... consensus across organizational boundaries. **About the job** Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run… more
- Microsoft Corporation (Redmond, WA)
- …Engineer. This role is a dynamic blend of Platform Engineering, DevOps/ SRE , and Big Data Infrastructure Engineering, focused on enabling large-scale data ... Infrastructure for mission-critical AI applications. + Champion DevOps and SRE best practices-automated deployments, service monitoring, and incident response. +… more
- Meta (Redmond, WA)
- …Relevant industry experience is important (Software Engineer, Site Reliability Engineer ( SRE ), Systems Engineer, DevOps Engineer, Network Engineer, or similar role), ... support and lead incident root cause analysis through multiple data engineering layers ( compute , storage, network) for GPU clusters and act as a final escalation… more
- Microsoft Corporation (Redmond, WA)
- …and brings them to the attention of their Site Reliability Engineering ( SRE ) and/or product engineering teams. + Utilizes insights from performance and resource ... of component and feature code, or if changes to compute resources are required. Models the predicted effect of...Models the predicted effect of changes to code and/or compute resources across components or features to document the… more