- NVIDIA (Santa Clara, CA)
- …bold and visionary leaders to join us on an exciting journey as Senior SRE Engineering Leader . Lead our globally distributed clusters, ensuring ... such as life sciences and natural language processing. As SRE Leader , you will build and operate...Manage distributed, multi-location GPU clusters for AI research. + Lead a team of SREs, driving cluster operational excellence… more
- NVIDIA (Santa Clara, CA)
- …software and systems engineering , cloud-scale storage, data management, and services. SRE Senior Managers bring specialized expertise in areas such as ... As a Sr Manager in Site Reliability Engineering ( SRE ), you will lead...We Need To See: + Extensive experience in a senior -level role within Site Reliability Engineering , particularly… more
- NVIDIA (Santa Clara, CA)
- The Network Support and SRE team is in search of a seasoned Network SRE technical lead to help actualize the SRE vision for our network infrastructure. ... This role demands a unique blend of hands-on expertise in network operations, engineering , and observability. A proficient Network SRE is dedicated to enhancing… more
- EPAM Systems (San Jose, CA)
- As a ** SRE Lead - Toil Analysis** , you will be responsible for driving and implementing initiatives to reduce toil and improve overall system reliability. You ... to focus on innovation and strategic initiatives. Req.#628495452 **\#LI-DNI** **Responsibilities** + Lead a team of engineers and collaborate with other teams to… more
- NVIDIA (Santa Clara, CA)
- …is inspired to do their best work . We are seeking a highly skilled Senior Staff Infrastructure Performance Engineer to join our dynamic team. Our company is at the ... cloud. Join us in this exciting endeavor! What you will be doing: + Lead initiatives to transform IT Compute platform architecture to build new service offerings… more
- NVIDIA (Santa Clara, CA)
- …and cloud. Join us in this exciting endeavor! What you will be doing: + Lead initiatives and guide the development of On-Prem and Cloud infrastructure to ensure its ... data for reporting, alerting, and monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
- Wells Fargo (San Francisco, CA)
- **About this role:** Wells Fargo is seeking a Platform Site Reliability Engineering Senior Manager to help design durable and reliable services, automate ... production related activities. **In this role, you will:** + Lead by example - focus on key aspects of...technology background in software engineering and systems engineering to ensure the applications on-boarded to SRE… more
- Google (Sunnyvale, CA)
- …and hardware, SDN). Preferred qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Engineering ( SRE ) combines software and ... to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our… more
- Microsoft Corporation (Mountain View, CA)
- Microsoft is looking for a Senior Site Reliability Engineer ( SRE ) to support and expand Viva Engage. Viva Engage (formerly Yammer) is the industry-defining ... scale and modernize our tech stack. We need a SRE who knows how to manage the conflicting priorities...engineering efficiently via written and verbal communications. + Lead blameless postmortems for root cause and production resiliency.… more
- Gap Inc. (San Francisco, CA)
- …that they provide. As a people leader first and delivery manager second, this leader must build, inspire, and lead the technical teams. This person will have ... planet. Ready to learn fast, create with audacity and lead boldly? Join our team. **About the Role** This...traditional network engineering team to a platform engineering , automation, cloud first, SRE mindset +… more
- Abbott (Pleasanton, CA)
- …female executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer** As senior member of Site Reliability Engineering , you will play a ... Abbott is a global healthcare leader that helps people live more fully at...software systems. Expertise in the value and principles of SRE (SLI/SLO, Error Budgets, Toil, Observability, Release Engineering… more
- Cisco (San Jose, CA)
- Who We Are The SaaS Engineering organization within the Cisco Networking Group is committed to revolutionizing the deployment of generative AI applications. Our ... innovative solutions. Who You Are You are a seasoned leader with a robust background in driving the execution...objectives. As a mentor and guide, you inspire and lead your team of software engineers with empathy, fostering… more
- General Motors (Mountain View, CA)
- …Role:** The Cloud Platform Engineering and Services Team is seeking an experienced Senior Software Engineer to lead and work with engineering teams ... Engineering and Services Team is seeking an experienced Senior Software Engineer to lead and work...multi-cloud ecosystem. Additionally, this team works closely with the Engineering , Architecture and SRE teams to develop… more
- Cisco (San Jose, CA)
- Who We Are We are SaaS Engineering organization within the Cisco Networking Engineering . We are committed to revolutionizing the deployment of generative AI ... with AI/ML solutions. Who You Are You are a Senior Software development manager with a background in end-to-end...software for an innovative cloud service within Cisco Networking Engineering . You will be entrusted with the following key… more
- NVIDIA (Santa Clara, CA)
- …and Processes organization where you will be working as a Principal DevOps & SRE Engineer to support the design and implementation of Kubernetes solutions for the ... Actively participate in product workshops, roadmap and design sessions. Lead technical demos, whiteboards and working sessions. + Defend...in on-call support and critical issue coverage as a SRE engineer. + Take part in prototyping, crafting and… more
- NVIDIA (Santa Clara, CA)
- …within the organization. You will collaborate closely with teams, including engineering , product management, supply chain, for execution and alignment with ... organizational objectives. In partnership with senior IT leaders, you will be responsible for global...issues for quick resolution. What you'll be doing: + Lead the planning, execution, and monitoring of projects. Develop… more
- General Motors (Mountain View, CA)
- …and ensuring secure and compliant infrastructure as part of reliability engineering + Ability to lead technical decision-making, balancing reliability, ... dictated by the business. **The Role:** This is a senior technical leadership role, and we are looking for...experiences to life. As a key member of our SRE team, you'll have the opportunity to shape the… more
- American Express (Palo Alto, CA)
- …service Mesh configurations, traffic policies and security features * Collaborate with SRE and Platform Engineering teams to integrate observability tooling, ... **Description** **You Lead the Way. We've Got Your Back.** With...latest technologies and encourages you to back the broader engineering community through open source. And because we understand… more
- NVIDIA (Santa Clara, CA)
- …can build visualizations, to a once-in-a-lifetime opportunity. NVIDIA is seeking a Senior Manager of Network Deployments Engineering within the IT. ... role as you build strong teams that partner with engineering and operations teams across NVIDIA. We are looking...across NVIDIA. We are looking for a passionate network leader who enjoys designing and delivering best in class… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with ethernet switches, NICs ... is key to both product quality and interesting dynamic day-to-day work. SRE 's culture of diversity, intellectual curiosity, problem solving and openness is important… more