- Celonis (San Francisco, CA)
- …of a highly technical, collaborative and creative team, with a focus on SRE & Software Engineering. + Responsible for the design, implementation, reliability and ... platform resiliency. + Share acquired knowledge and document accordingly while implementing SRE best practices. **The qualifications you need:** + A bachelors or… more
- DataRobot (San Francisco, CA)
- …drive greater impact and value from AI. The Director of Site Reliability Engineering ( SRE ) will lead our SRE team, ensuring the reliability, scalability, and ... CS, MIS, or equivalent experience + 7-8 years of relevant experience in a SRE or DevOps role with equivalent responsibilities + Demonstrated success managing a team… more
- EPAM Systems (San Francisco, CA)
- As a ** SRE Lead - Toil Analysis** , you will...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience ... and reducing operational overhead through automation + Strong understanding of SRE principles and best practices + Proficiency in programming and scripting… more
- Insight Global (Mountain View, CA)
- …worked their way into L2/L3 support, doing extensive scripting as they moved into the SRE function. This role is the right fit for someone who loves to be ... with monitoring tools such as Grafana or Prometheus -Experience working in a SRE or L2/L3 support (on-call environment) -Extremely motivated, excited to be part of… more
- Intuit (Mountain View, CA)
- …Our Senior Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improve, and enhance existing ... solutions as well as leverage engineering solutions to solve critical operational problems. You will also be responsible for growing and developing a high performing engineering team, who are driven and autonomous in nature. You will support the development of… more
- Intuit (Mountain View, CA)
- …Intuit Fintech is your trusted financial expert empowering financial prosperity for businesses and consumers through a convenient, powerful, AI-native fintech ... platform providing fast and easy access to funds at the time of need. We process millions of transactions every day across various payment methods. Millions of customers and merchants send billions of dollars moving at light-speed through our systems annually.… more
- Intuit (Mountain View, CA)
- …Intuit Fintech is your trusted financial expert empowering financial prosperity for businesses and consumers through a convenient, powerful, AI-native fintech ... platform providing fast and easy access to funds at the time of need. We process millions of transactions every day across various payment methods. Millions of customers and merchants send billions of dollars moving at light-speed through our systems annually.… more
- Abbott (Pleasanton, CA)
- …solutions for our customers. You will be responsible for implementing SRE improvement processes, procedures and influencing change within the organization. You ... environment and have DevOps or formal test automation, load testing or SRE experience. You will need extensive technical knowledge in the development, delivery,… more
- Google (San Francisco, CA)
- …analyzing, and troubleshooting large-scale distributed systems. Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run ... large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical...customer's needs and a fast rate of improvement. Additionally SRE 's will keep an ever-watchful eye on our systems… more
- Google (San Bruno, CA)
- …+ Master's degree in Computer Science or Engineering. Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run large-scale, ... massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical...customer's needs and a fast rate of improvement. Additionally SRE 's will keep an ever-watchful eye on our systems… more
- LiveRamp (San Francisco, CA)
- …to build and maintain products operational documentation and setting up product SRE practices.** + **Support Security and Compliance governance support in production ... environments.** + **Work in close collaboration with SRE team members and Engineering organizations based in California, Paris, London, Singapore, Australia and… more
- Cisco (San Francisco, CA)
- …role in shaping a more inclusive future for all. As a Site Reliability Engineer ( SRE ) Intern/Co-op on Meraki SRE , you will work on projects spanning everything ... monitor, scale and deploy our distributed cloud services. Meraki's SRE organization is key in building and scaling the...the underlying infrastructure. What we look for in our SRE intern/co-op candidates: * You are pursuing a technical… more
- McAfee, Inc. (San Jose, CA)
- …monitoring, logging, and alerting solutions to maintain system health and security. ** SRE Leadership:** + Drive SRE practices by implementing strategies that ... and guide junior engineers in cloud architecture, DevOps, and SRE best practices. + Act as a subject matter...subject matter expert on AWS cloud solutions, DevOps, and SRE practices within the organization. **Documentation & Reporting:** +… more
- LinkedIn (Mountain View, CA)
- …Technical Program Manager to work within LinkedIn's Site Reliability Engineering ( SRE ) program management team. The right candidate will have demonstrated solid ... to perform using best practices for execution of our SRE projects as well as initiating, planning, and executing...roadmap management and experience in Agile Methods within an SRE or infrastructure team Preferred Qualifications: * 2+ years… more
- PagerDuty (San Francisco, CA)
- …first. You demonstrate a deep knowledge of IT monitoring tools or within DevOps, SRE or IT Operations. You run the implementation process from design to delivery. ... ETL like activities. + Knowledge of infrastructure as code and DevOps SRE toolchains (GitHub, Terraform, Chef, Artifactory, JFrog, Nomad, Consul, Vault) + Ability… more
- Robert Half Technology (Alameda, CA)
- …On-Call Support for addressing security tickets and serve as a Security System SRE on a rotational basis Collaborate with engineering and operations teams toward ... implementing controls and processes that address identified gaps Identify and remediate security vulnerabilities and incidents Requirements What we expect: BS or equivalent. Minimum of 4 years of experience in security engineering. Strong understanding of… more
- LiveRamp (San Francisco, CA)
- …and libraries. **About you:** + 7+ years of experience in the fields of DevOps, SRE , or production engineering, with at least 2 years in a leadership or management ... position + Experienced architecting complex cloud environments that leverage industry best practices + Experience deploying and supporting containerized applications with Kubernetes, GKE, EKS, or the like + Experienced with deployment and monitoring of highly… more
- Amazon (San Francisco, CA)
- …Serverless ) with background in Software Development, Architecture or Operations ( SRE , DevOps) - Experienced in customer-facing or developer community facing roles ... - Ability to translate technical features into tangible customer business benefits Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender… more
- Autodesk (San Francisco, CA)
- …**Position Overview** We are looking for a **Principal Site Reliability Engineer ( SRE )** who is passionate about cloud infrastructure and proficient in MySQL ... database administration. This pivotal role centers around ensuring the utmost reliability, availability, and performance of our database systems, particularly MySQL databases hosted in AWS Aurora and other NoSQL databases. **Responsibilities** + Architect,… more
- EPAM Systems (San Francisco, CA)
- We are looking for a candidate to join a multi-functional SRE team with the focus on Google Cloud Platform. You should have cloud engineering experience in such ... areas acting as the SME on operation automation and monitoring, identifying TOIL within the team's existing systems and processes, and recommending, and implementing automated solutions to reduce TOIL and improve the efficiency and effectiveness of the team.… more