- Abbott (Pleasanton, CA)
- …generic medicines. Our 114,000 colleagues serve people in more than 160 countries. **Staff Site Reliability Engineer ** **Working at Abbott** At Abbott, you ... work for diversity, working mothers, female executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer ** As senior member of Site … more
- Palo Alto Networks (Santa Clara, CA)
- …operates a vast hybrid infrastructure and is among the largest GCP customers. As a Site Reliability Engineer on the CDSS Advanced URL Filtering team, you ... or a related field, with 8+ years of hands-on industry experience in Site Reliability Engineering or a similar role managing and improving complex systems at… more
- NVIDIA (Santa Clara, CA)
- …and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big ... and operating large scale compute infrastructure + Proven experience in site reliability engineering for high-performance computing environments with operational… more
- NVIDIA (Santa Clara, CA)
- …global scale PaaS and SaaS offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization ... people in the world working for us. Are you a creative and autonomous Site Reliability Engineer , who loves challenges? Do you have a genuine passion for… more
- Cisco (San Jose, CA)
- …success of the next wave of innovation in the security industry. Your Impact As a Site Reliability Engineer , you'll be operating the next generation of Cloud ... improving our playbooks and streamlining our processes to improve efficiency. As an engineer within Umbrella's FedRAMP SRE team, your success in this role is highly… more
- Palo Alto Networks (Santa Clara, CA)
- …where we all win with precision. **Your Career** We are looking for an exceptional Sr Site Reliability Engineer to enhance our ATP Infra team. This role will ... tools, and processes that will ensure the highest levels of availability and reliability of all our applications. We need creative and innovative problem solvers who… more
- Insight Global (Santa Clara, CA)
- Job Description Insight Global is looking for a Site Reliability Engineer to work for one of our largest technology clients in the Bay Area. Some ... responsibilities include: at you'll be doing: . Fleet monitoring & recovery of assets in a private cloud environment that houses several compute servers GPUs. . Specific focus on building and stabilizing our virtualization infrastructure of ESXi, KVM and… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial ... role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build automation tools,… more
- General Motors (Sunnyvale, CA)
- …for automakers and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General motors, you'll join a dedicated ... team focused on enhancing the reliability , efficiency, and scalability of our distributed systems. We...closely with software development teams, acting as specialists in reliability and production engineering, with a focus on automation,… more
- Palo Alto Networks (Santa Clara, CA)
- …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 5+ years of experience ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...as a DevOps/SRE engineer with a passion for technology and a strong… more
- NVIDIA (Santa Clara, CA)
- …of study. + 12+ years of experience in Software Development and/or Site Reliability Engineering/Production Engineering. + Strong software development using ... lasting impact on the world. As a Sr Staff Engineer , you will drive the design and development of...operating our existing infrastructure to the highest level of reliability and security. You will work side by side… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Netflix (Los Gatos, CA)
- …the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About ... a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting… more
- Palo Alto Networks (Santa Clara, CA)
- …infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a team supporting the services ... This includes automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab… more
- NVIDIA (Santa Clara, CA)
- …intelligence. We are seeking a highly skilled and experienced Staff Software Engineer to lead the design, deployment, and management of our large-scale GPU ... clusters. These clusters will power AI workloads across multiple teams and projects, making a significant impact on the future of machine learning and artificial intelligence at NVIDIA. Join our engineering team and collaborate with researchers, AI engineers,… more
- Google (Sunnyvale, CA)
- …the US. + Must be able to start a full-time role in 2020. Site Reliability (https://landing.google.com/sre/) Software engineers working in Site ... value of new algorithms that improve a product's capabilities, speed, efficiency and reliability or skill in isolating problems to a database subsystem. + Working… more
- Google (Sunnyvale, CA)
- …Preferred qualifications: + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering ... Google Cloud's services-both our internally critical and our externally-visible systems-have reliability , uptime appropriate to customer's needs and a fast rate of… more
- LinkedIn (Sunnyvale, CA)
- …problems at the pace of LinkedIn's accelerating growth. It's highly-visible work that impacts our site every day, and we try to do it in a way that makes our ... lives easier. Come join us in this mission supporting LinkedIn's reach to every member of the global workforce. At LinkedIn, we trust each other to do our best work where it works best for us and our teams. This role offers a hybrid work option, meaning you… more
- Meta (Sunnyvale, CA)
- …engineering efforts underway in the company.Relevant industry experience is important (Software Engineer , Site Reliability Engineer (SRE), Systems ... **Summary:** Meta is seeking an experienced engineer to join our Production Engineering team. Production... Engineer , DevOps Engineer , Network Engineer , or similar role), but ultimately less so than… more
- Meta (Sunnyvale, CA)
- …backgrounds, from new grads to industry experts. Relevant industry experience is important (Software Engineer , Site Reliability Engineer (SRE), Systems ... Engineer ,, DevOps Engineer , Network Engineer , or similar role), but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted… more