- Palo Alto Networks (Santa Clara, CA)
- …runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services ... This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability. Central Infrastructure & Platform Engineering Team |… more
- Google (Sunnyvale, CA)
- Principal Site Reliability Engineer, Cloud...of the Cloud AI portfolio is the Vertex AI platform; this platform runs both first-party models like ... AI search. **About the job** We are seeking a Principal Engineer for Cloud AI Reliability, Resiliency,...hardware, including Google Cloud's Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers. The… more
- JPMorgan Chase (Palo Alto, CA)
- …projects. You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer** at JPMorgan Chase within the ... qualifications, capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years applied experience.… more
- Palo Alto Networks (Santa Clara, CA)
- …all win with precision. **Your Career** We are looking for a proactive and innovative Site Reliability Engineer (SRE) to join our growing team. In this role, you ... secure infrastructure. **Your Experience** + Proven experience as a Site Reliability Engineer, DevOps Engineer, or in...self-starter and jump on new ideas to make the platform more stable, secure and feature-rich, this is your… more
- NVIDIA (Santa Clara, CA)
- …where everyone is inspired to do their best work. We are seeking a highly skilled Principal Staff SRE to join our dynamic team. Our company is at the forefront of ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
- JPMorgan Chase (Palo Alto, CA)
- …Agentic-based AIOps platforms. Our mission is to enhance scalability, security, and reliability for CDAO-hosted managed services. As a Principal Software ... data pipelines and workflows to collect, process, and analyze large volumes of platform data in real-time + Ensure the reliability, availability, and performance… more
- Palo Alto Networks (Santa Clara, CA)
- …you build **Your Experience** + 10+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a ... outcomes. **Your Career** We are seeking a highly skilled and experienced Platform & Infrastructure Engineer to join our core infrastructure development team. In… more
- LinkedIn (Mountain View, CA)
- …technical discipline, or equivalent practical experience + Experience working in the site reliability space + Professional experience working in the technology ... a critical role in designing and implementing end-to-end solutions for the platform. The organization encompasses Site Health and Operations, Embedded… more
- Palo Alto Networks (Santa Clara, CA)
- …includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, ... a large infrastructure and is one of the biggest GCP customers. As a Principal SRE, you'll be at the forefront of building and maintaining highly reliable, scalable,… more
- Celestica (San Jose, CA)
- …State/Province: California City: San Jose **General Overview** **Job Title:** Principal, Design Engineering **Functional Area:** Engineering (ENG) **Career Stream:** ... Engineering (ENG) **Role:** Principal (PRI) **Job Code:** PRI-ENG-DSGN **Job Band:** 12 **Direct/Indirect Indicator:** Indirect **Summary** Celestica is expanding… more
- Palo Alto Networks (Santa Clara, CA)
- …Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + Troubleshooting - Ability to effectively troubleshoot ... and monitoring tools and practices. **Your Impact** As a Principal SRE with the Prisma Cloud DevOps team, you...passion for technology and a strong motivation for high reliability at the service level + Data Platform… more
- Palo Alto Networks (Santa Clara, CA)
- …efficiency + On-Call Rotation - Participate in on-call support to maintain platform reliability and resolve production incidents **Your Experience** + 5+ ... Architect, deploy, and manage cloud-native solutions in Google Cloud Platform (GCP) to ensure scalability, reliability, and...years of experience in DevOps, Site Reliability Engineering, or Cloud Infrastructure roles… more
- Cisco (San Jose, CA)
- **Principal Applied AI Scientist** **Meet the Team** Splunk, a Cisco company, is building a safer, more resilient digital world with an end‑to‑end, full‑stack ... platform designed for hybrid, multi‑cloud environments. Join the AI...experience - designing and deploying foundation models that enhance reliability, strengthen security, prevent downtime, and deliver predictive insights… more
- Roche (South San Francisco, CA)
- …teams and operates in cross-functional squads and circles. **The Opportunity** The Principal Large Molecule DP Product Steward is responsible for driving seamless ... product steward holds primary accountability for ensuring DP product quality, reliability, and the execution of long-term strategic goals and operational targets… more
- Snap Inc. (Palo Alto, CA)
- …family, and the world; Lens Studio (https://ar.snap.com/lens-studio), an augmented reality platform that powers AR across Snapchat and other services; and its AR ... and always execute with privacy at the forefront. We're looking for a Principal Machine Learning Engineer to join the Content Relevance team Snap! What you'll… more
- Comcast (Sunnyvale, CA)
- …in determining the technical requirements for a machine learning application or platform. Leads the design and implementation of machine learning applications and ... architectures across hybrid Edge-Cloud environments, ensuring performance, cost-efficiency, and reliability. + **Research & Development:** Guide algorithmic innovation in areas… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …Cadence's Hardware Emulation Cloud to develop scalable and secure monitoring platform and processes to improve operations. Key Responsibilities: + Implement ... monitoring framework to improve infrastructure reliability, observability, and alerts. + Identifying and implementing automation...the company and want to make sure our job site is accessible to all. If you experience difficulty… more
- Cisco (San Jose, CA)
- …defenders on the front lines. Our team blends deep expertise in AI, cybersecurity, and platform engineering. We are driven by a shared belief that the only way to ... criteria to evaluate iterations. + **Evaluate model outputs** for accuracy, reliability, and usability, then prototype and deploy improvements based on structured… more
- Palo Alto Networks (Santa Clara, CA)
- …+ **Cross-Functional Influence:** Lead cross-org architectural alignment across product, Site Reliability Engineering (SRE), privacy, and Machine Learning ... the platform meets the highest standards of performance and reliability through technical execution and optimization. + **Distributed Systems Mastery:** Guide… more
- Cisco (Milpitas, CA)
- …solving complex technical challenges and setting new benchmarks for performance and reliability. The MIG Routing team is known for its supportive culture, commitment ... through customer delivery to develop networking software solutions and platform capabilities for Cisco's next generation Network Operation System. Your… more