- EPAM Systems (San Jose, CA)
- As a ** SRE Lead - Toil Analysis** , you will...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience ... and reducing operational overhead through automation + Strong understanding of SRE principles and best practices + Proficiency in programming and scripting… more
- Insight Global (Mountain View, CA)
- …worked their way into L2/L3 support, doing extensive scripting as they moved into the SRE function. This role is the right fit for someone who loves to be ... with monitoring tools such as Grafana or Prometheus -Experience working in a SRE or L2/L3 support (on-call environment) -Extremely motivated, excited to be part of… more
- Abbott (Pleasanton, CA)
- …solutions for our customers. You will be responsible for implementing SRE improvement processes, procedures and influencing change within the organization. You ... environment and have DevOps or formal test automation, load testing or SRE experience. You will need extensive technical knowledge in the development, delivery,… more
- LinkedIn (Mountain View, CA)
- …career growth. Join us to challenge yourself with work that matters. Streaming SRE team is a combination development and operational role ensuring the reliability ... oncall ~1x per month. Come join the Software Engineering SRE team responsible for maintaining one of the largest...ecosystems on the planet including Kafka. The LinkedIn Streaming SRE team is responsible for maintaining LinkedIn's Kafka, Samza,… more
- McAfee, Inc. (San Jose, CA)
- …monitoring, logging, and alerting solutions to maintain system health and security. ** SRE Leadership:** + Drive SRE practices by implementing strategies that ... and guide junior engineers in cloud architecture, DevOps, and SRE best practices. + Act as a subject matter...subject matter expert on AWS cloud solutions, DevOps, and SRE practices within the organization. **Documentation & Reporting:** +… more
- Microsoft Corporation (Mountain View, CA)
- Microsoft is looking for a Senior Site Reliability Engineer ( SRE ) to support and expand Viva Engage. Viva Engage (formerly Yammer) is the industry-defining social ... scale and modernize our tech stack. We need a SRE who knows how to manage the conflicting priorities...contribute to the overall growth and development of the SRE team. + Stay current with industry trends, emerging… more
- Saildrone (Alameda, CA)
- …and maintain comprehensive documentation of monitoring setups, incident responses, and SRE best practices. * Capacity Planning: Collaborate on capacity planning ... * Mentorship: Provide guidance and mentorship to our new SRE team and to the Software group as a...observability, and incident management. Minimum Experience * 8+ years SRE experience. BA/BS in related field or equivalent experience.… more
- Cisco (San Jose, CA)
- …Services as a service and solving all technical challenges it offers? Does SRE and Reliability engineering, solving Production and customer issues excite you? If you ... role, you will be part of the Managed Services SRE team and work on: * Multi-Region config and...Multi-Region config and maintenance of Stateful services Infrastructure. * SRE - Continuous Tuning Optimizations of Managed Data Services.… more
- Meta (Fremont, CA)
- …industry experience is important (Software Engineer, Site Reliability Engineer ( SRE ), Systems Engineer,, DevOps Engineer, Network Engineer, or similar role), ... but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted waters every day at Meta in Production Engineering, and we are always learning.This position is full-time. **Required Skills:** Production Engineering… more
- Wells Fargo (San Leandro, CA)
- …+ Understanding of Agile, CI/CD, DevOps concepts and Site Reliability Engineer ( SRE ) principles + Knowledge on container-based solution services, have handled 2-3 ... large scale Kubernetes based infrastructure build out, provisioning of services in Azure or GCP + Experience in scripting (Shell, Python) + Basic understanding of networking, firewalls, load balancing concepts (IP, DNS, Guardrails, Vnets) and exposure to… more
- Meta (Menlo Park, CA)
- …industry experience is important (Software Engineer, Site Reliability Engineer ( SRE ), Systems Engineer, DevOps Engineer, Network Engineer, or similar role), ... but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted waters every day at Meta in Production Engineering, and we are always learning. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services… more
- Zoom (San Jose, CA)
- …the globe Develops tools to automate and improve production support tasks; SRE monitoring for critical systems Implement advanced DevOps technologies and best ... practices to continue refine our global deployment infrastructure with goals to improve deployment velocity, reliability Collaborate with other functional teams: R&D, Security, Data team to ensure production services reliability and availability Provide… more
- Zoom (San Jose, CA)
- …What we're looking for + 5+ years in a Site Reliability Engineer ( SRE ) or DevOps Engineer role + Experience with containerization and orchestration tools (eg ... Docker, Kubernetes) + Have proficiency with Python and Shell scripting, and configuration management tools (eg Ansible) + Have experience installing, configuring, and monitoring Linux systems in physical data centers + Experience with CI/CD pipelines and… more
- Zoom (San Jose, CA)
- …What we're looking for + 5+ years in a Site Reliability Engineer ( SRE ) or DevOps Engineer role + Experience with containerization and orchestration tools (eg ... Docker, Kubernetes) + Have proficiency with Python and Shell scripting, and configuration management tools (eg Ansible) + Have experience installing, configuring, and monitoring Linux systems in physical data centers + Experience with CI/CD pipelines and… more
- EPAM Systems (San Jose, CA)
- …platform architecture and design standards complement the support model + Works with the SRE team on the support of the platform **Requirements** + 5+ years of ... experience in DevOps with an enterprise-level cloud environment + 5+ years of experience with Azure Cloud Services + Advanced knowledge of PowerShell scripting and its application within the Azure framework + In-depth understanding of Azure PowerShell modules… more
- Cisco (San Jose, CA)
- …close on 10/04/24 What You'll Do * Design, develop, and implement high quality SRE applications and tools for Cisco SDWAN management layer. * Collaborate with team ... members to determine the root cause of problems and the best course of action to resolve problems and avoid them in the future. * Maintain cloud-hosted development, staging, and production environments and their monitoring and alerting systems. * Constantly… more
- EPAM Systems (San Jose, CA)
- …Cloud native development, Cloud managed services, Cloud Governance, CCoE, DevOps, and SRE + Executive relationships skills + Strategic Sales Pursuit Management and ... multi-team orchestration experience + Direct or indirect sales experience with aggressive revenue targets + Ability to understand the 'big picture' and convert it into an actionable plan + Experience working with Fortune 1000 customers **We offer** + Medical,… more
- Cisco (San Jose, CA)
- …products in public cloud infrastructure such as AWS, Azure, or GCP * Work with SRE and operation teams on SaaS offering products * Ability to lead and mentor ... engineers globally, champion creative technical trends, drive decisions collaboratively, resolve conflicts, and ensure diligent follow-through * Experience collaborating with senior engineers, product management, senior management, and other multi-functional… more
- Cisco (San Jose, CA)
- …a public cloud infrastructure such as AWS, Azure, or GCP. * Work with SRE and operation teams on SaaS products. * Drive decisions collaboratively, resolve conflicts ... and ensure follow through with diligence. Why Cisco? #WeAreCisco. We are all unique, but collectively we bring our talents to work as a team, to develop innovative technology and power a more inclusive, digital future for everyone. How do we do it? Well, for… more
- LinkedIn (Mountain View, CA)
- …and AI platform infrastructure space that span across various platforms, infrastructure, SRE , and product domains, covering both software and hardware aspects. *Lead ... the technical programs through the full lifecycle process from initiating and planning, to execution, monitoring, and landing impact. *Define, scope, and track ancillary operations projects including capacity planning, hardware procurement, and deployment.… more