• Senior SRE Architect

    EPAM Systems (San Jose, CA)
    As a ** SRE Lead - Toil Analysis** , you will...to date on industry trends and best practices in SRE and automation **Requirements** + 10+ years of experience ... and reducing operational overhead through automation + Strong understanding of SRE principles and best practices + Proficiency in programming and scripting… more
    EPAM Systems (10/31/24)
    - Save Job - Related Jobs - Block Source
  • SRE

    Insight Global (Mountain View, CA)
    …worked their way into L2/L3 support, doing extensive scripting as they moved into the SRE function. This role is the right fit for someone who loves to be ... with monitoring tools such as Grafana or Prometheus -Experience working in a SRE or L2/L3 support (on-call environment) -Extremely motivated, excited to be part of… more
    Insight Global (11/12/24)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    Abbott (Pleasanton, CA)
    …solutions for our customers. You will be responsible for implementing SRE improvement processes, procedures and influencing change within the organization. You ... environment and have DevOps or formal test automation, load testing or SRE experience. You will need extensive technical knowledge in the development, delivery,… more
    Abbott (08/18/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, Site Reliability

    LinkedIn (Mountain View, CA)
    …career growth. Join us to challenge yourself with work that matters. Streaming SRE team is a combination development and operational role ensuring the reliability ... oncall ~1x per month. Come join the Software Engineering SRE team responsible for maintaining one of the largest...ecosystems on the planet including Kafka. The LinkedIn Streaming SRE team is responsible for maintaining LinkedIn's Kafka, Samza,… more
    LinkedIn (11/02/24)
    - Save Job - Related Jobs - Block Source
  • Sr. AWS Cloud & DevOps Architect - Remote

    McAfee, Inc. (San Jose, CA)
    …monitoring, logging, and alerting solutions to maintain system health and security. ​ ** SRE Leadership:** + Drive SRE practices by implementing strategies that ... and guide junior engineers in cloud architecture, DevOps, and SRE best practices. + Act as a subject matter...subject matter expert on AWS cloud solutions, DevOps, and SRE practices within the organization. **Documentation & Reporting:** +… more
    McAfee, Inc. (10/22/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    Microsoft Corporation (Mountain View, CA)
    Microsoft is looking for a Senior Site Reliability Engineer ( SRE ) to support and expand Viva Engage. Viva Engage (formerly Yammer) is the industry-defining social ... scale and modernize our tech stack. We need a SRE who knows how to manage the conflicting priorities...contribute to the overall growth and development of the SRE team. + Stay current with industry trends, emerging… more
    Microsoft Corporation (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    Saildrone (Alameda, CA)
    …and maintain comprehensive documentation of monitoring setups, incident responses, and SRE best practices. * Capacity Planning: Collaborate on capacity planning ... * Mentorship: Provide guidance and mentorship to our new SRE team and to the Software group as a...observability, and incident management. Minimum Experience * 8+ years SRE experience. BA/BS in related field or equivalent experience.… more
    Saildrone (10/29/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer

    Cisco (San Jose, CA)
    …Services as a service and solving all technical challenges it offers? Does SRE and Reliability engineering, solving Production and customer issues excite you? If you ... role, you will be part of the Managed Services SRE team and work on: * Multi-Region config and...Multi-Region config and maintenance of Stateful services Infrastructure. * SRE - Continuous Tuning Optimizations of Managed Data Services.… more
    Cisco (10/22/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineering

    Meta (Fremont, CA)
    …industry experience is important (Software Engineer, Site Reliability Engineer ( SRE ), Systems Engineer,, DevOps Engineer, Network Engineer, or similar role), ... but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted waters every day at Meta in Production Engineering, and we are always learning.This position is full-time. **Required Skills:** Production Engineering… more
    Meta (11/08/24)
    - Save Job - Related Jobs - Block Source
  • Cloud Engineering Associate IaC

    Wells Fargo (San Leandro, CA)
    …+ Understanding of Agile, CI/CD, DevOps concepts and Site Reliability Engineer ( SRE ) principles + Knowledge on container-based solution services, have handled 2-3 ... large scale Kubernetes based infrastructure build out, provisioning of services in Azure or GCP + Experience in scripting (Shell, Python) + Basic understanding of networking, firewalls, load balancing concepts (IP, DNS, Guardrails, Vnets) and exposure to… more
    Wells Fargo (11/06/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineer

    Meta (Menlo Park, CA)
    …industry experience is important (Software Engineer, Site Reliability Engineer ( SRE ), Systems Engineer, DevOps Engineer, Network Engineer, or similar role), ... but ultimately less so than your demonstrated abilities and attitude. We sail into uncharted waters every day at Meta in Production Engineering, and we are always learning. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services… more
    Meta (10/18/24)
    - Save Job - Related Jobs - Block Source
  • Staff DevOps Engineer

    Zoom (San Jose, CA)
    …the globe Develops tools to automate and improve production support tasks; SRE monitoring for critical systems Implement advanced DevOps technologies and best ... practices to continue refine our global deployment infrastructure with goals to improve deployment velocity, reliability Collaborate with other functional teams: R&D, Security, Data team to ensure production services reliability and availability Provide… more
    Zoom (11/15/24)
    - Save Job - Related Jobs - Block Source
  • Realtime DevOps Engineer

    Zoom (San Jose, CA)
    …What we're looking for + 5+ years in a Site Reliability Engineer ( SRE ) or DevOps Engineer role + Experience with containerization and orchestration tools (eg ... Docker, Kubernetes) + Have proficiency with Python and Shell scripting, and configuration management tools (eg Ansible) + Have experience installing, configuring, and monitoring Linux systems in physical data centers + Experience with CI/CD pipelines and… more
    Zoom (11/13/24)
    - Save Job - Related Jobs - Block Source
  • Realtime DevOps Engineer

    Zoom (San Jose, CA)
    …What we're looking for + 5+ years in a Site Reliability Engineer ( SRE ) or DevOps Engineer role + Experience with containerization and orchestration tools (eg ... Docker, Kubernetes) + Have proficiency with Python and Shell scripting, and configuration management tools (eg Ansible) + Have experience installing, configuring, and monitoring Linux systems in physical data centers + Experience with CI/CD pipelines and… more
    Zoom (11/09/24)
    - Save Job - Related Jobs - Block Source
  • Azure DevOps Engineer

    EPAM Systems (San Jose, CA)
    …platform architecture and design standards complement the support model + Works with the SRE team on the support of the platform **Requirements** + 5+ years of ... experience in DevOps with an enterprise-level cloud environment + 5+ years of experience with Azure Cloud Services + Advanced knowledge of PowerShell scripting and its application within the Azure framework + In-depth understanding of Azure PowerShell modules… more
    EPAM Systems (10/31/24)
    - Save Job - Related Jobs - Block Source
  • Sr Software Engineer

    Cisco (San Jose, CA)
    …close on 10/04/24 What You'll Do * Design, develop, and implement high quality SRE applications and tools for Cisco SDWAN management layer. * Collaborate with team ... members to determine the root cause of problems and the best course of action to resolve problems and avoid them in the future. * Maintain cloud-hosted development, staging, and production environments and their monitoring and alerting systems. * Constantly… more
    Cisco (10/28/24)
    - Save Job - Related Jobs - Block Source
  • Cloud Engagement Lead - Remote

    EPAM Systems (San Jose, CA)
    …Cloud native development, Cloud managed services, Cloud Governance, CCoE, DevOps, and SRE + Executive relationships skills + Strategic Sales Pursuit Management and ... multi-team orchestration experience + Direct or indirect sales experience with aggressive revenue targets + Ability to understand the 'big picture' and convert it into an actionable plan + Experience working with Fortune 1000 customers **We offer** + Medical,… more
    EPAM Systems (10/22/24)
    - Save Job - Related Jobs - Block Source
  • SaaS Controller Senior Software Development…

    Cisco (San Jose, CA)
    …products in public cloud infrastructure such as AWS, Azure, or GCP * Work with SRE and operation teams on SaaS offering products * Ability to lead and mentor ... engineers globally, champion creative technical trends, drive decisions collaboratively, resolve conflicts, and ensure diligent follow-through * Experience collaborating with senior engineers, product management, senior management, and other multi-functional… more
    Cisco (09/19/24)
    - Save Job - Related Jobs - Block Source
  • SaaS Controller - Senior Software Development…

    Cisco (San Jose, CA)
    …a public cloud infrastructure such as AWS, Azure, or GCP. * Work with SRE and operation teams on SaaS products. * Drive decisions collaboratively, resolve conflicts ... and ensure follow through with diligence. Why Cisco? #WeAreCisco. We are all unique, but collectively we bring our talents to work as a team, to develop innovative technology and power a more inclusive, digital future for everyone. How do we do it? Well, for… more
    Cisco (09/19/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Staff Technical Program Manager

    LinkedIn (Mountain View, CA)
    …and AI platform infrastructure space that span across various platforms, infrastructure, SRE , and product domains, covering both software and hardware aspects. *Lead ... the technical programs through the full lifecycle process from initiating and planning, to execution, monitoring, and landing impact. *Define, scope, and track ancillary operations projects including capacity planning, hardware procurement, and deployment.… more
    LinkedIn (10/25/24)
    - Save Job - Related Jobs - Block Source