- Regions Bank (Charlotte, NC)
- …logging into the careers section of the system. **Job Description:** At Regions, the Site Reliability Engineer is responsible for ensuring the dependability ... Solid familiarity with Splunk, Elastic, OpenSearch, Prometheus, Grafana + Implementing Site Reliability Engineering (SRE) principles SLO/SLI + Experience…
- Saildrone (Alameda, CA)
- …generations. The Role We are seeking a talented Staff Site Reliability Engineer with a strong focus on observability and mentorship to join our dynamic ... tools and practices will play a crucial role in scaling up Saildrone's Site Reliability Engineering team, helping to ensure the quality of service that…
- American Express (Phoenix, AZ)
- …place in technology on #TeamAmex. We are seeking a highly skilled Senior Engineer specializing in Cloud Observability and Monitoring. The ideal candidate will ... Stack. You will play a crucial role in ensuring the performance, reliability, and scalability of our cloud infrastructure by developing and implementing effective…
- Amazon (Seattle, WA)
- Description CloudWatch is seeking a talented Software Development Engineer to join our team building the next generation of monitoring, observability and ... build and operate massively scaled and highly available systems to provide observability into AWS services, applications and infrastructure to millions of customers…
- Amazon (New York, NY)
- …one! We are looking for a passionate, results-oriented, inventive Software Development Engineer to help a driven, friendly and supportive infrastructure team build ... the data engineering, analytics and intelligence infrastructure necessary to drive observability for all Amazon retail traffic. The ideal candidate will understand…
- Amazon (Seattle, WA)
- …hundreds of peta-bytes of logs every month. We are building monitoring and observability services that are shaping the future for the next generation of Insights ... a better-rounded professional. Key job responsibilities As a Software Development Engineer, you will deliver working features spanning the full software lifecycle…
- Truist (Charlotte, NC)
- …detect and resolve issues in cloud services. o Work closely with SRE (Site Reliability Engineering) teams to set up alerting mechanisms and dashboards ... with us, you can log in to check status.** Need Help? (https://www.brainshark.com/bbandt/careers-site-faq) _If you have a disability and need assistance with the…
- Amazon (Bellevue, WA)
- Description As a Software Engineer on the team you will own significant portions of the product and will have influence on our strategy by helping define the next ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience…
- Amazon (Seattle, WA)
- …projects and tasks based on what will help each team member into a better-rounded engineer and enable them to take on more complex tasks in the future. Inclusive ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience…
- Capital One (San Francisco, CA)
- …(61049), United States of America, San Francisco, California Sr. Product Manager Observability Tools Platform Capital One is a high-tech company, a scientific ... in this space, we are building a world class Cloud Infrastructure, Reliability, Recovery and AI/ML data platform for deploying data-driven, resilient, customer…
- Amazon (Seattle, WA)
- …experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site…
- Amazon (Seattle, WA)
- …Dashboards for complete operational visibility. We are building new monitoring and observability services and are shaping the future for the next generation of ... Technical Problems We are looking for a software development engineer who thrives on creative problem solving and enjoys...our Customer's requests and delight them improving latency and reliability of our services. For instance, our team shipped…
- Bank of America (Chicago, IL)
- Site Reliability Engineer Jersey...for partnering with engineering and technology teams to improve reliability and observability for the services it ... Job expectations include supporting code enhancements to automate services and improve reliability and observability while expanding knowledge to identify gaps…
- Microsoft Corporation (Elkridge, MD)
- Microsoft has an exciting opportunity for a **Senior Site Reliability Engineer** in the Cloud+Artificial Intelligence (C+AI) Silver SQL Team. This team is ... Ownership for end-to-end project lifecycle with solid project management and communication skills. Site Reliability Engineering IC4 - The typical base pay range…
- Bank of America (Chandler, AZ)
- Cloud Senior Site Reliability Engineer Chandler, Arizona;Atlanta, Georgia; Richmond, Virginia; Jersey City, New Jersey; Pennington, New Jersey; Jacksonville, ... solutions to visualize key production support metrics enabling Operational Readiness and Site Reliability Engineer teams to identify scenarios requiring…
- Cisco (Research Triangle Park, NC)
- …and reliability. Key Responsibilities: * Design, build, and maintain observability systems for leading NVIDIA DGX clusters, ensuring flawless monitoring of AI ... We are seeking a highly skilled and experienced Senior Engineer to join our team, focusing on the design...services and capabilities tailored to IT's GPU-based AI Clusters observability. This role involves reshaping how we lead alerts,…
- Mastercard (O'Fallon, MO)
- …drives innovation and delivers better business results. **Title and Summary** Lead Site Reliability Engineer Enterprise Data Accessibility BizOps team ... is looking for a Site Reliability Engineer who can...and Dynatrace * Experience with performing data analysis, data observability, data ingestion and data integration. * 5+ DevOps,…
- Mastercard (Tampa, FL)
- …better decisions, drives innovation and delivers better business results. **Title and Summary** Lead Site Reliability Engineer Overview This is a Lead Cloud ... Operations Engineer role (Lead SRE Engineer role) that will manage Cloud Platforms in Mastercard...work towards strengthening Mastercard's Cloud Platforms and Automate Deployment, Observability and other tasks. Role: The L3 Cloud Operations…
- Palo Alto Networks (Santa Clara, CA)
- …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 5+ years of experience ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides,…
- Medtronic (Northridge, CA)
- …more connected, compassionate world. **A Day in the Life** Medtronic is hiring a Principal Site Reliability Engineer (SRE). The Principal Site ... Terraform, or CloudFormation. + Experience with monitoring, logging, and observability tools such as Prometheus, Grafana, ELK Stack, or...operations roles, with at least 5 years in a Site Reliability Engineering or DevOps position. +…