- NVIDIA (Santa Clara, CA)
- …take great pride in providing excellent, comprehensive support to our customers! Sr Site Reliability Engineer in this role will significantly impact and ... experience in Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python…
- Insight Global (Blue Ash, OH)
- Job Description An employer is looking for a Senior SRE in the Cincinnati area for an onsite role to support and enhance our on‑premises infrastructure ecosystem. ... clusters using Rancher, automating configuration with Ansible, and ensuring reliability across self‑managed environments. You'll also provide technical leadership…
- KBR (Beavercreek, OH)
- Title: Senior Platform Engineer/Kubernetes SME Belong. Connect. Grow. with KBR! KBR's National Security Solutions team provides high-end engineering and advanced ... the future of space defense. Role summary KBR is seeking a highly experienced Senior Platform Engineer to join our team in Beavercreek, OH. The ideal candidate will…
- Realtor (Austin, TX)
- …and building confidence through expert guidance. **About the Role** We are seeking a Senior Site Reliability Engineer to join our newly formed Operations ... our platform infrastructure serving millions of users. As a Senior SRE, you will be a strong technical contributor...You'll Bring** **Experience & Expertise** + 5+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering…
- Cadence Design Systems, Inc. (San Jose, CA)
- …experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in leading the development, operations, ... solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI...services on both GCP and Azure. + Hands-on GPU Cluster Management: Take a leadership role in the configuration,…
- Applied Research Solutions (Washington, DC)
- …in a government environment. This role combines Tier 1 and Tier 2 Site Reliability Engineering (SRE) support responsibilities with deep technical expertise in ... **Description** We are seeking a Senior Operations & Maintenance Support Specialist to provide...and containerized application support + SRE Practices: Understanding of Site Reliability Engineering principles including monitoring, incident…
- Mastercard (O'Fallon, MO)
- …everything you can? The Business Operations (BizOps) team is seeking a Business Operations Site Reliability Engineer (SRE). The role of BizOps is to be the ... alerting strategy and create the framework to achieve zero downtime during deployment. Site Reliability Engineering: * Serve as the primary contact responsible…
- Google (Kirkland, WA)
- …the ability to build consensus across organizational boundaries. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering ... Senior Staff Software Engineer, SRE, ML Fleet Systems...Understanding of resource management systems (e.g., Borg, Kubernetes, Flex), cluster management, and scheduling algorithms. + Familiarity with Machine…
- NetApp (Morrisville, NC)
- …excellent operational knowledge in managing Opensearch clusters, look no further!! As a Site Reliability Engineer (Opensearch), you are in the frontline team ... **Job Summary** NetApp is looking for a Senior TechOps Engineer to join our growing Instaclustr team in North Carolina, US. NetApp's Instaclustr offering provides…
- Mastercard (O'Fallon, MO)
- …people, businesses and governments realize their greatest potential._ **Title and Summary** Senior Enterprise Cloud Operations Engineer Job Summary We are seeking an ... experienced Senior Cloud Engineer to join our Cloud Operations team....environments. o Optimize cloud infrastructure for performance, cost, and reliability. o Implement and manage Infrastructure as Code (IaC)…
- Wolters Kluwer (Houston, TX)
- …are fully automated (IaC, GitOps, policy-as-code, CI/CD). + Establish site reliability practices-SLO/SLI, error budgets, incident management, post-incident ... Director in Software Engineering, you will provide comprehensive leadership to senior managers and high-level professionals. You will have primary responsibility for…
- CVS Health (Woonsocket, RI)
- …ensuring seamless workload management, automation at scale, and platform reliability across VMware vSphere, Kubernetes, OpenShift, and enterprise storage systems. ... services. + Foster an engineering culture that values infrastructure reliability, operational excellence, and continuous improvement in data center operations.…
- Amazon (Seattle, WA)
- …our existing data center portfolio, ensuring maximum capacity, reliability, and cost-effectiveness. Proactively identifying opportunities to remodel buildings ... projects or actions needed are aligned to the overall site strategy. The remodel program will ensure that AWS...lack of affordable land or sufficient power within a cluster. The ideal candidate will be an exceptionally strong…
- RTX Corporation (Tucson, AZ)
- …create end-to-end solutions to solve complex DT infrastructure problems. As a **Senior Principal level System Administrator,** this role will require a seasoned ... to solve complex infrastructure problems. + Assure a high level of reliability and performance through systems monitoring and infrastructure management. A deep…
- DXC Technology (New Orleans, LA)
- …of Dynatrace solutions. This role focuses on ensuring optimal performance and reliability of Dynatrace installations and resolving any issues that arise. The ideal ... implement solutions to prevent future occurrences. + Escalate complex issues to senior support staff or Dynatrace vendor support as needed. **Configuration and…
- Cisco (Milpitas, CA)
- …Impact** You will work on Cisco HyperFabric, an innovative cloud-operated AI cluster solution. Cisco HyperFabric aims to provide customers with a unified compute ... + Lead a focused and collaborative team environment by empowering senior team members to operate independently while fostering strong cross-functional teamwork…
- Artera Technologies (Annapolis Junction, MD)
- …CI/CD principles, methodologies, and tools such as GitLab CI + Familiarity with Site Reliability Engineering (SRE) principles and applications + Experience with ... DEVOPS ENGINEERS Job Type: Full Time Level: Mid, Senior, Principal Location: Maryland (Annapolis Junction / Fort...Linux cluster/node monitoring tools + Experience with the Atlassian Tool…