- McAfee, Inc. (San Jose, CA)
- **_Job Title:_** Site Reliability Engineer **_Role Overview:_** We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team, with a ... This role is ideal for someone passionate about toolchain reliability , automation, and the integration of AI -driven...and incidents. **About You:** + 3+ years' experience in site reliability engineering, DevOps, or a related… more
- Zscaler (San Jose, CA)
- …you will be responsible for: + Designing and maintaining scalable infrastructure solutions to support Zscaler's global cloud services + Enhancing observability ... practices across infrastructure and applications through monitoring, logging, tracing, and automated...(eg, Clickhouse, Redis) + Knowledge of MLOps and Generative AI applications within SRE environments \#LI-Hybrid \#LI-CM3 Zscaler's salary… more
- Memorial Sloan-Kettering Cancer Center (New York, NY)
- …AI development with healthcare delivery. + Drive the adoption of scalable AI solutions that meet clinical performance and reliability standards. **Key ... and transformer-based models tailored for pathology applications. + Fine-tune large-scale AI and foundation models for clinical use cases. + Optimize self-supervised… more
- CyrusOne (Chandler, AZ)
- …Provides duties with constant awareness of the need to preserve the reliability of the critical load. Also provides operational support for building systems ... work orders, and Reactive Work tickets. + Use SOPS/MOPS/EOP's/MMRs as appropriate for site operations and maintenance. + Act as "Smart Hands and Feet" for client… more
- CyrusOne (Dallas, TX)
- …Maintain proactive communication and transparency with customer leadership to reinforce reliability , responsiveness, and partnership trust. + Conduct QBRs to present ... KPI insights to identify trends, risks, and opportunities to enhance reliability , responsiveness, and operational excellence. **Qualifications:** + 10+ years of… more
- Micron Technology, Inc. (Boise, ID)
- …and infrastructure . + Analyze system performance data to improve reliability , efficiency, and cost savings. + Troubleshoot complex chemical system issues and ... environment alongside other engineering disciplines to design, maintain, and optimize site Facilities systems. Your efforts will ensure safe, reliable, and efficient… more
- CyrusOne (Allen, TX)
- …understanding of how to optimize the performance of critical mechanical infrastructure . **Responsibilities:** + Provide guidance and oversight of the operation and ... preventive maintenance of CDUs to ensure consistent cooling distribution and system reliability . + Provide guidance and plans for managing and optimizing the cooling… more
- Micron Technology, Inc. (Boise, ID)
- …Technical Field Domain Expert, providing consultation and support to global Site Facilities and Manufacturing teams. + Lead cross-functional collaboration with ... and best known methods (BKMs). + Support large-scale Facilities Infrastructure Expansion projects by reviewing design packages, ensuring compliance with… more
- Micron Technology, Inc. (Boise, ID)
- …systems. This role highlights Global Standardization and Alignment, Innovation, Reliability and Performance, and Sustainability. Your duties encompass Semiconductor ... domain expert, providing guidance and support to global, regional, and site facilities and manufacturing teams. + Collaborate with internal teams (Technology… more
- ABBTECH Professional Resources, Inc. (Washington, DC)
- …Architect** **Public Trust** DevOps Architect Washington, DC (5 days a week on- site ) Public Trust Key Responsibilities: -Define and implement the DevOps architecture ... environments, including staging and production. -Promote and drive the use of Infrastructure as Code (IaC) to automate infrastructure provisioning and management… more
- Verdantas (Wilmington, DE)
- …sectors like water resources, resilient land use, energy transformation, and civil infrastructure . Our commitment to excellence, across more than 90 offices, is ... with clients to deliver smart, data-driven solutions to complex environmental and infrastructure challenges. We don't just solve problems; we help shape a more… more
- Entrust (Dallas, TX)
- …performance. **Responsibilities:** + Design, implement, and support a fully on-premises infrastructure using VMware, NetApp, and modern automation tools to manage a ... highly distributed microservices environment. + Expert in immutable infrastructure using IaC tools (eg, Terraform, Ansible) to automate and manage secure,… more
- Entrust (Shakopee, MN)
- …(comprising Ops/SRE and DevOps functions) and work closely with the Infrastructure , DBA, Implementation, Support, R&D, Compliance, Security and Product Management ... quality of IFIaaS services to ensure the service meets reliability , scalability, and customer satisfaction targets. Day to day...is all about balance. Whether you're remote, hybrid, or on- site , we offer flexible options that fit your lifestyle.?… more
- Sallie Mae (Sterling, VA)
- …services across both on-premises and cloud-based environments, ensuring high reliability and performance. + Execute modernization initiatives that transition ... best practices, including Zero Trust principles. + Collaborate closely with infrastructure , application development, security, and business stakeholders to align on… more
- Tradeweb (Jersey City, NJ)
- …Experience managing time series data + Understands and able to apply site reliability engineering principles + Financial Services experience **Additional ... and run Tradeweb's data platform using such technologies as public cloud infrastructure (AWS and GCP), Kafka, databases and containers + Develop Tradeweb's data… more
- Cisco (Research Triangle Park, NC)
- …that solve real-world problems and shape the future of technology. Your Impact As an AI Site Reliability Engineer, you will: - Leverage SRE practices to ... Remote withing USA Meet the Team At Cisco, the AI Infrastructure Services team is at the...related technologies. - Previous experience as a compute or site /systems reliability engineer. - Experience with hybrid… more
- Deloitte (San Diego, CA)
- …time in the future. Preferred: + Experience in edge and AI infrastructure operations, including monitoring, site reliability engineering. + OT relevant ... scalable, secure, reliable, and optimized for OT. + Oversee edge and AI infrastructure operations, including monitoring, reliability engineering, and AI… more
- Walmart (Sunnyvale, CA)
- …necessities **Qualifications** * 16+ years of experience in Site Reliability Engineering, Production Engineering, and Infrastructure Reliability , with ... area and7 years' experience in site reliability engineering, site and system administration, infrastructure management, or related area.Option 2: 9… more
- CVS Health (Scottsdale, AZ)
- …of reliability and continuous improvement. + Drive innovation in AI infrastructure , optimizing for GPU/TPU resource management, distributed training, and ... & Consumer Wellness) SRE team is seeking a Staff Site Reliability Engineer (SRE) to lead the...AI , LLMs, and multi-agent systems. + Familiarity with AI infrastructure optimization, such as GPU/TPU resource… more
- MongoDB (Chicago, IL)
- …Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure . As a Staff SRE, you will be very hands-on technically while also ... closely with other engineering teams to ensure that our infrastructure adheres to the highest security standards. They build...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt… more