- LinkedIn (Mountain View, CA)
- …you will serve as a key technical leader and role model within the Infra SRE organization, driving the development of scalable and reliable infrastructure ... platform. The organization encompasses Site Health and Operations, Embedded Infrastructure SRE (supporting Data Infrastructure and Data/ AI Platform partners),… more
- LinkedIn (Mountain View, CA)
- …Serving performance optimizations across billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will play ... billions of parameters models and large scale feature engineering infra for all AI use cases from...complex models across LLM and Personalization models. As an engineer , you will build compute efficient infra … more
- NVIDIA (Santa Clara, CA)
- We are seeking a AI Infrastructure Engineer to integrate third-party infrastructure partners into NVIDIA's operational excellence programs. This cross-functional ... of operational excellence best practices across all infrastructure providers, partnering with SRE , infra , product, and security teams + Define and operationalize… more
- The Walt Disney Company (Glendale, CA)
- …entertainment content, across all media platforms. **Job Summary:** We are hiring a Lead Engineer to establish and guide our AI Operations and Tooling practice, ... **Leadership & Cross-Team Collaboration** + Mentor engineers in DevOps, infra , and AI operations. + Drive adoption...evaluation frameworks (LangSmith, PromptLayer, etc.). + Prior work in AI operations, SRE , or ML platform DevOps… more
- DoorDash (San Francisco, CA)
- …can design scalable, resilient services that are easy to operate. + Cloud & Infra Expertise or even SRE : You're comfortable with AWS primitives, security best ... remediate production systems. About the Role As a Software Engineer on Reliability Platforms, you'll help design and build...+ Safely roll out configuration changes both service and infra with progressive delivery + See at a glance… more
- Microsoft Corporation (Redmond, WA)
- …everyone can realize its benefits. We're looking for an experienced **Site Reliability Engineer ( SRE )** to join our infrastructure team. In this role, you'll ... **Overview** As Microsoft continues to push the boundaries of AI , we are on the lookout for passionate individuals to work with us on the most interesting and… more
- Coinbase (Charlotte, NC)
- …on developer velocity and platform reliability. * Collaborate cross-functionally with product, infra , SRE , and compliance teams to deliver secure, observable, ... all Coinbase accounts, users and organizations for businesses. The engineer will be tasked with making Coinbase APIs and...our use and processing of your data as required. * AI Disclosure* For select roles, Coinbase is piloting an… more
- MetLife (Cary, NC)
- …or automation programs within large, regulated, global enterprises. * Exposure to SRE , MLOps, or AI -driven operational analytics. * Certifications in relevant ... (Key Responsibilities) * Architect end-to-end protection patterns across app, data, infra , and security layers. * Design early-detection logic for drift, latency… more