- Charles Schwab (San Francisco, CA)
- …scale and explore next-generation GenAI efforts that will redefine how we serve our clients. As a Senior AI Site Reliability Engineer on AI.x, you ... your career in one of the most exciting areas of technology today. As a Senior AI Site Reliability Engineer, you will design, implement, and manage the… more
- Google (Sunnyvale, CA)
- Principal Site Reliability Engineer, Cloud AI (Google; Sunnyvale, CA, USA; Director) **Minimum qualifications:** + ... job** We are seeking a Principal Engineer for Cloud AI Reliability, Resiliency, and Scalability. This technical...we support. You will serve as a counterpart to senior leaders and domain experts, advising on the architectural… more
- ServiceNow, Inc. (San Diego, CA)
- …to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our ... highly technical engineers who are tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow infrastructure. The SRE is… more
- NVIDIA (Santa Clara, CA)
- …take great pride in providing excellent, comprehensive support to our customers! Sr Site Reliability Engineer in this role will significantly impact and ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...or related field. + 8+ years of experience in site reliability engineering and/or software development roles.… more
- MongoDB (San Francisco, CA)
- …remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to join the ... at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB's… more
- MongoDB (San Francisco, CA)
- We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE, you ... on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt… more
- Zscaler (San Jose, CA)
- …databases (e.g., ClickHouse, Redis) + Knowledge of MLOps and Generative AI applications within SRE environments. Zscaler's salary ranges are ... benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors,… more
- Charles Schwab (San Francisco, CA)
- …the next era of AI at Schwab, with a special emphasis on site reliability, monitoring, observability, and operations. You'll ensure that the systems you ... explores next-generation GenAI initiatives that redefine client experiences. We're seeking a Senior AI Engineer who will design and deliver cutting-edge GenAI… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …on-premise GPU clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI ... technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is...Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in… more
- Coinbase (Sacramento, CA)
- …science leader of complex initiatives, setting technical direction across teams, mentoring senior engineers, and ensuring that our AI/ML solutions are robust, ... **Architect Scalable Models & Systems:** Architect and build production-grade AI/ML models & pipelines that enable low-latency, high-reliability predictions.… more
- Cisco (San Diego, CA)
- …on automating the code generation process using GenAI techniques. We combine deep AI research expertise with the scale and operational excellence of Splunk and ... customer experience, designing and deploying foundation models that enhance reliability, strengthen security, prevent downtime, and deliver predictive insights… more
- Google (Sunnyvale, CA)
- Senior Staff Program Manager, AI Infrastructure Investment Planning (Google; Sunnyvale, CA, USA) **Advanced** Experience owning outcomes ... We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and… more
- NVIDIA (Santa Clara, CA)
- …by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts ... as the brains of computers, generative AI, robots, and self-driving cars that can understand the...DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability,… more
- Amazon (San Francisco, CA)
- Description The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. ... with customers across industries to fine-tune and deploy customized generative AI applications at scale. Additionally, we work closely with foundational model… more
- Amazon (Cupertino, CA)
- …hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across ... and distributed architectures, where you'll help shape the future of AI acceleration technology You will architect and implement business critical features,… more
- Cisco (San Diego, CA)
- …Foundational Modeling team at Splunk, where we advance the state of AI for high‑volume, real‑time, multi‑modal machine‑generated data - including logs, time series, ... traces, and events. We combine deep AI research expertise with the scale and operational excellence of Splunk and Cisco's global engineering capabilities. Our work… more
- Amazon (Cupertino, CA)
- … AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that designs cutting… more
- Amazon (Cupertino, CA)
- …we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... experience - 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Fundamentals of… more