AI Inference Engineer Jobs in Berkeley, CA

25 jobs (page 1)

Categories

All Categories

Engineering (5)

AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... of AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Lead AI Engineer (FM Hosting, LLM…

Capital One (San Francisco, CA)

Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700… more

Capital One (11/04/25)
- Save Job - Related Jobs - Block Source
AI Kernel Engineer

quadric.io, Inc (Burlingame, CA)

…GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number ... of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Senior/Principal Software Engineer…

Genentech (South San Francisco, CA)

…scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack engineering, you will be… more

Genentech (12/06/25)
- Save Job - Related Jobs - Block Source
Senior/Principal Software Engineer…

Genentech (South San Francisco, CA)

…scale and optimize workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on backend and data engineering, you will… more

Genentech (12/06/25)
- Save Job - Related Jobs - Block Source
Sr. Software Development Engineer , FAR…

Amazon (San Francisco, CA)

Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers like Pieter ... run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
AI Applications Engineer

quadric.io, Inc (Burlingame, CA)

…both NN graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands-on ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...users in the field. The AI Application Engineer will [1] integrate Quadric… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Senior AI /ML Tooling Engineer

General Motors (San Francisco, CA)

**Job Description** **Senior AI /ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... training, and inference of ML models. You will develop and enhance...toolchain and stack, to leverage the latest advancements in AI + Influence model architecture decisions and strategy within… more

General Motors (11/04/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer , Frontier…

Amazon (San Francisco, CA)

Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production ... scale. As a Software Development Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
SDE- ML Engineer , Frontier AI…

Amazon (San Francisco, CA)

Description We are seeking a highly skilled Machine Learning Systems Engineer to join Frontier AI Robotics team. This role focuses on building and optimizing ... scientists and engineers to deliver scalable, high-performance systems that power state-of-the-art AI research and applications. About the team At Frontier AI … more

Amazon (11/26/25)
- Save Job - Related Jobs - Block Source
Lead AI Platform Engineer

Elevance Health (San Francisco, CA)

**Lead AI Platform Engineer ** **Location:** This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while ... for employment, unless an accommodation is granted as required by law. The **Lead AI Platform Engineer ** will own technical outcomes for core areas of the… more

Elevance Health (11/18/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Machine Learning…

DoorDash (San Francisco, CA)

…systems that empower efficient machine learning at scale, especially in the Generative AI area. This is a remote opportunity, with Pacific Time working hours ... data and ML pipelines-from retrieval (eg, RAG) to batch inference -that adapt quickly to new technologies. + Develop an...fast iteration and deployment of products powered by Generative AI . + Improve the reliability, scalability, and monitoring of… more

DoorDash (10/28/25)
- Save Job - Related Jobs - Block Source
Software Engineer , ML Infrastructure,…

Snap Inc. (San Francisco, CA)

…forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations that make ... reliability and efficiency improvements across Snapchat's ML Infrastructure + Develop high-performance inference systems to ensure fast and efficient AI model… more

Snap Inc. (12/06/25)
- Save Job - Related Jobs - Block Source
Sr Staff R&D Engineer

The Walt Disney Company (Nicasio, CA)

…Skywalker Sound Development Group is seeking a highly accomplished **Sr Staff R&D Engineer ( AI /ML)** to lead the development of transformative audio intelligence ... + Own end-to-end model lifecycle management: pretraining, fine-tuning, validation, inference optimization, and CI/CD integration. + Guide the development of… more

The Walt Disney Company (11/20/25)
- Save Job - Related Jobs - Block Source
Sr ML Ops Engineer

The Walt Disney Company (Nicasio, CA)

The Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our machine learning and AI ... for model training, retraining, and deployment, ensuring that cutting-edge AI solutions operate reliably at scale. As a Sr...operate reliably at scale. As a Sr ML Ops Engineer , you will act as the backbone of our… more

The Walt Disney Company (09/20/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer

DataRobot (San Francisco, CA)

…makes sense for their business - today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be the technical anchor for ... **Job Description:** DataRobot delivers AI that maximizes impact and minimizes business risk. Our platform and applications integrate into core business processes so… more

DataRobot (10/10/25)
- Save Job - Related Jobs - Block Source
Staff Software Engineer

DataRobot (San Francisco, CA)

…GCP, Azure) and/or hybrid environments. **Nice to Have:** + Experience with AI /ML model deployment, serving, inference , and production integration. + Experience ... **Job Description:** DataRobot delivers AI that maximizes impact and minimizes business risk....today and in the future. As a Staff Software Engineer focused on Application Scalability & Performance, you will… more

DataRobot (10/18/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer

Oracle (San Francisco, CA)

…building the core cloud services that power modern enterprise applications. The Generative AI Service team focuses on building and scaling systems that enable Large ... Language Models (LLMs) and agent-based AI applications to run reliably and efficiently in production...area within OCI. Role Summary As a Senior Software Engineer (IC3), you will be responsible for developing and… more

Oracle (11/25/25)
- Save Job - Related Jobs - Block Source
Research Scientist - Generative AI…

Meta (Burlingame, CA)

…developers, and engineers dedicated to advancing AR and VR technologies. Our Core AI group is seeking a Research Scientist to tackle cutting-edge research challenges ... in image restoration using Generative AI methods, such as diffusion and autoregressive approaches. This...designing machine learning algorithms, including experience with training and inference of novel ML architectures 8. Proven experience in… more

Meta (10/30/25)
- Save Job - Related Jobs - Block Source
Staff Full-Stack Software Engineer

ServiceNow, Inc. (San Francisco, CA)

…group of builders, thinkers, and problem-solvers dedicated to delivering scalable, AI -powered software products that elevate how organizations work. We value clean ... architecture, intuitive user experiences, and a culture of continuous improvement. Every engineer here plays a key role in shaping the quality and reliability of our… more

ServiceNow, Inc. (10/31/25)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search