Machine Learning Engineer Inference Jobs

259 jobs (page 1)

Machine Learning Engineer…

Red Hat (Raleigh, NC)

…provides a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the ... LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings...solving challenging technical problems at the forefront of deep learning in the open source way, this is the… more

Red Hat (12/31/25)
Senior Principal Machine Learning…

Red Hat (Boston, MA)

…on Github. As a Machine Learning ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...challenges in model performance and efficiency. Your work with machine learning and high performance computing will… more

Red Hat (01/08/26)
Senior Principal Machine Learning…

Red Hat (Boston, MA)

…for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...and guide fellow engineers, fostering a culture of continuous learning and innovation. **What you will bring** + Strong… more

Red Hat (01/08/26)
Machine Learning Engineer…

Amazon (Seattle, WA)

…and the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the... Technology team works side by side with the Inference Model Enablement, compiler runtime engineers to create, build… more

Amazon (12/24/25)
Machine Learning Engineer…

Amazon (Seattle, WA)

…and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trn1. Experience optimizing inference performance for… more

Amazon (12/13/25)
Principal Machine Learning Platform…

Palo Alto Networks (Santa Clara, CA)

…while ensuring a formidable security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as ... and long-term strategy of our AI platform - ML inference. Beyond individual contribution, you will lead complex technical...a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. + Expert-level… more

Palo Alto Networks (11/18/25)
Software Engineer -AI/ML, AWS Neuron…

Amazon (Seattle, WA)

…accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development ... at least one software programming language - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work… more

Amazon (12/21/25)
Software Development Engineer - AI/ML, AWS…

Amazon (Seattle, WA)

…scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more

Amazon (12/31/25)
Senior Software Development Engineer…

Amazon (Seattle, WA)

…scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more

Amazon (01/06/26)
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…out from the crowd: + Contributions to PyTorch, JAX, vLLM, SGLang, or other machine learning training and inference frameworks. + Hands-on experience ... strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like… more

NVIDIA (01/10/26)
Staff ML Engineer, Inference…

General Motors (Sunnyvale, CA)

…use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference, with a focus on ... eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms...+ 8+ years of industry experience, with focus on machine learning systems or high performance backend… more

General Motors (10/21/25)
AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and ... conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique… more

quadric.io, Inc (11/25/25)
Lead AI Engineer (FM Hosting, LLM…

Capital One (San Francisco, CA)

…for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in ... Lead AI Engineer (FM Hosting, LLM Inference) **Overview**...world-class talent - along with our deep experience in machine learning - position us to be… more

Capital One (11/04/25)
Software Development Engineer AI/ML,…

Amazon (Cupertino, CA)

…is the software stack powering AWS Inferentia and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. ... The Neuron Serving team develops infrastructure to serve modern machine learning models-including large language models (LLMs) and multimodal workloads-reliably… more

Amazon (12/21/25)
Senior Deep Learning Inference…

NVIDIA (Durham, NC)

We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer who ... to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does...years or relevant experience + Strong mathematical foundation in machine learning and deep learning … more

NVIDIA (01/10/26)
Senior Software Engineer - vLLM…

Red Hat (Boston, MA)

…you will do + Collaborate with research and product development teams to scale machine learning products for internal and external applications + Create and ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...learning products and software. As an ML Ops engineer, you will work closely with our technical and… more

Red Hat (12/06/25)
Senior Software Engineer, Machine…

Google (Sunnyvale, CA)

Senior Software Engineer, Machine Learning, Kernel...+ 3 years of experience in software development for machine learning model inference or ... training, and 1 year of experience with ML model inference and training optimization on modern GPU/TPU architectures. **Preferred...Cloud is searching for a highly skilled and motivated engineer to optimize machine learning… more

Google (12/27/25)
Senior, Software Engineer (Machine…

Walmart (Sunnyvale, CA)

**Position Summary** **What you'll do** As a **Senior Software Engineer - Machine Learning**, you are a technical leader working at the intersection of ... machine learning and software engineering. You have...scalable model serving platforms for both batch and real-time inference + Build model deployment pipelines with automated testing… more

Walmart (01/09/26)
Senior AI Machine Learning Research…

CACI International (Florham Park, NJ)

Senior AI Machine Learning Research Engineer Job Category: Engineering Time Type: Full time Minimum Clearance Required to Start: None Employee Type: Regular ... in Florham Park, NJ for a Senior AI and Machine Learning Research Engineer. Apply...and experience with 3D edge object identification, tracking, and inference. + Experience with computer vision frameworks, software &… more

CACI International (11/12/25)
Artificial Intelligence & Machine…

Honeywell (San Jose, CA)

We're seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect, design, and develop advanced AI/ML systems that ... with cross-functional teams to deliver intelligent, scalable, and production-ready AI and machine learning technologies. You will be responsible for researching,… more

Honeywell (01/07/26)