Inference Engineer Scalable AI Jobs

209 jobs (page 1)

Categories

All Categories

Engineering (69)

Software/IT (50)

Senior Software Engineer , AI…

NVIDIA (CA)

…distributed model management systems, including Rust-based runtime components, for large-scale AI inference workloads. + Implement inference scheduling ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...+ Collaborate with infrastructure engineers and researchers to develop scalable APIs, services, and end-to-end inference workflows.… more

NVIDIA (11/29/25)
- Save Job - Related Jobs - Block Source
Lead AI Engineer (FM Hosting, LLM…

Capital One (San Francisco, CA)

Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... upon number of hours to be regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700 for Lead AI Engineer… more

Capital One (11/04/25)
- Save Job - Related Jobs - Block Source
Software Engineer - AI…

Argonne National Laboratory (Lemont, IL)

… working in the space of enabling AI for science, specifically targeting scalable inference leveraging HPC systems and AI accelerators. The successful ... In this position, the candidate can expect to explore and engineer solutions for AI inference integrated within scientific workflows, via programmatic access… more

Argonne National Laboratory (01/06/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI…

Amazon (Cupertino, CA)

…optimization and system reliability across the Neuron ecosystem * Design and implement scalable solutions for both offline and online inference workloads * Lead ... stack About the team The Neuron Serving team is at the forefront of scalable and resilient AI infrastructure at AWS. We focus on developing model-agnostic … more

Amazon (12/21/25)
- Save Job - Related Jobs - Block Source
Senior Engineer - AI…

Bank of America (Addison, TX)

Senior Engineer - AI Inference Addison, Texas;Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI - Inference \_25029879) **Job Description:** At Bank… more

Bank of America (12/22/25)
- Save Job - Related Jobs - Block Source
Lead Engineer , Inference Platform

MongoDB (Palo Alto, CA)

We're looking for a Lead Engineer , Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... retrieval, and AI -native features across MongoDB Atlas. This role is part...Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform, you'll be hands-on with… more

MongoDB (12/27/25)
- Save Job - Related Jobs - Block Source
Senior Principal Machine Learning Engineer…

Red Hat (Boston, MA)

…will collaborate with our team to tackle the most pressing challenges in scalable inference systems and Kubernetes-native deployments. Your work with distributed ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise... infrastructure leveraging Kubernetes APIs, operators, and the Gateway Inference Extension API for scalable LLM deployments.… more

Red Hat (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Inference…

MongoDB (Palo Alto, CA)

…with ML researchers from Voyage. ai to bring novel ideas into scalable systems + Tackle challenging problems in inference , observability, and distributed ... **About the Role** We're looking for a Senior Engineer to help build the next-generation inference...supporting semantic search and hybrid retrieval + Collaborate with AI engineers and researchers to productionize inference … more

MongoDB (01/08/26)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... What you'll be doing: + Design and build modular, scalable model optimization software platforms that deliver exceptional user...NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior System Software Engineer - Dynamo…

NVIDIA (CA)

…to stand out from the crowd: + Prior work experience improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. ... We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference...role, you will develop open source software to serve inference of trained AI models running on… more

NVIDIA (01/08/26)
- Save Job - Related Jobs - Block Source
Solutions Architect, Inference Deployments

NVIDIA (Santa Clara, CA)

We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions ... Solutions Architecture with a proven track record of moving AI inference from POC to production on...AWS Lambda, NVCF) with NVIDIA GPUs. + NVIDIA Certified AI Engineer or similar. + Active contributions… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
AI / ML Engineer

Guidehouse (Huntsville, AL)

…solutions that leverage large language models (LLMs), retrieval systems, and secure, scalable inference pipelines to enable data-driven decision-making. You will ... Active Top Secret (TS) Guidehouse is seeking a Lead AI /ML Engineer to join our Technology /...the FBI adjudication platform. Drives design and implementation of inference pipelines, RAG workflows, retrieval systems, prompt architectures, and… more

Guidehouse (01/01/26)
- Save Job - Related Jobs - Block Source
Sr. Software Engineer - AI /ML, AWS…

Amazon (Seattle, WA)

…and Llama, and collaborate directly with silicon architects to influence the future of AI hardware. Our systems handle millions of inference calls daily, while ... accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll be...deep technical experts transform complex ML challenges into elegant, scalable solutions that define how AI workloads… more

Amazon (10/31/25)
- Save Job - Related Jobs - Block Source
Sr Principal AI Software Engineer…

Oracle (San Juan, PR)

**Job Description** The Senior Principal AI /ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI ... AI infrastructure offerings, spearheads the design and implementation of scalable orchestration for AI /ML workloads-incorporating the latest research in… more

Oracle (11/25/25)
- Save Job - Related Jobs - Block Source
Summer Intern - AI /ML Engineer…

General Motors (Sunnyvale, CA)

…such as Embodied AI , Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of ML engineers, and drive ... the adoption of cutting-edge ML techniques. Our AV ML infrastructure includes: ** AI Validation & Inference ** : Ensures robust model performance by running… more

General Motors (12/13/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer - OCI AI…

Oracle (Frankfort, KY)

…serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to agent-based AI systems or tool-based inference workflows. - Knowledge of ... future of cloud computing-designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of… more

Oracle (12/20/25)
- Save Job - Related Jobs - Block Source
Lead, AI Data Engineer

Mastercard (O'Fallon, MO)

…mining, cleaning, processing, and validation. -Contribute to the development of scalable AI infrastructure and reusable components for enterprise-wide use. ... businesses and governments realize their greatest potential._ **Title and Summary** Lead, AI Data Engineer Mastercard Security Services is responsible for… more

Mastercard (01/08/26)
- Save Job - Related Jobs - Block Source
Staff AI /ML Engineer

General Motors (Austin, TX)

**Job Description** **Staff AI /ML Engineer , AV ML Infra** We're General Motors (GM), a company driving the future of mobility with advanced self-driving and ... AI , Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of...of cutting-edge ML techniques. Our ML infrastructure includes: + ** AI Validation & Inference :** Ensures robust model… more

General Motors (10/31/25)
- Save Job - Related Jobs - Block Source
AI and LLM Engineer

SAIC (VA)

**Description** We are seeking a **hands-on AI & LLM Engineer ** to join the AWS AI /GenAI Solutions Team within the IRS Advanced Analytics Program (AAP). This ... Step Functions) in production workflows. + Proven ability to build and deploy ** inference endpoints and APIs** for AI /ML workloads. + Familiarity with **CI/CD… more

SAIC (01/13/26)
- Save Job - Related Jobs - Block Source
Principal AI Engineer , Enterprise…

Palo Alto Networks (Santa Clara, CA)

…is to create an environment where we all win with precision. **Your Career** As a Principal AI Engineer for the Enterprise AI Platform, you will be a pivotal ... assistant, agent, app, supporting both traditional and Generative AI model development, deployment, and real-time inference ...Research: Actively learn and incorporate advances in Agents, Generative AI , LLMs, and scalable AI … more

Palo Alto Networks (12/17/25)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search