Senior AI ML Inference Jobs

261 jobs (page 1)

Categories

All Categories

Engineering (69)

Software/IT (22)

Management (12)

Senior AI / ML…

Amazon (Seattle, WA)

A leading technology company based in Seattle is seeking a Senior Software Engineer for its Machine Learning Inference Applications team. The role focuses on ... development and optimization for LLM inference on specialized hardware. Ideal candidates will have significant software development experience and an understanding… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI / ML Software…

Amazon (San Francisco, CA)

…Virginia is seeking a Senior Software Development Engineer to work on AI / ML projects. You will design and optimize machine learning models for deployment ... learning principles. This role fosters a collaborative environment emphasizing continuous learning and performance tuning for clients' ML workloads. #J-18808-Ljbffr more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Inference Engineer…

quadric.io, Inc (Burlingame, CA)

A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires ... expertise in AI model algorithms, strong C/C++ and Python skills, and...experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior ML Engineer: On-Device…

Nutanix (San Diego, CA)

A global technology leader in San Diego is seeking an AI Software Engineer to develop on-device software solutions using Python and C/C++. The ideal candidate will ... work in a dynamic environment, collaborating with researchers to advance Gen AI technology. A strong background in software engineering and experience with deep… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer,…

Amazon (San Francisco, CA)

Senior Software Development Engineer, AI / ML , AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer, AI…

Amazon (San Francisco, CA)

Software Development Engineer, AI / ML , AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Software Engineer- AI / ML , AWS…

Amazon (Seattle, WA)

…Inferentia and Trainium cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior ML Inference Engineer…

Comfy (San Francisco, CA)

…platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will ... engage in building efficient AI models and tackling complex challenges. The role requires...limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual generative… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Technical Marketing Engineer…

NVIDIA Corporation (Santa Clara, CA)

Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at ... AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...a consistent, high-impact go-to-market strategy.This role will focus on AI inference at scale, ensuring that customers… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer, AI…

Red Hat (Boston, MA)

…this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...optimize, and scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer, AI…

Red Hat, Inc. (Boston, MA)

…this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...and scale LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Engineer ( AI…

Capital One (Annapolis, MD)

Senior AI Engineer ( AI Foundations,...AI services Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity ... answering their questions in real time, our applications of AI & ML are bringing humanity and...be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 -… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior AI Inference Platform…

Etched.ai, Inc. (San Jose, CA)

A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior GPU ML Infra Architect…

Apple Inc. (Cupertino, CA)

…seeks a senior /principal engineer to architect and build distributed ML infrastructure. This role involves optimizing GPU compute systems and collaborating with ... silicons teams to enhance AI capabilities. Candidates should have substantial experience in GPU programming and distributed systems, alongside a technical degree.… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Principal Software Engineer - Akamai…

Akamai Technologies GmbH (Cambridge, MA)

… AI systems at massive scale Demonstrate deep expertise across multiple AI / ML disciplines, including inference frameworks, model optimization, platform ... Senior Principal Software Engineer - Akamai Inference...across the organization Serving as principal technical advisor on AI inference , providing expert guidance on complex… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Backend Engineer: Distributed…

Neara (Palo Alto, CA)

…and maintain distributed systems that support high-throughput, low-latency AI model inference and data services. Partner with ML researchers and product ... Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's...ML engineers, and product teams to bring cutting-edge AI capabilities into productionat scale, with reliability, and under… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Engineer- AI Inference

The Association of Technology, Management and Applied… (Morgan Hill, CA)

…effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI / ML , and advanced analytics. Hands on experience in both ... Bank of America, at the forefront of innovation in AI . We are building the next generation of Gen...within the organization. Experience with deploying models using vLLM/Triton Inference Server Performance Tuning those models and deployment to… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Manager, AI Engineering…

Capital One (New York, NY)

…Cloud, Azure, or equivalent private cloud) Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, ... Senior Manager, AI Engineering (People Leader)...answering their questions in real time, our applications of AI & ML are bringing humanity and… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Inference Platform Engineer…

Hamilton Barnes Associates Limited (San Francisco, CA)

…systems. Requirements 5+ years' experience building large-scale, fault-tolerant distributed systems ( ML inference , HPC, or similar). Proficiency in Python, Go, ... most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Model Inference Engineer…

OpenAI (San Francisco, CA)

A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal ... over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves collaboration… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search