• Senior Software Engineer , AI

    NVIDIA (CA)
    …distributed model management systems, including Rust-based runtime components, for large-scale AI inference workloads. + Implement inference scheduling ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...+ Collaborate with infrastructure engineers and researchers to develop scalable APIs, services, and end-to-end inference workflows.… more
    NVIDIA (11/29/25)
    - Save Job - Related Jobs - Block Source
  • Lead AI Engineer (FM Hosting, LLM…

    Capital One (San Francisco, CA)
    Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... upon number of hours to be regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700 for Lead AI Engineer more
    Capital One (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - AI

    Argonne National Laboratory (Lemont, IL)
    … working in the space of enabling AI for science, specifically targeting scalable inference leveraging HPC systems and AI accelerators. The successful ... In this position, the candidate can expect to explore and engineer solutions for AI inference integrated within scientific workflows, via programmatic access… more
    Argonne National Laboratory (01/06/26)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer AI

    Amazon (Cupertino, CA)
    …optimization and system reliability across the Neuron ecosystem * Design and implement scalable solutions for both offline and online inference workloads * Lead ... stack About the team The Neuron Serving team is at the forefront of scalable and resilient AI infrastructure at AWS. We focus on developing model-agnostic … more
    Amazon (12/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Engineer - AI

    Bank of America (Addison, TX)
    Senior Engineer - AI Inference Addison, Texas;Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI - Inference \_25029879) **Job Description:** At Bank… more
    Bank of America (12/22/25)
    - Save Job - Related Jobs - Block Source
  • Lead Engineer , Inference Platform

    MongoDB (Palo Alto, CA)
    We're looking for a Lead Engineer , Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... retrieval, and AI -native features across MongoDB Atlas. This role is part...Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform, you'll be hands-on with… more
    MongoDB (12/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Machine Learning Engineer

    Red Hat (Boston, MA)
    …will collaborate with our team to tackle the most pressing challenges in scalable inference systems and Kubernetes-native deployments. Your work with distributed ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise... infrastructure leveraging Kubernetes APIs, operators, and the Gateway Inference Extension API for scalable LLM deployments.… more
    Red Hat (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Inference

    MongoDB (Palo Alto, CA)
    …with ML researchers from Voyage. ai to bring novel ideas into scalable systems + Tackle challenging problems in inference , observability, and distributed ... **About the Role** We're looking for a Senior Engineer to help build the next-generation inference...supporting semantic search and hybrid retrieval + Collaborate with AI engineers and researchers to productionize inference more
    MongoDB (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... What you'll be doing: + Design and build modular, scalable model optimization software platforms that deliver exceptional user...NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer - Dynamo…

    NVIDIA (CA)
    …to stand out from the crowd: + Prior work experience improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. ... We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference...role, you will develop open source software to serve inference of trained AI models running on… more
    NVIDIA (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect, Inference Deployments

    NVIDIA (Santa Clara, CA)
    We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions ... Solutions Architecture with a proven track record of moving AI inference from POC to production on...AWS Lambda, NVCF) with NVIDIA GPUs. + NVIDIA Certified AI Engineer or similar. + Active contributions… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • AI / ML Engineer

    Guidehouse (Huntsville, AL)
    …solutions that leverage large language models (LLMs), retrieval systems, and secure, scalable inference pipelines to enable data-driven decision-making. You will ... Active Top Secret (TS) Guidehouse is seeking a Lead AI /ML Engineer to join our Technology /...the FBI adjudication platform. Drives design and implementation of inference pipelines, RAG workflows, retrieval systems, prompt architectures, and… more
    Guidehouse (01/01/26)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer - AI /ML, AWS…

    Amazon (Seattle, WA)
    …and Llama, and collaborate directly with silicon architects to influence the future of AI hardware. Our systems handle millions of inference calls daily, while ... accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll be...deep technical experts transform complex ML challenges into elegant, scalable solutions that define how AI workloads… more
    Amazon (10/31/25)
    - Save Job - Related Jobs - Block Source
  • Sr Principal AI Software Engineer

    Oracle (San Juan, PR)
    **Job Description** The Senior Principal AI /ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI ... AI infrastructure offerings, spearheads the design and implementation of scalable orchestration for AI /ML workloads-incorporating the latest research in… more
    Oracle (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Summer Intern - AI /ML Engineer

    General Motors (Sunnyvale, CA)
    …such as Embodied AI , Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of ML engineers, and drive ... the adoption of cutting-edge ML techniques. Our AV ML infrastructure includes: ** AI Validation & Inference ** : Ensures robust model performance by running… more
    General Motors (12/13/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer - OCI AI

    Oracle (Frankfort, KY)
    …serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to agent-based AI systems or tool-based inference workflows. - Knowledge of ... future of cloud computing-designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of… more
    Oracle (12/20/25)
    - Save Job - Related Jobs - Block Source
  • Lead, AI Data Engineer

    Mastercard (O'Fallon, MO)
    …mining, cleaning, processing, and validation. -Contribute to the development of scalable AI infrastructure and reusable components for enterprise-wide use. ... businesses and governments realize their greatest potential._ **Title and Summary** Lead, AI Data Engineer Mastercard Security Services is responsible for… more
    Mastercard (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Staff AI /ML Engineer

    General Motors (Austin, TX)
    **Job Description** **Staff AI /ML Engineer , AV ML Infra** We're General Motors (GM), a company driving the future of mobility with advanced self-driving and ... AI , Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of...of cutting-edge ML techniques. Our ML infrastructure includes: + ** AI Validation & Inference :** Ensures robust model… more
    General Motors (10/31/25)
    - Save Job - Related Jobs - Block Source
  • AI and LLM Engineer

    SAIC (VA)
    **Description** We are seeking a **hands-on AI & LLM Engineer ** to join the AWS AI /GenAI Solutions Team within the IRS Advanced Analytics Program (AAP). This ... Step Functions) in production workflows. + Proven ability to build and deploy ** inference endpoints and APIs** for AI /ML workloads. + Familiarity with **CI/CD… more
    SAIC (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Principal AI Engineer , Enterprise…

    Palo Alto Networks (Santa Clara, CA)
    …is to create an environment where we all win with precision. **Your Career** As a Principal AI Engineer for the Enterprise AI Platform, you will be a pivotal ... assistant, agent, app, supporting both traditional and Generative AI model development, deployment, and real-time inference ...Research: Actively learn and incorporate advances in Agents, Generative AI , LLMs, and scalable AI more
    Palo Alto Networks (12/17/25)
    - Save Job - Related Jobs - Block Source