• Senior Software Engineer, Inference

    MongoDB (Palo Alto, CA)
    …We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, ... in MongoDB Atlas. You'll join the broader Search and AI Platform organization and collaborate with ML...Do** + Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas,… more
    MongoDB (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior Engineer- AI Inference

    Bank of America (Addison, TX)
    Senior Engineer- AI Inference Addison, Texas;Plano,.... We are building the next generation of Gen AI platform , empowering new AI initiatives ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/ Senior -Engineer- AI - Inference \_25029879) **Job Description:** At Bank… more
    Bank of America (12/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Principal MLOps Engineer, AI

    Red Hat (Raleigh, NC)
    …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
    Red Hat (01/07/26)
    - Save Job - Related Jobs - Block Source
  • AI Inference Engineer

    quadric.io, Inc (Burlingame, CA)
    …unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform ; [2] optimize the model deployment for efficient ... and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the...; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI more
    quadric.io, Inc (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Sr. SDM, AI Inference Technology,…

    Amazon (Seattle, WA)
    …accelerators that power the latest AI models As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers to build ... inference technology building blocks and libraries to enable AI developers to optimize model for inference ...You will work with the executive leadership and other senior management and technical leaders to define product directions… more
    Amazon (12/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Inference Technical Product…

    NVIDIA (Santa Clara, CA)
    …will influence NVIDIA's entire technical marketing strategy to showcase our leadership position in AI inference . Want to join a fun, creative company that is at ... companies! What You'll Be Doing: + Help drive NVIDIA's inference platform technical go-to-market efforts + Work...cache implementations) and identify intersection points between the latest AI models and NVIDIA's platform to maximize… more
    NVIDIA (12/25/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Manager, AI

    Amazon (Seattle, WA)
    Inference Technology building blocks team, you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries ... enable AI developers to optimize model for inference on Trainium and Inferentia devices. We're currently focusing...day in the life You will work with your senior management and technical leaders to define the building… more
    Amazon (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …performance to strengthen NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to see: + Master's, PhD, or ... to neural architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Inference

    NVIDIA (Durham, NC)
    …Architecture team does groundbreaking hardware-software co-design work that focuses on accelerating AI Inference workloads. In this role, you will write ... We are now looking for a Senior Deep Learning Inference Performance Architect!...architectures to extend the state of the art in AI Inference performance and efficiency + Model,… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Machine Learning Engineer,…

    Red Hat (Boston, MA)
    …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to ... Summary** At Red Hat we believe the future of AI is open and we are on a mission...model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
    Red Hat (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer - Dynamo…

    NVIDIA (CA)
    …to stand out from the crowd: + Prior work experience improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. ... We are now looking for a Senior System Software Engineer to work on Dynamo...role, you will develop open source software to serve inference of trained AI models running on… more
    NVIDIA (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Machine Learning Engineer,…

    Red Hat (Boston, MA)
    …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to ... high performance computing will directly impact the development of our cutting-edge software platform , helping to shape the future of AI deployment and… more
    Red Hat (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Software Development Manager, LLM Inference

    Amazon (Cupertino, CA)
    …as Llama and GPT-OSS to run really fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI /ML engineers to ... open-source and customer LLMs, both dense and MoE, for inference on Neuron and Trainium and Inferentia accelerators. You...day in the life You will work with your senior management and technical leaders to define the model… more
    Amazon (12/06/25)
    - Save Job - Related Jobs - Block Source
  • Staff Applied Scientist (Causal Inference )

    Coinbase (Charlotte, NC)
    …in the job description. For select roles, Coinbase is also piloting an AI interview intelligence platform to transcribe and summarize interview notes, allowing ... us, every day, as we build the emerging onchain platform - and with it, the future global financial...(ie. Job Duties)* * Develop and* deploy robust causal inference models* to quantify the holistic business impact of… more
    Coinbase (11/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Engineer (Gen AI

    Capital One (New York, NY)
    Senior AI Engineer (Gen AI Platform Services, Agentic Systems) **Overview:** At Capital One, we are creating responsible and reliable AI systems, ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 -… more
    Capital One (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Platform Engineer

    PennyMac (Westlake Village, CA)
    …achieve aspirations of homeownership through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and ... ensuring they can be securely and efficiently consumed by platform services. + Enable AI /ML workflows by...AI workloads, including monitoring and resource scaling for inference services. + Implement and enforce security best practices… more
    PennyMac (01/07/26)
    - Save Job - Related Jobs - Block Source
  • Senior Developer Relations Manager,…

    NVIDIA (Santa Clara, CA)
    We are looking for a Developer Relations Manager - AI Platform SW, passionate about developing modern Artificial Intelligence, and Generative AI applications ... Focus will be on accelerating GenAI model training and inference , which is making a major impact across research...throughout the developer applications. At NVIDIA, we are enabling AI platform software solutions for accelerated compute… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Senior Data & AI Platform

    Veralto (MA)
    Position Summary We're seeking a Senior Data & AI Platform Architect to lead the design and implementation of TraceGains' next-generation data and MLOps ... 80% of their platform needs * Successfully hire and onboard 2-3 senior data engineers and AI /ML engineers Required Qualifications Experience & Background *… more
    Veralto (12/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, AI

    LinkedIn (Mountain View, CA)
    …Attention. PyTorch Lightning and more. Feature Engineering: this team shapes the future of AI with the state-of-the-art Feature Platform , which empowers AI ... models together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with hundreds...infra on top of native cloud, enable GPU based inference for a large variety of use cases, cuda… more
    LinkedIn (12/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, AI Engineering…

    Capital One (San Jose, CA)
    Senior Manager, AI Engineering (People Leader) (GenAI Platform Services) **Overview** : At Capital One, we are creating responsible and reliable AI ... us to be at the forefront of enterprises leveraging AI . From informing customers about unusual charges to answering...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
    Capital One (11/04/25)
    - Save Job - Related Jobs - Block Source