• Amazon (Seattle, WA)
    A leading technology company based in Seattle is seeking a Senior Software Engineer for its Machine Learning Inference Applications team. The role focuses on ... development and optimization for LLM inference on specialized hardware. Ideal candidates will have significant software development experience and an understanding… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    …Virginia is seeking a Senior Software Development Engineer to work on AI / ML projects. You will design and optimize machine learning models for deployment ... learning principles. This role fosters a collaborative environment emphasizing continuous learning and performance tuning for clients' ML workloads. #J-18808-Ljbffr more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • quadric.io, Inc (Burlingame, CA)
    A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires ... expertise in AI model algorithms, strong C/C++ and Python skills, and...experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Nutanix (San Diego, CA)
    A global technology leader in San Diego is seeking an AI Software Engineer to develop on-device software solutions using Python and C/C++. The ideal candidate will ... work in a dynamic environment, collaborating with researchers to advance Gen AI technology. A strong background in software engineering and experience with deep… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI / ML , AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI / ML , AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (Seattle, WA)
    …Inferentia and Trainium cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Comfy (San Francisco, CA)
    …platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will ... engage in building efficient AI models and tackling complex challenges. The role requires...limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual generative… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at ... AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...a consistent, high-impact go-to-market strategy.This role will focus on AI inference at scale, ensuring that customers… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Red Hat (Boston, MA)
    …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...optimize, and scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Red Hat, Inc. (Boston, MA)
    …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...and scale LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Capital One (Annapolis, MD)
    Senior AI Engineer ( AI Foundations,...AI services Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity ... answering their questions in real time, our applications of AI & ML are bringing humanity and...be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 -… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Etched.ai, Inc. (San Jose, CA)
    A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Apple Inc. (Cupertino, CA)
    …seeks a senior /principal engineer to architect and build distributed ML infrastructure. This role involves optimizing GPU compute systems and collaborating with ... silicons teams to enhance AI capabilities. Candidates should have substantial experience in GPU programming and distributed systems, alongside a technical degree.… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Akamai Technologies GmbH (Cambridge, MA)
    AI systems at massive scale Demonstrate deep expertise across multiple AI / ML disciplines, including inference frameworks, model optimization, platform ... Senior Principal Software Engineer - Akamai Inference...across the organization Serving as principal technical advisor on AI inference , providing expert guidance on complex… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Neara (Palo Alto, CA)
    …and maintain distributed systems that support high-throughput, low-latency AI model inference and data services. Partner with ML researchers and product ... Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's...ML engineers, and product teams to bring cutting-edge AI capabilities into productionat scale, with reliability, and under… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • The Association of Technology, Management and Applied… (Morgan Hill, CA)
    …effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI / ML , and advanced analytics. Hands on experience in both ... Bank of America, at the forefront of innovation in AI . We are building the next generation of Gen...within the organization. Experience with deploying models using vLLM/Triton Inference Server Performance Tuning those models and deployment to… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Capital One (New York, NY)
    …Cloud, Azure, or equivalent private cloud) Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, ... Senior Manager, AI Engineering (People Leader)...answering their questions in real time, our applications of AI & ML are bringing humanity and… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …systems. Requirements 5+ years' experience building large-scale, fault-tolerant distributed systems ( ML inference , HPC, or similar). Proficiency in Python, Go, ... most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • OpenAI (San Francisco, CA)
    A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal ... over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves collaboration… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source