- Red Hat (Raleigh, NC)
- …provides a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the ... LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings...solving challenging technical problems at the forefront of deep learning in the open source way, this is the… more
- Red Hat (Boston, MA)
- …on Github. As a Machine Learning ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...challenges in model performance and efficiency. Your work with machine learning and high performance computing will… more
- Red Hat (Boston, MA)
- …for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...and guide fellow engineers, fostering a culture of continuous learning and innovation. **What you will bring** + Strong… more
- Amazon (Seattle, WA)
- …and the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the... Technology team works side by side with the Inference Model Enablement, compiler runtime engineers to create, build… more
- Amazon (Seattle, WA)
- …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trn1. Experience optimizing inference performance for… more
- Palo Alto Networks (Santa Clara, CA)
- …while ensuring a formidable security posture from development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as ... and long-term strategy of our AI platform - ML inference . Beyond individual contribution, you will lead complex technical...a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. + Expert-level… more
- Amazon (Seattle, WA)
- … accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development ... at least one software programming language - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work… more
- Amazon (Seattle, WA)
- …scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more
- Amazon (Seattle, WA)
- …scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more
- NVIDIA (Santa Clara, CA)
- …out from the crowd: + Contributions to PyTorch, JAX, vLLM, SGLang, or other machine learning training and inference frameworks. + Hands-on experience ... strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like… more
- General Motors (Sunnyvale, CA)
- …use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference , with a focus on ... eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms...+ 8+ years of industry experience, with focus on machine learning systems or high performance backend… more
- quadric.io, Inc (Burlingame, CA)
- …network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and ... conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique… more
- Capital One (San Francisco, CA)
- …for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in ... Lead AI Engineer (FM Hosting, LLM Inference ) **Overview**...world-class talent - along with our deep experience in machine learning - position us to be… more
- Amazon (Cupertino, CA)
- …is the software stack powering AWS Inferentia and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. ... The Neuron Serving team develops infrastructure to serve modern machine learning models-including large language models (LLMs) and multimodal workloads-reliably… more
- NVIDIA (Durham, NC)
- We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer who ... to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does...years or relevant experience + Strong mathematical foundation in machine learning and deep learning … more
- Red Hat (Boston, MA)
- …you will do + Collaborate with research and product development teams to scale machine learning products for internal and external applications + Create and ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...learning products and software. As an ML Ops engineer , you will work closely with our technical and… more
- Google (Sunnyvale, CA)
- Senior Software Engineer , Machine Learning , Kernel...+ 3 years of experience in software development for machine learning model inference or ... training, and 1 year of experience with ML model inference and training optimization on modern GPU/TPU architectures. **Preferred...Cloud is searching for a highly skilled and motivated engineer to optimize machine learning … more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** As a **Senior Software Engineer - Machine Learning ** , you are a technical leader working at the intersection of ... machine learning and software engineering. You have...scalable model serving platforms for both batch and real-time inference + Build model deployment pipelines with automated testing… more
- CACI International (Florham Park, NJ)
- Senior AI Machine Learning Research Engineer Job Category: Engineering Time Type: Full time Minimum Clearance Required to Start: None Employee Type: Regular ... in Florham Park, NJ for a Senior AI and Machine Learning Research Engineer . Apply...and experience with 3D edge object identification, tracking, and inference . + Experience with computer vision frameworks, software &… more
- Honeywell (San Jose, CA)
- We're seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect, design, and develop advanced AI/ML systems that ... with cross-functional teams to deliver intelligent, scalable, and production-ready AI and machine learning technologies. You will be responsible for researching,… more