- Red Hat (Raleigh, NC)
- …platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM , you will be at the forefront of ... mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team...solving challenging technical problems at the forefront of deep learning in the open source way, this is the… more
- Red Hat (Boston, MA)
- …to build, optimize, and scale LLM deployments. As a Principal Machine Learning Engineer focused on distributed vLLM (http://github.com/ vllm -project/) ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...and guide fellow engineers, fostering a culture of continuous learning and innovation. **What you will bring** + Strong… more
- Red Hat (Boston, MA)
- …on Github. As a Senior Machine Learning Engineer focused on vLLM , ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...challenges in model performance and efficiency. Your work with machine learning and high performance computing will… more
- Red Hat (Boston, MA)
- …you will do + Collaborate with research and product development teams to scale machine learning products for internal and external applications + Create and ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...learning products and software. As an ML Ops engineer , you will work closely with our technical and… more
- Red Hat (Raleigh, NC)
- …on Github. As a Machine Learning ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...research **What you will bring** + Strong understanding of machine learning and deep learning … more
- Red Hat (Boston, MA)
- …a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on llama.cpp, you will be at the ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...challenges in model performance and efficiency. Your work with machine learning and high performance computing will… more
- CAE USA INC (Orlando, FL)
- …development, advancement and having fun! Summary We are seeking a highly skilled and experienced Machine Learning Engineer to join our growing AI & Data ... about solving complex problems using data-driven approaches and deploying scalable machine learning solutions in production environments. Additionally, t his… more
- Palo Alto Networks (Santa Clara, CA)
- …while ensuring a formidable security posture from development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as a ... in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. + Expert-level programming skills in Python… more
- Cisco (San Jose, CA)
- …Relevant fields include: Computer Science, Electrical Engineering, Data Science, Machine Learning , Artificial Intelligence, Statistics, Mathematics, Software ... challenge of optimizing neural networks for natural language processing and machine perception, drawing on a toolkit that includes convolutional and… more
- NVIDIA (Santa Clara, CA)
- …+ Expertise in inference engines like vLLM and SGLang + Expertise in machine learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel ... We are now looking for a Senior Deep Learning Software Engineer , FlashInfer. NVIDIA has...teams + Contributing to open source communities like FlashInfer, vLLM , and SGLang What we need to see: +… more
- S&P Global (Cambridge, MA)
- …is looking for ML Engineer interns to join the group of Machine Learning Engineers working on developing a cutting-edge GenAI platform, LLM-powered ... is S&P Global's hub for AI innovation and transformation. With expertise in machine learning , natural language processing, and data discovery, we develop and… more
- NVIDIA (Santa Clara, CA)
- …a related field (or equivalent experience) + 4+ years of professional experience in deep learning or applied machine learning . + Strong foundation in deep ... DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying Large Language… more
- NVIDIA (Santa Clara, CA)
- …solutions. What you'll be doing: + Develop algorithms for AI/DL, data analytics, machine learning , or scientific computing + Contribute and advance open source ... (eg PyTorch, JAX), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design, debugging,… more
- NVIDIA (Santa Clara, CA)
- …solutions. What you'll be doing: + Develop algorithms for AI/DL, data analytics, machine learning , or scientific computing + Contribute and advance open source ... PyTorch, JAX, Ray), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design, debugging,… more
- NVIDIA (Santa Clara, CA)
- …environment. Ways to stand out from the crowd: + Contributions to PyTorch, JAX, vLLM , SGLang, or other machine learning training and inference frameworks. ... optimization. We are looking for passionate engineers with strong foundations in both machine learning and software systems/architecture who are eager to make a… more
- NVIDIA (Santa Clara, CA)
- …to stand out from the crowd: + Contributions to PyTorch, Megatron-LM, NeMo, TensorRT-LLM, vLLM , SGLang, or other machine learning training and inference ... optimization. We are looking for passionate engineers with strong foundations in both machine learning and software systems/architecture who are eager to make a… more
- NVIDIA (Santa Clara, CA)
- …related field (or equivalent experience). + 3+ years of professional experience in deep learning , applied machine learning , or physical AI development. + ... DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying Large Language… more
- Red Hat (Raleigh, NC)
- Red Hat(R) OpenShift(R) AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver ... to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the… more
- General Motors (Sunnyvale, CA)
- …ML-centric use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference, with a ... the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms...+ 8+ years of industry experience, with focus on machine learning systems or high performance backend… more
- Amazon (Cupertino, CA)
- …AWS Neuron is the software stack powering AWS Inferentia and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at ... scale. The Neuron Serving team develops infrastructure to serve modern machine learning models-including large language models (LLMs) and multimodal… more