- Red Hat (Boston, MA)
- …for enterprises to build, optimize, and scale LLM deployments. We are seeking an experienced Senior ML Ops engineer to work closely with our product and research ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...teams to scale SOTA deep learning products and software . As an ML Ops engineer , you… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... + Contribute features and code to NVIDIA's inference libraries, vLLM and SGLang, FlashInfer and LLM software ...libraries, vLLM and SGLang, FlashInfer and LLM software solutions. + Work with cross-collaborative teams across frameworks,… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , FlashInfer. NVIDIA has been transforming computer graphics, PC gaming, and accelerated ... engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software...teams + Contributing to open source communities like FlashInfer, vLLM , and SGLang What we need to see: +… more
- NVIDIA (Santa Clara, CA)
- We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. ... AI. What you'll be doing: + Contribute features to vLLM that empower the newest models with the latest...way to integrate research ideas and prototypes into NVIDIA's software products. What we need to see: + Bachelor's… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate ... our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... + Contribute features and code to NVIDIA's inference libraries, vLLM and SGLang, FlashInfer and LLM software ...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want… more
- Red Hat (Raleigh, NC)
- …serve models, and deliver innovative apps. The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join ... Contribute directly to upstream inference runtime communities such as vLLM (https://github.com/ vllm -project/ vllm ) , TGI (https://github.com/huggingface/text-generation-inference)… more
- LinkedIn (Mountain View, CA)
- …engineers to optimize their models and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand opportunities to ... vision models. We optimize performance across algorithms, AI frameworks, data infra, compute software , and hardware to harness the power of our GPU fleet with… more
- MongoDB (Palo Alto, CA)
- **About the Role** We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic ... with Atlas and designed for developer-first experiences. As a Senior Engineer , you'll focus on building core...Atlas users + Gain hands-on experience with tools like vLLM and container orchestration with Kubernetes **Who You Are**… more
- NVIDIA (CA)
- …how you can make a lasting impact on the world. We are now looking for a Senior System Software Engineer to work on user facing tools for Dynamo Inference ... + Familiar with or able to quickly gain expertise in vLLM , SGLang, PyTorch, NVIDIA GPUs, and supporting software stacks such as NIXL, NCCL, CUDA, as well as HPC… more
- NVIDIA (Santa Clara, CA)
- …platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer focused on container and cloud infrastructure. You ... Inference Microservices (NIMs) and our hosted services. You will build enterprise-grade software and tooling for container build, packaging, and deployment. You will… more
- Red Hat (Boston, MA)
- **Job Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will ... + Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities… more
- NVIDIA (Santa Clara, CA)
- …team and see how you can make a lasting impact on the world. NVIDIA is seeking a Senior Software Engineer to serve as a Tech Lead, driving the design and ... experience; MS or PhD preferred + 8+ years of software engineering experience, including 2+ years as tech lead....TensorRT, Triton, NeMo) and LLM serving frameworks (eg, Dynamo, vLLM , SGLang). + Experience with RAG systems and communication… more
- JPMorgan Chase (Seattle, WA)
- …pushing the envelope to enhance, build, and deliver top-notch technology products. As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate ... decision-making. + Advocate for firmwide frameworks, tools, and practices within the Software Development Life Cycle. + Promote a culture of diversity, equity,… more
- Amazon (Cupertino, CA)
- …The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's ... ML accelerators. Working across the stack from PyTorch till the hardware- software boundary, our engineers build systematic infrastructure, innovate new methods and… more
- NVIDIA (Santa Clara, CA)
- We are looking for a Software Test development engineer in NVIDIA's Deep Learning SWQA team. The position is in NVIDIA Deep Learning and AI Software Quality ... validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech recognition,… more
- Amazon (Seattle, WA)
- …The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's ... ML accelerators. Working across the stack from PyTorch till the hardware- software boundary, our engineers build systematic infrastructure, innovate new methods and… more
- Red Hat (Boston, MA)
- …on Github. As a Senior Machine Learning Engineer focused on vLLM , you will be at ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...computing will directly impact the development of our cutting-edge software platform, helping to shape the future of AI… more
- NVIDIA (Santa Clara, CA)
- …Frameworks (eg PyTorch, JAX), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design, ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core...intersection of AI applications, libraries, frameworks, and the entire software stack. + Innovate and improve model architectures, distributed… more
- Amazon (Cupertino, CA)
- Description AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. As a part of the Neuron ... motivation to achieve results. Experience with technologies and tools such as XLA, vLLM or Hugging Face transformers is highly valued. *Utility Computing (UC)* AWS… more