- Red Hat (Boston, MA)
- …an experienced Senior ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops ... AI ! What you will do + Collaborate with research and product development teams to scale machine learning...vLLM CI community is a big plus \#LI-MD2 \# AI -HIRING The salary range for this position is $133,650.00… more
- Microsoft Corporation (Redmond, WA)
- …all employees to positively impact our culture every day. **Responsibilities** + As a Research Engineer in AI Frontiers, you will design, develop, execute, ... **Overview** The ** AI Frontiers** lab at **Microsoft Research **...links to top academic institutions around the world. This Research Engineer position is a unique opportunity… more
- NVIDIA (Santa Clara, CA)
- …fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone ... and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme.... What you'll be doing: + Contribute features to vLLM that empower the newest models with the latest… more
- Amazon (Cupertino, CA)
- …and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a ... particular focus on large-scale generative AI applications. Key job responsibilities * Architect and lead...workloads * Lead integration efforts with frameworks such as vLLM , SGLang, Torch XLA, TensorRT, and Triton * Develop… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You will be responsible for the entire lifecycle of our AI… more
- Elevance Health (Norfolk, VA)
- **Gen AI Engineer ** **Location:** This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while providing ... position is not eligible for current or future visa sponsorship._ The **Gen** ** AI Engineer ** is responsible for analyzing and modeling organizational data for… more
- Bank of America (Addison, TX)
- Software Engineer III -Gen AI Inferencing Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Software- Engineer -III--Gen- AI -Inferencing-\_25032986) **Job Description:** At Bank of America,… more
- Bank of America (Charlotte, NC)
- Senior Engineer - AI Inference Addison, Texas;Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, you ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI -Inference\_25029879) **Job Description:** At Bank of America,… more
- Robert Half Technology (Miami, FL)
- Description The AI /ML Engineer will design, build, and deploy production-grade machine learning and AI systems that power core products and features. This ... role bridges cutting-edge research with reliable, scalable engineering, turning prototypes into high-performance services that run 24/7 in production. Key… more
- Google (Sunnyvale, CA)
- Software Engineer III, AI /ML, TPU Inference _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... on and is growing every day. As a software engineer , you will work on a specific project critical...vLLM and integrating the latest academic and industry research , we ensure our solutions represent the cutting edge… more
- NVIDIA (Santa Clara, CA)
- …Performance tuning and optimizations of deep learning framework and software components. + Research , prototype, and develop robust and scalable AI tools and ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron...PyTorch, JAX), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design,… more
- Cisco (San Jose, CA)
- …Join our innovative engineering team focused on building next-generation AI /ML solutions. You'll collaborate with skilled colleagues across platform, security, ... Impact** Dive into the development and implementation of cutting-edge generative AI applications using the latest large language models-think GPT-4, Claude, Llama,… more
- NVIDIA (Santa Clara, CA)
- …will define the future of generative AI . We are looking for a research engineer who is passionate about open-source and excited to create our next-generation ... post-training software stack. You will work at the intersection of research and engineering, collaborating with the Post-Training and Frameworks teams to invent,… more
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... so on. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from… more
- Amazon (Sunnyvale, CA)
- …General Intelligence (AGI) team is looking for a passionate, talented, and inventive Research Engineer with a strong hands-on machine learning background, to ... the development of industry-leading multimodal large-language foundational models (LLMs). As a Research Engineer with the AGI team, you will be responsible… more
- Red Hat (Raleigh, NC)
- …(https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as- ai ... Summary** At Red Hat we believe the future of AI is open and we are on a mission...mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates… more
- NVIDIA (Santa Clara, CA)
- …across diverse GPU platforms, particularly for physical AI and generative AI applications. You will collaborate with research scientists, software engineers, ... We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning...and hardware specialists to bring cutting-edge AI models from prototype to production. What you will… more
- NVIDIA (Santa Clara, CA)
- …strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in… more
- NVIDIA (Santa Clara, CA)
- …of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in ... NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such… more
- General Motors (Sunnyvale, CA)
- …**About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, ... reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve as the ...the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms… more