- Amazon (Seattle, WA)
- A leading technology company based in Seattle is seeking a Senior Software Engineer for its Machine Learning Inference Applications team. The role focuses on ... development and optimization for LLM inference on specialized hardware. Ideal candidates will have significant software development experience and an understanding… more
- Amazon (San Francisco, CA)
- …Virginia is seeking a Senior Software Development Engineer to work on AI / ML projects. You will design and optimize machine learning models for deployment ... learning principles. This role fosters a collaborative environment emphasizing continuous learning and performance tuning for clients' ML workloads. #J-18808-Ljbffr more
- quadric.io, Inc (Burlingame, CA)
- A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires ... expertise in AI model algorithms, strong C/C++ and Python skills, and...experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The… more
- Nutanix (San Diego, CA)
- A global technology leader in San Diego is seeking an AI Software Engineer to develop on-device software solutions using Python and C/C++. The ideal candidate will ... work in a dynamic environment, collaborating with researchers to advance Gen AI technology. A strong background in software engineering and experience with deep… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI / ML , AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
- Amazon (San Francisco, CA)
- Software Development Engineer, AI / ML , AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
- Amazon (Seattle, WA)
- …Inferentia and Trainium cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more
- NVIDIA Corporation (Santa Clara, CA)
- Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at ... AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...a consistent, high-impact go-to-market strategy.This role will focus on AI inference at scale, ensuring that customers… more
- Comfy (San Francisco, CA)
- …platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will ... engage in building efficient AI models and tackling complex challenges. The role requires...limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual generative… more
- Apple Inc. (Cupertino, CA)
- …seeks a senior /principal engineer to architect and build distributed ML infrastructure. This role involves optimizing GPU compute systems and collaborating with ... silicons teams to enhance AI capabilities. Candidates should have substantial experience in GPU programming and distributed systems, alongside a technical degree.… more
- Red Hat (Boston, MA)
- …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...optimize, and scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product… more
- Red Hat, Inc. (Boston, MA)
- …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...and scale LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product… more
- Neara (Palo Alto, CA)
- …and maintain distributed systems that support high-throughput, low-latency AI model inference and data services. Partner with ML researchers and product ... Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's...ML engineers, and product teams to bring cutting-edge AI capabilities into productionat scale, with reliability, and under… more
- The Association of Technology, Management and Applied… (Morgan Hill, CA)
- …effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI / ML , and advanced analytics. Hands on experience in both ... Bank of America, at the forefront of innovation in AI . We are building the next generation of Gen...within the organization. Experience with deploying models using vLLM/Triton Inference Server Performance Tuning those models and deployment to… more
- Etched.ai, Inc. (San Jose, CA)
- A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should… more
- NVIDIA Corporation (Santa Clara, CA)
- …technology company is seeking a Senior Technical Marketing Engineer for AI Inference at Scale. This role involves developing impactful messaging around ... Applicants should have a strong background in product marketing, experience with AI / ML workloads, and exceptional communication skills. The position offers a… more
- Akamai Technologies GmbH (Cambridge, MA)
- … AI systems at massive scale Demonstrate deep expertise across multiple AI / ML disciplines, including inference frameworks, model optimization, platform ... Senior Principal Software Engineer - Akamai Inference...across the organization Serving as principal technical advisor on AI inference , providing expert guidance on complex… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …systems. Requirements 5+ years' experience building large-scale, fault-tolerant distributed systems ( ML inference , HPC, or similar). Proficiency in Python, Go, ... most advanced AI workloads worldwide. They're now building a serverless inference platform, beginning with cost-efficient batch inference and expanding into… more
- OpenAI (San Francisco, CA)
- A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal ... over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves collaboration… more
- Capital One (San Francisco, CA)
- …financial services provider in San Francisco is seeking a Technical Specialist to develop AI and ML solutions. You'll need a strong foundation in engineering and ... 4 years of experience programming in Python and deploying AI on cloud platforms. The ability to optimize solutions...The ability to optimize solutions and a passion for AI research are essential. This role offers a competitive… more