- Broadcom (Austin, TX)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- Meta (Lansing, MI)
- …strategy that delivers a highly flexible platform to train & serve new DL/ ML model architectures, combined with auto-tuned high performance for production ... be working on one of the core areas such as PyTorch framework components, AI compiler and runtime , high-performance kernels and tooling to accelerate machine… more
- Lowe's (Charlotte, NC)
- …sophisticated deep learning (DL) frameworks and large language models (LLMs). The Sr AI Engineer will work on scaling model performance, building essential tools ... Optimizes data flows to support the demands of high-velocity AI - CV model training and inference....pruning, quantization, and optimized inference with TensorRT or ONNX Runtime . Nvidia deep learning stacks and model … more
- Cisco (San Diego, CA)
- …you will: Own the full lifecycle of research and deployment of next-generation AI systems for intelligent code generation, including model design, evaluation, ... but reason, plan, self-correct, and integrate with tools and runtime environments. Advance the state of the art in... health monitoring. Research Leadership - Publications in premier AI / ML venues (NeurIPS, ICML, ICLR, ACL, AAAI,… more
- Broadcom (Austin, TX)
- …control plane that automates the lifecycle of AI Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent ... Kubernetes Platform Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...key to building a best in class private cloud AI platform. You will have a high impact by… more
- Palo Alto Networks (Santa Clara, CA)
- …address. In response, Prisma AIRS delivers model security, posture management, AI red teaming, and runtime protection. Our customers can confidently deploy ... office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind...for the architectural design and long-term strategy of our AI platform - ML inference. Beyond individual… more
- Oracle (San Jose, CA)
- …At Oracle Cloud Infrastructure (OCI), we are building the next generation of AI applications that transform how enterprises get work done. One of our flagship ... generate insights, and automate workflows through natural language. As part of the AI Applications team, you will design and build agentic systems that use LLMs… more
- Palo Alto Networks (Santa Clara, CA)
- …platform for AI security, encompassing model security, posture management, AI red teaming, and runtime protection. Job Summary As the Senior Director ... threat detection, network security, cloud security, application security). Direct experience with AI / ML platforms, MLOps, or building security solutions for … more
- Meta (Nashville, TN)
- …and ML concepts at a core level. (data collection, model training, deployment, runtime performance) Preferred Qualifications: Preferred Qualifications Proven ... implement new capabilities in partnership with cross functional teams Implement runtime logic, tooling within a complex real-time environment with performance… more
- Texas Instruments (Dallas, TX)
- …and implementing ML and AI methods for improving model accuracy and runtime . Implementing advanced optimization techniques. Working with process ... and process integration teams on wafer verification to validate model quality. Working with OPC verification engineers to improve...fabs within TI. Developing and maintaining a suite of model quality checks for unit testing. These include but… more
- Broadcom (Palo Alto, CA)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- Amazon (Cupertino, CA)
- …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
- pony.ai (Fremont, CA)
- …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony. ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
- Palo Alto Networks (Santa Clara, CA)
- …collaborate with internal teams to tackle challenging security problems. As a senior AI / ML -powered cloud security product manager, you will be instrumental in ... office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind...abreast of the latest trends in cloud security and AI / ML + Define product requirements and manage… more
- Amazon (Seattle, WA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * would with ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime , and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - 5+ years… more
- Amazon (Cupertino, CA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime , and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - Bachelor's degree… more
- HP Inc. (Vancouver, WA)
- …+ Engage deeply with technical leaders to understand HP's capabilities across AI / ML , compute systems, runtime optimization, model compression, device SW, ... hands-on experience in AI / ML , software engineering, data systems, runtime optimization, model compression, or adjacent technical domains + Ability to… more
- Amazon (Cupertino, CA)
- …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- Google (Sunnyvale, CA)
- Software Engineer III, AI / ML , TorchTPU _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and mentoring ... ML field. + 1 year of experience with ML infrastructure (eg, model deployment, model...Experience in compiler design and development, or related language runtime environments. **Preferred qualifications:** + Master's degree or PhD… more
- Google (Sunnyvale, CA)
- Senior Software Engineer, AI / ML , TPU E2E _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and mentoring ... ML field. + 3 years of experience with ML infrastructure (eg, model deployment, model...push technology forward. Our team builds the compiler and runtime which enables TPUs, Google's in-house custom designed processor,… more