- HP Inc. (Palo Alto, CA)
- …+ Engage deeply with technical leaders to understand HP's capabilities across AI / ML , compute systems, runtime optimization, model compression, device SW, ... hands-on experience in AI / ML , software engineering, data systems, runtime optimization, model compression, or adjacent technical domains + Ability to… more
- Broadcom (Palo Alto, CA)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- pony.ai (Fremont, CA)
- …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony. ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
- Palo Alto Networks (Santa Clara, CA)
- …collaborate with internal teams to tackle challenging security problems. As a senior AI / ML -powered cloud security product manager, you will be instrumental in ... office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind...abreast of the latest trends in cloud security and AI / ML + Define product requirements and manage… more
- Google (Sunnyvale, CA)
- Software Engineer III, AI / ML , TorchTPU _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and mentoring ... ML field. + 1 year of experience with ML infrastructure (eg, model deployment, model...Experience in compiler design and development, or related language runtime environments. **Preferred qualifications:** + Master's degree or PhD… more
- Google (Sunnyvale, CA)
- Senior Software Engineer, AI / ML , TPU E2E _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and mentoring ... ML field. + 3 years of experience with ML infrastructure (eg, model deployment, model...push technology forward. Our team builds the compiler and runtime which enables TPUs, Google's in-house custom designed processor,… more
- Meta (Menlo Park, CA)
- …strategy that delivers a highly flexible platform to train & serve new DL/ ML model architectures, combined with auto-tuned high performance for production ... be working on one of the core areas such as PyTorch framework components, AI compiler and runtime , high-performance kernels and tooling to accelerate machine… more
- Microsoft Corporation (Mountain View, CA)
- …Grok. + Experience either shipping applied research to production with coding and AI model development skills, OR working with LLM deployment, orchestration ... II and Senior Applied Scientist (Multiple Positions) - PowerPoint ML Team, Office Product Group** Are you an applied...an applied scientist with a passion for applying state-of-the-art AI innovations to augment human creativity? Join us as… more
- Meta (Sunnyvale, CA)
- …machine learning accelerators 2. Work with algorithm research teams to map ML graphs to hardware implementations, model data-flows, create cost-benefit analysis ... for custom hardware accelerator blocks. **Required Skills:** Software Engineer, Systems ML - Compilers Responsibilities: 1. Analyze and design effective compiler… more
- Meta (Menlo Park, CA)
- …via concurrent design and optimization of many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network ... sustained scaling and hardware efficiency during training and inference. 3. Benchmark, analyze, model and project the performance of AI workloads against a wide… more
- Palo Alto Networks (Santa Clara, CA)
- …comprehensive platform for AI security, encompassing model security, posture management, AI red teaming, and runtime protection. As the Director AI ... to derive new insights, identify emerging threat trends, and build novel AI / ML -driven detection methodologies. + Thought Leadership: Serve as a primary… more
- NVIDIA (Santa Clara, CA)
- …several software ecosystem projects with external partners, technical background in AI / ML systems, fundamentals of computer systems architectures (ISAs) ... researchers and developers. Focus will be on accelerating GenAI model training and inference, which is making a major...and the peer NVIDIA team(s) + Proven understanding of AI / ML software ecosystem and GPU acceleration libraries… more
- Broadcom (Palo Alto, CA)
- …control plane that automates the lifecycle of AI Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent ... Kubernetes Platform Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...key to building a best in class private cloud AI platform. You will have a high impact by… more
- Google (Mountain View, CA)
- …experience with software design and architecture. + 3 years of experience with ML infrastructure (eg, model deployment, model evaluation, optimization, data ... to crafting a GenAI capability by fully harnessing the potential of Large Language Model (LLMs) and other foundational AI models running directly on mobile… more
- Palo Alto Networks (Santa Clara, CA)
- …research on cutting-edge areas, including AI -Native Security (LLM, AI Agent, Model Supply-Chain, Runtime AI ) and the broader LLM ecosystem security. ... response,Prisma AIRS delivers model security, posture management, AI red teaming, and runtime protection. Our...architectural design of a highly scalable, low-latency, and resilient ML inference platform capable of serving a diverse range… more
- Palo Alto Networks (Santa Clara, CA)
- …address. In response, Prisma AIRS delivers model security, posture management, AI red teaming, and runtime protection. Our customers can confidently deploy ... office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind...for the architectural design and long-term strategy of our AI platform - ML inference. Beyond individual… more
- MongoDB (Palo Alto, CA)
- …routing, and model health monitoring + Collaborate with peers across ML , infra, and product teams to define architectural patterns and operational practices that ... model serving architecture using tools like vLLM, ONNX Runtime , and container orchestration in Kubernetes + Provide technical...world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge… more
- Google (Sunnyvale, CA)
- …and subsystems. + Collaborate with partners in Hardware Design, Software, Compiler, ML Model and Research teams for hardware/software co-design. + Work ... this role, you'll work to shape the future of AI / ML hardware acceleration. You will have an...extensions and memory layouts, by leveraging existing compiler and runtime stacks, and develop transaction-level models for early performance… more
- Palo Alto Networks (Santa Clara, CA)
- …platform for AI security, encompassing model security, posture management, AI red teaming, and runtime protection. As the Director of Engineering, you ... for AI security, encompassing model security, posture management, AI red teaming, and runtime protection. **Compensation Disclosure** The compensation… more
- Palo Alto Networks (Santa Clara, CA)
- …Prisma AIRS leads the industry in advanced AI Security Capabilities, including Runtime Security, Model Scanning and Red Teaming. As a Site Reliability ... directly in an engineering team, enabling deep collaboration with Software Engineers, AI / ML Researchers, Architects, and Product Managers. You will have a… more