- Meta (Menlo Park, CA)
- … training and inference services in the world! **Required Skills:** AI Capacity Engineering & Analysis Responsibilities: 1. Performance and Capacity ... and engineering to perform cutting-edge technologies investigation and cost analysis for AI 6. Identify AI capacity -related issues proactively and… more
- Meta (Menlo Park, CA)
- …of complex AI training and inference workloads across a set of AI platforms. In close collaboration with engineering and cross functional partners this ... all Meta's products and services. **Required Skills:** Technical Program Manager, AI Accelerator Software Responsibilities: 1. Collaborate with Engineering and… more
- Walmart (Sunnyvale, CA)
- …agentic AI systems** that can autonomously handle complex reliability engineering workflows, predictive failure analysis , and self-optimization across all ... centers. **What you'll do ** Join Walmart Global Tech's Site Reliability Engineering organization as a Distinguished AI /ML Engineer to architect revolutionary… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Performance & Capacity Engineer to join the Capacity Engineering & Analysis team to focus on site-wide performance and ... and cutting edge technology including some of the largest AI infrastructure builds in the world. Help build one...15. Practical experience and demonstrated success in performance or capacity engineering 16. Experience with planning and… more
- Meta (Menlo Park, CA)
- …covering a range of areas (Data Center, Network, Hardware Systems, Infrastructure Engineering , Software Engineering , Capacity Management) and across multiple ... **Summary:** This position will play a critical role in driving end-to-end AI product introductions and AI operations initiatives supporting Meta's growing AI… more
- Google (Sunnyvale, CA)
- …planning function for Google's third-party data center delivery. You will be scaling Google's AI and Cloud compute capacity to meet projected demands, which is ... Strategic Capacity Planning Lead, Third Party Data Centers _corporate_fare_...reports. **Preferred qualifications:** + MBA or Master's degree in Engineering , Finance, Economics, or a related quantitative field. +… more
- Cisco (Milpitas, CA)
- … Canvas and Assistants. You will operate at the intersection of AI /ML engineering , platform architecture, and developer experience design, collaborating closely ... , systems architecture, or platform development. * 3+ years of experience in software engineering , with recent exposure to AI assistants. * Expertise in building… more
- Microsoft Corporation (Mountain View, CA)
- …models to develop novel responsible and efficient artificial general intelligence, large compute- capacity is required, and as an AI Networking Engineer, you'll ... **Overview** Microsoft AI is hiring a Member of Technical Staff,...network modeling, and scaling strategy + Telemetry, observability, reliability engineering , and automated troubleshooting + Develop and tune the… more
- Meta (Menlo Park, CA)
- …projects and handle ambiguity. **Required Skills:** Product Marketing Manager, Business AI Responsibilities: 1. Lead market assessment, quantitative analysis , ... models have over 1.2B downloads to date, and Meta AI now has more than 500M monthly actives. Meta...requirements of the market internally with Product Management and Engineering 2. Understand advertiser needs/requirements and product risk areas… more
- Meta (Fremont, CA)
- …cost and technology perspective, with millions of servers, gigawatts of data center capacity , and cutting edge technology including advanced AI .If you have ... across Infra - platform teams, server hardware engineering , network engineering , data center engineering , operations, capacity planning, networking… more
- NVIDIA (Santa Clara, CA)
- …experience, including 7+ years leading and developing TPM teams in infrastructure, AI /ML, or platform engineering domains. + Demonstrated success in implementing ... The DGX Cloud organization builds and operates the AI infrastructure that makes this innovation possible. We...analysis . + Bachelor or Master in Computer Science, Engineering , or related field (or equivalent experience). Ways to… more
- General Motors (Sunnyvale, CA)
- …tools for monitoring capacity and demand signals + Partner with AI /ML engineering , infrastructure engineering , and cloud vendors to proactively ... looking for a Senior Capacity and Performance Engineer to join the AV Capacity and Performance Engineering team to support our critical efforts in developing… more
- NVIDIA (Santa Clara, CA)
- …graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that ... Make the choice to join us today! As a member of the GPU AI /HPC Infrastructure team, you will provide leadership in the design and implementation of ground… more
- Amazon (Cupertino, CA)
- Description Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the world's most advanced cloud for AI training and ... deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing the limits… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will design, ... efficiency and availability using the combination of software and systems engineering practices. This role demands knowledge across different systems, networking,… more
- NVIDIA (Santa Clara, CA)
- We are seeking a AI Infrastructure Engineer to integrate third-party infrastructure partners into NVIDIA's operational excellence programs. This cross-functional ... as well as managing vendor relationships. You will partner with engineering , SRE, product, and third-party infrastructure providers to achieve operational… more
- Intuit (Mountain View, CA)
- **Overview** **Job Overview** **Intuit is seeking an innovative and hands-on** **Senior AI Scientist** **to join the** **Ops Science team** **. This team is at the ... scheduling challenges. You will work to predict demand, optimize workforce capacity , and enable real-time decision-making that directly impacts hundreds of thousands… more
- Palo Alto Networks (Santa Clara, CA)
- …and stakeholder management skills. + Experience introducing innovative approaches (eg, ML/ AI -driven performance analysis , chaos engineering ). **Why Join ... challenges with a talented global team. + Drive innovation in performance engineering , automation, and AI -driven insights. + Collaborate with leaders across… more
- Google (Mountain View, CA)
- …Intelligence. + Experience in experimental design (eg, A/B, multivariate) and incremental analysis . **About the job** Site Reliability Engineering (SRE) combines ... Site Reliability Manager, Site Reliability Engineering _corporate_fare_ Google _place_ Mountain View, CA, USA...SRE's will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses… more
- ThermoFisher Scientific (Santa Clara, CA)
- …and quality standards into the design. **Process improvement:** + Apply data analysis , Lean and Six Sigma methodologies, and deep-dive investigation to identify and ... parties. **Cross-functional collaboration:** + Collaborate with various departments, including engineering , manufacturing, product development, and supply chain, to ensure… more