- Meta (Menlo Park, CA)
- …platforms, all the way to mass production and deployment. **Required Skills:** Production Systems Engineer, AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...Inference Accelerator (MTIA) program as a part of the AI/ML initiatives supporting large scale AI Training…
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network…
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network…
- Capital One (San Francisco, CA)
- …to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. + Contribute to the technical vision and ... Street (61049), United States of America, San Francisco, California AI Engineer - Sr. Manager Level (IC)...latest AI research and AI systems, and judiciously apply novel techniques in production…
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Partner Engineer to join Meta's AI Partner Engineering team, a highly technical team that works with strategic partners, machine ... evangelize Meta's AI design patterns and best practices. **Required Skills:** Partner Engineer, Generative AI Responsibilities: 1. Apply relevant AI and…
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an AI - Engineer to join our Fundamental AI Research Organization (FAIR). Through specialized and domain-specific industry ... We are hiring in multiple locations. **Required Skills:** Research Engineer, AI Specialist - FAIR Responsibilities: 1....products and experiences 2. Develop novel, accurate, and safe AI algorithms and advanced systems for large…
- Amazon (San Francisco, CA)
- …team, you'll be instrumental in transforming cutting-edge research into high-performance production systems. You'll collaborate directly with scientists to ... Peter Chen to make breakthrough foundation models run at production scale. As a Senior Machine Learning Engineer...We tackle some of the most challenging problems in AI and robotics, from developing sophisticated perception systems…
- Meta (Fremont, CA)
- …world class manufacturing and quality processes for the next generation of disruptive AI hardware products **Required Skills:** Product Quality Engineer, AI / ... a comprehensive product quality strategy and process for next-generation AI hardware systems, ensuring timely delivery and...action for failures that occur during server integration, rack production and in the Data Center 5. Early engagement…
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production. **Required Skills:** Production Systems Engineer, Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI/HPC systems and/or related components at scale…
- Meta (Menlo Park, CA)
- …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations,…
- Google (San Bruno, CA)
- …and architecture. + 5 years of experience building and deploying recommendation systems models (retrieval, prediction, ranking, embedding) in production and ... on and is growing every day. As a software engineer, you will work on a specific project critical...engineers. + Lead the design and implementation of recommendation systems, optimize ML infrastructure, and guide the development of…
- Meta (Menlo Park, CA)
- …network products that enable networking for AI training and Inference. A Network Production Engineer in this role would support leading Meta's server fleet ... our server network communication stack. **Required Skills:** Network Production Engineer Responsibilities: 1. Conceive, develop, and deploy systems and tools…
- Meta (Menlo Park, CA)
- …the company. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network ... in Production Engineering, and we are always learning. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services which handle fleet…
- Meta (Menlo Park, CA)
- …experts. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network ... **Summary:** Production Engineers at Meta are hybrid software/systems...we are always learning. This position is full-time. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services…
- Meta (Menlo Park, CA)
- …workloads that power new Meta products. **Required Skills:** Network Production Engineer, Network Infrastructure Responsibilities: 1. Conceiving, developing, ... products and experiences, and we are looking for seasoned Production Engineers who are interested in solving complex technical...and deploying systems and tools to keep the network running reliably…
- Meta (Menlo Park, CA)
- …apply, click "Apply to Job" online on this web page. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services which handle fleet ... Meta Ads, infrastructure components that drive Meta's advances in AI, core services which are used by every team...working on the coolest stuff around, the code and systems you work on will be in production…
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data ... technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with...go/no-go decisions and optimize these systems in production. **Required Skills:** Hardware Systems Engineer…
- Meta (Fremont, CA)
- …has either developed robust hardware system test plans, or validation planning as a systems engineer supporting AI/ML, compute, storage & network hardware in ... **Summary:** The Systems Integration Engineer (SIE) is responsible...to develop test plans for operational integration of future AI/ML, compute, storage and networking products into the existing…
- Meta (Menlo Park, CA)
- …This work is not theory or research, but the practical application of AI with aggressive production goals. We are building automation programs at global ... **Summary:** AI is the most dynamic and exciting technological...research, and market analysis into product requirements to improve engineer productivity and enhance user satisfaction. 10. Define and…
- Meta (Menlo Park, CA)
- …software codesign for AI domain specific problems. **Required Skills:** Software Engineer, Systems ML - Frameworks / Compilers / Kernels Responsibilities: 1. ... to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc. 5. Performance tuning and…