• Production Systems Engineer

    Meta (Menlo Park, CA)
    production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Develop ... **Summary:** Meta is seeking an experienced Production Systems Engineer to...life cycle for computer system products 15. Experience supporting AI /HPC systems , GPU or Silicon hardware, and/or… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems (NetZero) Responsibilities: 1. ... **Summary:** Meta is seeking a Production Systems Engineer to...full system technologies, full system lifecycle 16. Experience supporting AI /HPC systems and/or related components at scale.… more
    Meta (07/25/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...life cycle for computer system products 17. Experience supporting AI /HPC systems and/or related components at scale… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Hardware Systems Engineer , NPI…

    Meta (Menlo Park, CA)
    …validation, supporting customer deployment, production issue triage. **Required Skills:** Hardware Systems Engineer , NPI AI Responsibilities: 1. Lead and ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • AI /HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
    Meta (08/01/24)
    - Save Job - Related Jobs - Block Source
  • Engineer , Silicon Systems AI

    Micron Technology, Inc. (San Jose, CA)
    …or related field and 24 months of experience in the job offered or in an Engineer , Silicon Systems AI - related occupation. Position requires 24 months of ... System Design. 2. C/C++/Python in a Linux development environment. 3. AI model deployment in production environment. 4. AI algorithms to modify silicon… more
    Micron Technology, Inc. (06/24/24)
    - Save Job - Related Jobs - Block Source
  • Principal Applied AI Engineer

    Microsoft Corporation (San Francisco, CA)
    …of the Azure Data - Customer Advisory Team (CAT) and we're seeking a Principal Applied AI Engineer to lead some of our efforts. Our mission is to empower our ... to build the data platform for the age of AI , powering a new class of data-first applications and...shipping. + 3+ years experience developing and deploying live production systems , as part of a product… more
    Microsoft Corporation (09/05/24)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer , Security…

    Cisco (San Jose, CA)
    Who We Are The Cisco Security AI team delivers AI products and platform for all Cisco Secure products and portfolios so businesses around the world can defend ... customers secure by simplifying security with zero compromise using AI and Machine Learning. Who You Are You are...Learning. Who You Are You are a passionate software engineer with a proven track record of successfully developing… more
    Cisco (07/17/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, performance optimizations,… more
    Meta (09/04/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production . **Required Skills:** Production Systems Engineer , Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI /HPC systems and/or related components at scale… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …upcoming and existing platforms in the AI space. **Required Skills:** Production Systems Engineer , Tooling Responsibilities: 1. Enhance existing tooling ... **Summary:** Meta is seeking an experienced Production Systems Engineer to...Go). 6. 2+ years of experience in working with AI /HPC systems , including hardware and software components,… more
    Meta (07/24/24)
    - Save Job - Related Jobs - Block Source
  • Tech Lead - AI /ML Infrastructure…

    Cisco (San Jose, CA)
    …to build the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application. We are uniquely ... by harnessing the potential of open-source technologies while pushing the boundaries on Systems and Silicon Architecture. Who You'll Work With: You will be working… more
    Cisco (08/27/24)
    - Save Job - Related Jobs - Block Source
  • Software Dev Engineer II, Stores…

    Amazon (Palo Alto, CA)
    …teams, applied scientists, and product managers to define requirements for production systems , collaborate on science experimentation, and creating plan ... Description Stores Foundational AI (SFAI) is the team powering the Amazon...(design patterns, reliability and scaling) of new and existing systems experience - Experience programming with at least one… more
    Amazon (09/06/24)
    - Save Job - Related Jobs - Block Source
  • Customer Engineer II, AI /ML,…

    Google (San Francisco, CA)
    …Experience in architecting and developing software or infrastructure for scalable, distributed systems . + Experience in data and information management as it relates ... teams as a subject matter expert in Artificial Intelligence and Machine Learning ( AI /ML) to differentiate Google Cloud to our customers. You will help prospective… more
    Google (08/15/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineer

    Meta (Fremont, CA)
    …the company.Relevant industry experience is important (Software Engineer , Site Reliability Engineer (SRE), Systems Engineer , DevOps Engineer , Network ... in Production Engineering, and we are always learning. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services which handle fleet… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineer

    Meta (Fremont, CA)
    …experts. Relevant industry experience is important (Software Engineer , Site Reliability Engineer (SRE), Systems Engineer ,, DevOps Engineer , Network ... **Summary:** Production Engineers at Meta are hybrid software/ systems...we are always learning.This position is full-time. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Network Production Engineer

    Meta (Menlo Park, CA)
    … workloads that power new Meta products. **Required Skills:** Network Production Engineer , Network Infrastructure Responsibilities: 1. Conceiving, developing, ... products and experiences, and we are looking for seasoned Production Engineers who are interested in solving complex technical...and deploying systems and tools to keep the network running reliably… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineer

    Meta (Menlo Park, CA)
    …apply, click "Apply to Job" online on this web page. **Required Skills:** Production Engineer Responsibilities: 1. Develop, design, create, modify, and/or test ... defined procedures and practices to determine appropriate action. 7. Support of AI infrastructure systems (GPUs, AI models), and knowledge of how to… more
    Meta (08/18/24)
    - Save Job - Related Jobs - Block Source
  • Hardware Systems Engineer , RAS

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data ... technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with...go/no-go decisions and optimize these systems in production . **Required Skills:** Hardware Systems Engineer more
    Meta (08/21/24)
    - Save Job - Related Jobs - Block Source