• Production Systems Engineer

    Meta (Menlo Park, CA)
    production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer, AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI…
    Meta (07/19/24)
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production. **Required Skills:** Production Systems Engineer, Fleet AI Systems (NetZero) Responsibilities: 1. ... **Summary:** Meta is seeking a Production Systems Engineer to...full system technologies, full system lifecycle 16. Experience supporting AI/HPC systems and/or related components at scale.…
    Meta (07/25/24)
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production. **Required Skills:** Production Systems Engineer, Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...life cycle for computer system products 17. Experience supporting AI/HPC systems and/or related components at scale…
    Meta (07/19/24)
  • Hardware Systems Engineer, NPI…

    Meta (Menlo Park, CA)
    …validation, supporting customer deployment, production issue triage. **Required Skills:** Hardware Systems Engineer, NPI AI Responsibilities: 1. Lead and ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI…
    Meta (07/19/24)
  • AI/HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network…
    Meta (08/01/24)
  • Software Engineer, SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations,…
    Meta (09/04/24)
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production. **Required Skills:** Production Systems Engineer, Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI/HPC systems and/or related components at scale…
    Meta (07/19/24)
  • Production Engineer

    Meta (Menlo Park, CA)
    …the company. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network ... in Production Engineering, and we are always learning. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services which handle fleet…
    Meta (07/19/24)
  • Production Engineer

    Meta (Menlo Park, CA)
    …experts. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network ... **Summary:** Production Engineers at Meta are hybrid software/systems...we are always learning. This position is full-time. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services…
    Meta (07/19/24)
  • Network Production Engineer

    Meta (Menlo Park, CA)
    …workloads that power new Meta products. **Required Skills:** Network Production Engineer, Network Infrastructure Responsibilities: 1. Conceiving, developing, ... products and experiences, and we are looking for seasoned Production Engineers who are interested in solving complex technical...and deploying systems and tools to keep the network running reliably…
    Meta (07/19/24)
  • Hardware Systems Engineer, RAS

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data ... technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with...go/no-go decisions and optimize these systems in production. **Required Skills:** Hardware Systems Engineer…
    Meta (08/21/24)
  • Software Engineer, Systems ML…

    Meta (Burlingame, CA)
    …software codesign for AI domain specific problems. **Required Skills:** Software Engineer, Systems ML - Frameworks / Compilers / Kernels Responsibilities: 1. ... to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc. 5. Performance tuning and…
    Meta (09/18/24)
  • Applied AI Research Scientist…

    Meta (Menlo Park, CA)
    …real-world problems. We also own Meta's central reinforcement learning platform, ReAgent (reagent.ai), which is also open-sourced as the production ready ... help us with a vision to bring reinforcement learning to large scale real-world systems. Our research scientists have an opportunity to work with top researchers in…
    Meta (07/19/24)
  • Senior Product Manager - AI Assisted…

    Intuit (Mountain View, CA)
    …and GraphQL. We are on a journey to enable hyper-scale on both our systems and our development processes and are making fundamental technological shifts that will ... solve big customer needs, specifically in large scale distributed systems and platforms + Excellent understanding of SDLC and...+ Product development and technical background, either as an engineer or in a previous technical role with prior…
    Intuit (08/28/24)
  • Hardware Systems Engineer, NPI

    Meta (Menlo Park, CA)
    …data centers, and track the health and lifecycle of servers in production. **Required Skills:** Hardware Systems Engineer, NPI Responsibilities: 1. ... **Summary:** Meta is seeking an engineer to join our Release to Production...ARM), storage (SAS, NVMe), device management (OOBM, I2C, I3C), AI-ML hardware (GPUs & TPUs), data center network hardware…
    Meta (07/19/24)
  • Software Engineer, Systems ML - HPC

    Meta (Burlingame, CA)
    **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI ... We are hiring in multiple locations. **Required Skills:** Software Engineer, Systems ML - HPC Responsibilities: 1....Systems ML - HPC Responsibilities: 1. Apply relevant AI and machine learning techniques to build & optimize…
    Meta (07/19/24)
  • Senior Data Engineer

    Plato Systems (San Francisco, CA)
    Plato Systems is pioneering the use of spatial AI to boost capacity, productivity, and safety across physical operations, starting with manufacturing. ... 2019, Plato has developed an Operations Digital Twin and AI copilot fueled by our proprietary hardware that uses...find out more about us by visiting our website (https://www.plato.systems/). We are seeking a Senior Data Engineer…
    Plato Systems (08/02/24)
  • Laser Systems Control Engineer

    Lightmatter (Mountain View, CA)
    …photonics-accelerated computers, consider joining the team at Lightmatter! We are hiring a Laser Systems Control Engineer to join our team. In this role, you ... will be responsible for bringup, configuration, and stabilization of multiwavelength laser systems for datacenter AI communications. You will collaborate closely…
    Lightmatter (08/30/24)
  • Staff Software Engineer, Systems

    LinkedIn (Mountain View, CA)
    …delivery platform, compute platform, massively scalable data storage and replication systems, cutting-edge search platform, best-in-class AI platform, ... to use your passion for distributed technologies and algorithms, API design and systems-design, and your passion for writing code that performs at an extreme scale.…
    LinkedIn (06/26/24)
  • MLOps Engineer

    Stanford Health Care (Palo Alto, CA)
    …operational services. This group is tasked to innovate, build, deploy and monitor production grade AI, machine learning (ML) and predictive algorithms into ... + Design, build and maintain scalable and robust infrastructure for AI/ML systems, including cloud-based environments, containerization and orchestration…
    Stanford Health Care (08/08/24)