  • Software Engineer, Systems ML…

    Meta (Menlo Park, CA)
    …and inference). 5. Conduct cutting-edge research on ML compilers and ML distributed technologies. 6. Collaborate with users of PyTorch to enable new use ... the entire community. 3. Explore the intersection of the PyTorch compiler and PyTorch distributed ... in GPU performance and writing high-performance CUDA kernels. 14. Research and software engineer experience demonstrated via… more
    Meta (11/07/24)
  • Research Scientist Intern, PyTorch

    Meta (Menlo Park, CA)
    PyTorch 2.0 technologies that bring torch.compile() to the heart of PyTorch - Advance PyTorch Distributed through torch.compile() - Improve PyTorch ... performance in general 6. Explore the intersection of PyTorch compiler and PyTorch distributed ... algorithms, MCTS, AI Agents, LLM performance, Reinforcement learning 16. Research or software engineer experience demonstrated via… more
    Meta (12/11/24)
  • Software Engineering Manager, PyTorch

    Meta (Menlo Park, CA)
    …is still a long and exciting technical roadmap going forward. We seek an Engineering Manager to drive subcomponents of the PyTorch Compiler development and holistic ... **Summary:** The PyTorch compiler team is the driving force behind ... distributed training. 14. Knowledge of ML model architecture research landscape **Public Compensation:** $177,000/year to $251,000/year + bonus… more
    Meta (10/18/24)
  • Principal Engineer, Distributed

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Principal Engineer to join our Distributed Machine Learning team focused on GPU accelerated Apache Spark. Data scientists often apply ... the model training, some libraries (e.g., XGBoost, RAPIDS cuML, PyTorch, and TensorFlow) have been extended for distributed ... (or equivalent experience). + 12+ years of work or research experience in software development. + 5+ experience as… more
    NVIDIA (11/27/24)
  • Research Engineer - Conversational…

    Meta (Menlo Park, CA)
    **Summary:** Reality Labs is seeking a Research Engineer to join our Large Language Model (LLM) Research team for the device driven AI Assistant effort. We ... produce SOTA research and results. **Required Skills:** Research Engineer - Conversational AI - Reality ... learning methods to best exploit modern parallel environments (e.g., distributed clusters, multicore SMP, and GPU). 5. Work with… more
    Meta (11/09/24)
  • Research Engineer, Language…

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Research Engineer to join our Large Language Model (LLM) Research team. We conduct focused research and engineering to ... training and inference; and/or multilingual and multimodal modeling. **Required Skills:** Research Engineer, Language - Generative AI Responsibilities: 1. Design… more
    Meta (10/18/24)
  • Research Engineer, Language - OCR

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Research Engineer to join our Optical Character Recognition (OCR) team. We conduct focused research and engineering to build ... a background in OCR, CV or NLP. **Required Skills:** Research Engineer, Language - OCR Responsibilities: 1. ... learning methods to best exploit modern parallel environments (e.g., distributed clusters, multicore SMP, and GPU). 5. Work with… more
    Meta (11/23/24)
  • Senior Researcher Engineer - Fujitsu…

    Fujitsu (Santa Clara, CA)
    …innovation. Fujitsu Research of America (FRA) is inviting applications for a research engineer member position in AI Labs. The selected candidate will work ... on machine learning, human-computer interaction, or software engineering. We are seeking a Research Engineer to support AI Agent research and development… more
    Fujitsu (01/11/25)
  • Senior Site Reliability Engineer - AI…

    NVIDIA (Santa Clara, CA)
    …design and implementation of ground breaking GPU compute clusters that powers all AI research across NVIDIA. We seek an expert to build and operate these clusters at ... improvements and automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of how our systems… more
    NVIDIA (12/25/24)
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    …large-scale MLOps and AI infrastructure; + Proven experience designing and optimizing distributed training systems with frameworks like PyTorch, JAX, or ... NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for ... large-scale foundation model training in the Generalist Embodied Agent Research (GEAR) group. Our team is leading Project GR00T… more
    NVIDIA (12/07/24)
  • Research Engineer, SysML - FAIR

    Meta (Menlo Park, CA)
    …FlashAttention, training performance acceleration through hardware-software co-design. **Required Skills:** Research Engineer, SysML - FAIR Responsibilities: 1. ... **Summary:** Meta is seeking Research Engineers to join Fundamental AI Research ... design principles. Some aspects of this role include enabling distributed training at an unprecedented scale through advancements and… more
    Meta (01/08/25)
  • Senior HPC and AI Networking Performance…

    NVIDIA (Santa Clara, CA)
    …NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our Performance group. In this ... and analyze AI workloads on large GPUs and CPUs scale clusters for distributed Deep Learning LLM training focused on collectives communication and networking. You… more
    NVIDIA (12/07/24)
  • Research Engineer, Computer Vision…

    Meta (Menlo Park, CA)
    **Summary:** We are looking to hire an experienced Research Engineer to help us build evaluation solutions for GenAI Media models. Join a team of research ... to map offline evaluation to online metrics. **Required Skills:** Research Engineer, Computer Vision - GenAI Evaluations ... automated solutions 3. Work with a large and globally distributed team across multiple functions to understand the needs… more
    Meta (01/08/25)
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in large-scale reinforcement learning and policy learning in the Generalist Embodied Agent ... Research (GEAR) group. Our team is leading Project GR00T ... Exceptional engineering skills in building, testing, and maintaining scalable distributed GPU training frameworks; + Proficiency in Python. Hands-on… more
    NVIDIA (12/07/24)
  • Research Scientist / Engineer

    Bosch (Sunnyvale, CA)
    **Company Description** The Bosch Research and Technology Center North America with offices in Sunnyvale, California, Pittsburgh, Pennsylvania, and Cambridge, ... portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated ... take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different… more
    Bosch (12/04/24)
  • Senior Software Engineer - Automated…

    NVIDIA (Santa Clara, CA)
    The PyTorch Team @ NVIDIA is hiring passionate parallel programmers. Join us to design and build the tools used by millions of AI practitioners deploying AI ... of best in class experience on NVIDIA's hardware with PyTorch. Join our team and collaborate with many multi-disciplinary ... Deep Learning models coming out of academic and industry research, for NVIDIA GPUs and systems. + Collaborating with… more
    NVIDIA (12/20/24)
  • Senior AI Platform Engineer

    NVIDIA (Santa Clara, CA)
    …on GPUs and specialized AI hardware. We are looking for an AI Platform Engineer with expertise in AI tools, frameworks, and hardware. The ideal candidate will have ... experience with AI workloads, particularly using PyTorch, C++, and be skilled in debugging and optimizing ... efficiency and performance. + Collaborate with internal teams, including research, engineering, and data science, to identify areas for… more
    NVIDIA (11/16/24)
  • Principal Engineer for AI Software…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Principal Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are seeking a Principal Software ... Engineer to lead the development of AI software resiliency ... successful integration of resiliency features into AI frameworks like PyTorch and JAX/XLA. + Customer Engagement: Collaborate directly with… more
    NVIDIA (11/23/24)
  • Sr. Machine Learning Engineer, Amazon…

    Amazon (Sunnyvale, CA)
    …learning initiatives. We leverage advanced hardware, innovative software architectures, and distributed computing techniques to enable breakthrough research and ... the company. We are seeking a Senior Machine Learning Engineer to join our team and lead the development ... authority on technical issues by both the technical and research community, you are responsible for guiding difficult trade-off… more
    Amazon (12/23/24)
  • (USA) Senior, Software Engineer - Gen AI

    Walmart (Sunnyvale, CA)
    …to redefine customer experiences. We are seeking a **Senior Machine Learning Engineer** with deep expertise in **Generative AI**, **LLMs**, and scalable ... impactful solutions. **Role Summary:** We are seeking a **Senior Machine Learning Engineer** to design and deploy AI systems using **LLMs**, **RAG frameworks**… more
    Walmart (12/19/24)