- Microsoft Corporation (Mountain View, CA)
- …are excited about investigating and implementing cutting-edge large language model ( LLM ) inference techniques and optimizations like quantized KV-caches, ... Research Internships at Microsoft provide a dynamic environment...at Microsoft Azure and contribute to a production-focused, planetary-scale LLM serving stack that is being built on top… more
- Meta (Menlo Park, CA)
- … research , visit https://ai.facebook.com. **Required Skills:** Research Scientist Intern , Systems ML - SW/HW Co-Design - Inference Responsibilities: 1. ... optimized and suited for the hardware infrastructure.Meta is seeking Research Scientist Interns to join our AI & Systems...across HW types (GPUs, ASICs), workload types (Recommendation Models, LLM , LDM) and workloads (Training & Inference ).… more
- IBM (Yorktown Heights, NY)
- …system schools. The candidate will be responsible for conducting cutting-edge research on natural language processing, developing prototype solutions to real-world ... with distributed programming skills, stays on top of AI research and will be responsible for wide spectrum of...top AI conferences + Experience with distributed training and inference of large language models About Business UnitResearch is… more
- IBM (Yorktown Heights, NY)
- …tasks. We are looking for interns with skills and tasks of interest include: + [ LLM for code generation] Research for effective use of foundational models for ... for improving tasks such as data discovery and automated text-to-sql + [FM Inference ] Research for improving foundation models inference in terms of both… more
- Meta (Sunnyvale, CA)
- …and we have various start dates throughout the year. **Required Skills:** Research Scientist Intern , On-Device AI Model Optimization (PhD) Responsibilities: 1. ... on CNN, LLM and multimodal models 3. Collaborate with other research scientists and software engineers to develop innovative deep learning techniques, such as… more
- Microsoft Corporation (Baltimore, MD)
- …+ Grounding and attribution within BizChat responses. + Large language model ( LLM ) inference cost and latency reduction. + Model-assisted data collection ... Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized… more