- Microsoft Corporation (Mountain View, CA)
- …are excited about investigating and implementing cutting-edge large language model ( LLM ) inference techniques and optimizations like quantized KV-caches, ... Research Internships at Microsoft provide a dynamic environment...at Microsoft Azure and contribute to a production-focused, planetary-scale LLM serving stack that is being built on top… more
- Meta (Menlo Park, CA)
- …on GPU, CPU, accelerators-Vertical performance optimization for models for training and inference -Contribute to the research direction of AI for GPU programming, ... more about our research , visit https://ai.facebook.com. **Required Skills:** Research Scientist Intern , PyTorch Core (PhD) Responsibilities: 1. Contribute… more