- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are seeking a highly-skilled Senior On- Device Model Inference Optimization Engineer to join our team ... you'll be doing: + Develop and implement strategies to optimize AI model inference for on- device deployment. + Employ techniques like pruning, quantization,… more
- Qualcomm (San Diego, CA)
- …+ Experience in LLM reasoning or inference acceleration research + Experience of on- device AI model production or exposure to mobile / edge device ... or methods. **Responsibilities** + Research and engineering on efficiency of LLM inference and decoding + Generation of creative solutions with consideration of… more
- LinkedIn (Mountain View, CA)
- …large models together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with hundreds of billions of parameters ... and Serving performance optimizations across billions of user queries Model Training Infrastructure: As an engineer on the AI...infra on top of native cloud, enable GPU based inference for a large variety of use cases, cuda… more
- Qualcomm (San Diego, CA)
- …of models on servers and porting to client Windows compute platforms involving model inference deployment and performance tuning * Proficiency in programming ... and operating system components available on Snapdragon. As a senior member of the team responsible for the AI...plug in their proprietary models to first test their model on Qualcomm hardware (such as Windows on Snapdragon)… more
- NVIDIA (Santa Clara, CA)
- …platform upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference ... NVIDIA optimizes and serves performant inferencing for every AI model . Our NIM offerings are easy to use, highly...working with system software and platform layers including Kernel, device driver, memory, storage, networking and PCIe devices +… more
- Qualcomm (Santa Clara, CA)
- …strategy both compute and data lake architecture requirements. + Defining scalable AI model optimization strategy for mission mode inference . + Managing the AI ... of how Wi-Fi platforms are architected, provisioned, and used. As a Senior AI Platform Product Manager for Qualcomm's Wireless Infrastructure Networking (WIN)… more