- Microsoft Corporation (Redmond, WA)
- …particularly by integrating multiple machine-learned components such as computer vision , speech recognition, dialogue handling, natural language generation, ... fields, including computing, healthcare, economics, and the environment. The **Interactive Multimodal Futures (IMF)** group at Microsoft Research seeks a PhD-level… more
- Meta (Bellevue, WA)
- …Interns to join our Meta Superintelligence Lab in the post-training modeling team across language , and multimodal . We are committed to advancing the field of ... passionate in areas such as generative modeling, deep learning, computer vision , natural language processing, machine learning, and reinforcement learning.… more
- Meta (Menlo Park, CA)
- …frontiers of multimodal pretraining, and have experience in developing novel language modeling architectures, vision - language modeling, multimodal ... in the general domain of neural scaling laws, model architectures, image/text modeling, vision - language modeling 6. Must obtain work authorization in the country… more
- Google (San Jose, CA)
- … ML frameworks such as JAX, TensorFlow, or PyTorch. + Experience with multimodal learning, large language models or AI agents. + Experience with prompt ... on Large Language Models (LLMs) and agents, particularly in the multimodal domain (eg, audio), focusing on developing more capable Artificial Intelligence (AI)… more
- Microsoft Corporation (Redmond, WA)
- …to the market. A lot of these experiences will be powered by speech, computer vision , natural language processing (NLP), and large language models (LLM) - ... will have the unique opportunity to develop machine learning ( ML ) algorithms and deep neural networks (DNN) models for...in a PhD or master's program in audio, computer vision , NLP, machine learning or closely related fields. **Other… more
- Wayfair (Boston, MA)
- …for Wayfair's Customer Service team. In this role, you will apply large language models (LLMs) and LLMops expertise to transform complex customer service challenges ... comprehensive experience and deep expertise in Generative AI and Natural Language Processing, strong problem-solving skills, and the ability to work effectively… more
- General Motors (San Francisco, CA)
- …You'll Do:** + Drive the development of embodied foundation models and vision - language -action architectures that unify multimodal perception with robotic ... high-level reasoning and physical execution. Your work will focus on advancing vision - language -action architectures to solve critical challenges in data mining… more
- Takeda Pharmaceuticals (Boston, MA)
- …ML Foundation team, you will build large-scale models including large language models (LLMs), diffusion models, and multimodal architectures that integrate ... molecular datasets. + Fine-tune and adapt pre-trained foundation models (protein language models, chemical LLMs, vision transformers) for Takeda-specific… more
- LinkedIn (Mountain View, CA)
- …ads, or live content, we are leading the way in developing state-of-the-art large vision language technologies. Candidates must be currently enrolled in a PhD ... to join our team as Generative AI Engineering Interns. As a part of our AI/ ML teams, you will work on advancing the frontier of Generative AI, applying cutting-edge… more
- General Motors (San Francisco, CA)
- …of advanced machine learning methods, such as foundation models, vision - language architectures, diffusion models, image/video generation, self-supervised ... learning techniques, especially deep learning architectures (eg, transformers, generative models, multimodal learning). + Proficiencyin Python and ML frameworks… more
- Modernizing Medicine (Boca Raton, FL)
- …AI/ ML Researchers at various levels to join our team. We are seeking AI/ ML Researchers to drive innovation in large language models (LLMs), AI agents, and ... + Autonomous AI Agents and multi-agent systems + Generative AI (text, vision , speech, or multimodal ) + Reinforcement learning and decision-making systems… more
- General Motors (Mountain View, CA)
- …ML validation frameworks and tools for autonomous vehicle software systems. + Leverage vision - language models (VLMs) and large language models (LLMs) to ... robotics, computer vision through projects or research. + Familiarity with multimodal learning or working with sensor data. + Interest in contributing to… more
- Oracle (Charleston, WV)
- …mission-driven team within one of the world's largest cloud platforms. The Multimodal AI team in OCI Applied Science is working on developing cutting-edge ... to disrupt industry verticals and push the state-of-the-art in Multimodal and Video GenAI research. You will work with...cloud infrastructure. - Strong proficiency in at least one language such as Go, Java, Python, or C++. -… more
- Amazon (San Francisco, CA)
- …What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich ... spans the full spectrum of robotics intelligence - from multimodal perception using images, videos, and sensor data, to...years of programming with at least one software programming language experience 5+ years of leading design or architecture… more
- Amazon (San Francisco, CA)
- …Experience with dataset curation and quality assessment techniques Knowledge of computer vision and multimodal data processing - Background in research ... What sets us apart is our unique combination of ambitious research vision and practical impact. We leverage Amazon's massive computational infrastructure and rich… more
- Microsoft Corporation (Mountain View, CA)
- …Deeply understand the pipeline of collecting data, training, evaluating, and serving language models and multimodal models. + Have experience working ... highly motivated and detail-oriented Technical Program Managers to help bring our vision to life. We are seeking outstanding individuals excited about contributing… more
- Meta (Menlo Park, CA)
- …be applied to Meta's product development. **Required Skills:** AI Research Scientist, VLM ( vision language models) Responsibilities: 1. Push state of the art in ... that pushes forward the state of the art in multimodal reasoning and generation research.*Work towards long-term ambitious research...understanding for AI Assistants 3. Mentor and work with AI/ ML engineers to find a path from research to… more
- Meta (Redmond, WA)
- …welcome candidates with expertise in embodied AI, reinforcement learning, planning, multimodal learning, vision - language models, LLM interpretability, world ... various start dates throughout the year. **Required Skills:** Research Scientist Intern, Vision - Language and Embodied AI (PhD) Responsibilities: 1. Plan and… more
- Amazon (Seattle, WA)
- …AAAI, or similar - Experience with Visual Language Models (VLMs), multimodal transformers, or vision - language pretraining - Experience with explainable ... product identity and relationships at unprecedented scale. Using Generative AI, Visual Language Models (VLMs), and multimodal reasoning, we determine what makes… more
- Bloomberg (New York, NY)
- Senior Language Modeling Research Engineer - Artificial Intelligence Location New York Business Area Engineering and CTO Ref # 10036231 **Description & ... solutions using technologies such as transformers, gradient boosted decision trees, large language models, and dense vector databases. We are expanding our group and… more