- Zyphra Technologies Inc. (Palo Alto, CA)
- …intelligence company based in Palo Alto, California. The Role: As a Research Engineer - Audio & Speech Models , you will be a core contributor on ... Zyphra's Audio Team, building the next generation of open-source text-to- speech and audio models. You will be deeply involved in the entire model training… more
- Zyphra Technologies Inc. (Palo Alto, CA)
- A cutting-edge AI firm in Palo Alto is seeking a Research Engineer specializing in audio and speech models. You will contribute to the development of ... advanced open-source audio technologies. Responsibilities include data processing, model training, and performance optimization. Ideal candidates possess strong … more
- David AI (New York, NY)
- …stakeholders to productionalize the latest research insights. About this role As a Founding Audio AI Research Engineer , you'll define the research ... build comprehensive evaluation frameworks for audio AI capabilities across speech , emotion, conversation dynamics, and acoustic patterns. Research and… more
- Liquid AI (San Francisco, CA)
- …like DeepSpeed, FSDP, or Megatron-LM You've worked with multimodal data (eg audio , text, image, video) You've contributed to research papers, open-source ... If: You have experience with machine learning at scale You have worked with audio models and understand the effects of architecture choices on runtime, latency, and… more
- Disney (San Francisco, CA)
- Sr Audio Digital Signal Processing Engineer The Skywalker Sound Development Group is seeking an innovative and experienced Audio Digital Signal Processing ... Engineer to develop and implement advanced audio processing solutions that integrate seamlessly into modern media...work will focus on enabling high-quality, efficient, and reliable audio workflows for applications such as speech … more
- Sesame (San Francisco, CA)
- … System Engineer to define, tune, evaluate, and ship high‑quality speech reproduction systems and spatialized audio capture solutions. Responsibilities Own ... transducer requirements to ensure product success. Design, analyze, and refine real‑time audio capture and render systems. Research and implement advanced… more
- Apple Inc. (Cupertino, CA)
- Sr. Machine Learning Engineer , Siri Speech Cupertino, California, United States Machine Learning and AI The Speech Team within the Siri organization drives ... research anchored in Apple‑specific production needs, while improving speech interaction experiences for Apple's customers around the world. Description You… more
- Trades Workforce Solutions (San Francisco, CA)
- …language switching, and accurate pronunciation across languages and accents. As a Staff Research Engineer , you'll work hands‑on to expand their foundation models ... that balance speed with control. What you'll do Conduct research to advance core speech models and...systems What you'll bring 3+ years of experience in speech synthesis, audio generation, or generative modeling… more
- Trades Workforce Solutions (San Francisco, CA)
- A speech AI technology firm in San Francisco seeks a Staff Research Engineer to advance core speech models and develop new architectures. The ideal ... candidate has over 3 years of experience in speech synthesis and audio generation, with a solid background in model training. This role offers up to $250K base… more
- David AI (San Francisco, CA)
- About David AI David AI is the first audio data research company. We bring an R&D approach to data-developing datasets with the same rigor AI labs bring to ... bring AI into the real world, and we believe audio is the gateway. Speech is versatile,...machine learning experts focused on building the world's first audio data research company. We move fast,… more
- David AI (San Francisco, CA)
- About David AI David AI is the first audio data research company. We bring an R&D approach to data-developing datasets with the same rigor AI labs bring to ... bring AI into the real world, and we believe audio is the gateway. Speech is versatile,...machine learning experts focused on building the world's first audio data research company. We move fast,… more
- Cartesia (San Francisco, CA)
- …even the best models can continuously process and reason over a year-long stream of audio , video and text-1B text tokens, 10B audio tokens and 1T video ... where you will solve critical data bottlenecks and directly accelerate our research progress. What you'll do Evaluate fidelity, diversity, and usefulness of… more
- Recruiting From Scratch (San Francisco, CA)
- …1 year of prior software development experience before transitioning into ML. Experience with speech or audio research , particularly in applying ML models to ... the best candidate for our clients. Applied Machine Learning Engineer - Audio AI Location: San Francisco,...and models for collecting and training on large-scale, high-fidelity speech and audio datasets across diverse languages… more
- Cartesia (San Francisco, CA)
- …even the best models can continuously process and reason over a year-long stream of audio , video and text-1B text tokens, 10B audio tokens and 1T video ... build large-scale multilingual datasets for model training. Build evaluations of multilingual speech models, both via manual annotation and at scale with automated… more
- Decagon AI, Inc. (San Francisco, CA)
- …conversations across our omnichannels (phone, web, mobile, etc). We work across speech understanding, audio streaming, synthesis quality, and the voice‑specific ... to end. About the Role As a Senior Software Engineer focused on Voice Agent, you will design and..., networking, or real time pipelines An interest in speech , audio , or multimodal AI Even better… more
- Recruiting From Scratch (San Francisco, CA)
- …the Voice Agent group-a technically challenging surface responsible for real-time speech understanding, audio streaming, synthesis quality, and voice-specific ... measure and iterate on voice performance. Partner closely with Research to integrate new speech models and...Experience with automatic speech recognition (ASR) or text-to- speech (TTS) systems. Familiarity with VAD, audio … more
- Sanas (Palo Alto, CA)
- …Deep experience with ML tooling for training and serving models, ideally in audio or speech domains (eg, PyTorch, ONNX, Hugging Face Transformers, torchaudio). ... deep industry experience, Sanas has developed the world's first real-time speech transformation platform capable of accent translation, noise elimination, speech… more
- Mursion, Inc. (San Francisco, CA)
- …evolving and fast-paced startup environment. About the role As a Senior AI Engineer for Real-Time Conversational Agents, you will play a crucial role in developing ... conversational agents. You'll work primarily with large language models (LLMs), automatic speech recognition (ASR), text-to- speech (TTS), and emerging speech… more
- OpenAI (San Francisco, CA)
- …multimodal models. Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs. Enable experimental research workflows to ... and scalable in production, and we partner closely with Research to bring the next generation of models into...the infrastructure needed to serve models that handle image, audio , and other non-text modalities. These workloads are inherently… more
- Sanas (Palo Alto, CA)
- …deep industry experience, Sanas has developed the world's first real-time speech transformation platform capable of accent translation, noise elimination, speech ... algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track… more