📄 Market Snapshot: Spoken NLP Roles in 2026
The convergence of speech and natural language processing has created one of the hottest specializations in 2026. As LLMs move from text-only to native audio processing (like GPT-4o and Gemini), companies are desperate for engineers who can bridge the gap between raw audio and semantic meaning—handling everything from intent extraction to conversational AI.
Current Market Pulse
Hiring Demand
Very High. The explosion of multimodal AI has created unprecedented demand for engineers who understand both speech recognition and natural language understanding. Voice assistants are evolving from simple command-response systems to full conversational agents that need to understand context, intent, emotion, and nuance from spoken input.
Major hiring sectors include:
- Voice AI platforms: Building next-gen assistants (post-Alexa/Siri era)
- Contact centers: Intent extraction, sentiment analysis, conversation summarization
- Healthcare: Clinical documentation from doctor-patient conversations
- Automotive: Natural in-car conversation systems
- Enterprise: Meeting intelligence, action item extraction, search from audio
Top Skills
Experience with spoken language understanding (SLU), end-to-end audio-to-intent models, and NLP frameworks like Hugging Face Transformers or LangChain is essential. Specific skills in demand:
- Audio-to-intent pipelines: Building systems that go directly from speech to semantic understanding without intermediate text
- Conversational AI: Multi-turn dialogue management, context tracking, co-reference resolution
- Joint speech-text models: Working with architectures like Speech-LLaMA, Whisper + GPT, and multimodal transformers
- Slot filling and entity extraction: From spoken queries (not text)
- Emotion and sentiment detection: Understanding *how* something is said, not just what
- Disfluency handling: Dealing with "um," "uh," false starts, corrections in natural speech
- Spoken question answering: QA systems that operate on audio inputs
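To make the disfluency-handling item above concrete, here is a minimal rule-based sketch in Python. The filler list and the `remove_fillers` helper are illustrative assumptions, not a standard API; production systems typically use a trained disfluency tagger rather than a regex, but the idea is the same: strip filled pauses before downstream NLP sees the transcript.

```python
import re

# A short list of English filled pauses. Illustrative only; real systems
# learn disfluency spans (including false starts and self-corrections)
# from annotated speech data.
FILLERS = re.compile(r"(?:,\s*)?\b(?:um+|uh+|erm+|hmm+)\b[,.]?", re.IGNORECASE)

def remove_fillers(utterance: str) -> str:
    """Strip filled pauses from an ASR transcript and tidy whitespace."""
    cleaned = FILLERS.sub("", utterance)
    return re.sub(r"\s{2,}", " ", cleaned).strip()

print(remove_fillers("Um, I want to, uh, book a flight"))
# -> I want to book a flight
```

Note the word boundaries (`\b`): they keep the rule from mangling words that merely contain a filler, such as "umbrella".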
Compensation
Compensation at mid-to-senior levels is growing fast, with total packages of $180K–$250K or more at remote-first startups and FAANG companies. The scarcity of engineers who truly understand both domains (speech + NLP) commands a significant premium.
Salary breakdown:
- Entry (0–2 years): $130K–$170K - Usually requires a strong NLP background + basic ASR knowledge
- Mid (3–5 years): $170K–$215K - Production experience with conversational AI systems
- Senior (6+ years): $200K–$280K+ - Architectural leadership, published work, multimodal expertise
Why This Niche is Exploding
The 2024-2026 shift from text-based LLMs to multimodal AI has fundamentally changed the landscape. Companies that built text-only NLP systems are now racing to add native audio understanding. This creates massive demand for "bridge" engineers who can:
- Integrate ASR systems with LLMs
- Build end-to-end audio understanding without transcription bottlenecks
- Handle real-world speech phenomena that text-trained models struggle with
- Design conversation systems that feel natural, not robotic
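One way to picture the ASR-plus-LLM integration described above is as a cascaded pipeline with swappable stages. The sketch below is a structural illustration under stated assumptions: the `CascadedSLU` class and the toy `fake_asr` / `keyword_intents` stand-ins are hypothetical, where a real deployment would plug in, say, a Whisper model and an LLM-based intent classifier behind the same callables.

```python
from dataclasses import dataclass
from typing import Callable

# Stage signatures for a cascaded spoken-language-understanding pipeline.
Transcriber = Callable[[bytes], str]      # raw audio -> transcript
IntentClassifier = Callable[[str], str]   # transcript -> intent label

@dataclass
class CascadedSLU:
    """Audio -> text -> intent, with each stage independently swappable."""
    transcribe: Transcriber
    classify: IntentClassifier

    def __call__(self, audio: bytes) -> dict:
        transcript = self.transcribe(audio)
        return {"transcript": transcript, "intent": self.classify(transcript)}

# Toy stand-ins so the sketch runs end to end without model downloads.
def fake_asr(audio: bytes) -> str:
    return "book a table for two"

def keyword_intents(text: str) -> str:
    return "make_reservation" if "book" in text else "unknown"

slu = CascadedSLU(transcribe=fake_asr, classify=keyword_intents)
result = slu(b"\x00\x01")  # real audio bytes would go here
```

Keeping each stage behind a plain callable also makes it straightforward to later swap the whole cascade for a single end-to-end audio-to-intent model, avoiding the transcription bottleneck without changing callers.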
Key Companies Hiring
- Voice AI Platforms: OpenAI (ChatGPT voice), Anthropic (Claude voice), Google (Gemini), Meta
- Conversation Intelligence: Gong, Chorus.ai, Fireflies, Otter.ai
- Enterprise AI: Microsoft (Teams intelligence), Zoom, Cisco
- Healthcare: Nuance, Suki.ai, Notable Health
- Customer Service: Replicant, PolyAI, Observe.AI
Recommended Tools for Spoken NLP Engineers
Note: Some of the links below are affiliate links. We may earn a small commission if you make a purchase through these links at no additional cost to you.
Hugging Face Audio Course
Free comprehensive course covering speech + NLP integration - essential for this field
Speech and Language Processing (Jurafsky & Martin)
The definitive textbook - the free online draft covers both speech and NLP fundamentals
Blue Yeti USB Microphone
Professional audio quality for testing voice systems - under $100