Speech AI Engineer
About VORTO
Vorto is on a mission to increase sustainability and create more jobs by making supply chains more efficient across the entire value chain. Through powerful AI technology, Vorto's autonomous supply chain platform seeks to reduce carbon emissions caused by supply chain transportation, improve the lives of approximately 3.5 million truck drivers and create more jobs across all players in B2B transactions. We operate in a very fast-paced and nimble environment that is highly focused on a team-first, accomplishment-oriented culture that is passionate about the organization's success. Our products have been developed by a world-class engineering team that simplifies complex business problems to a degree where adoption is effortless. We encourage you to visit our careers page and read this blog post to learn more about our culture.
What You'll Do
Own the Speech interface layer: real-time speech-to-intent pipeline using models like Whisper or Deepgram
Build conversational UX flows for tasks like:
Load search and booking
Navigation and status updates
Voice journaling and micro-coaching
Optimize for real-world conditions: noisy cabs, unreliable signal, driver interruptions
Design fallback logic and graceful failure handling when voice commands are ambiguous or partial
Develop personalization features to learn and adapt to each driver’s preferences, tone, and goals over time
Personalization Responsibilities
You’ll lead the design of systems that help the assistant become a
relationship-level tool, not just a command engine:
Build a driver memory graph to store goals, language patterns, habits, and personality traits
Develop logic for driver-specific intent recognition and natural phrasing (e.g., “get me home this weekend”)
Implement adaptive prompting based on driver profile tags (e.g., preferred tone, motivational style)
Enable the system to reference past conversations, track personal milestones, and make coaching feel deeply contextual and human
Use retrieval-augmented generation (RAG) or custom embeddings to let the assistant "remember" and evolve
Technologies You Might Use
Speech: Whisper, Deepgram, Google Speech-to-Text
NLU / LLMs: GPT-4o, Rasa, LangChain, custom intent parsers
Frontend: React Native (for fallback UI), WebRTC, Twilio Voice
Infra: Node.js, Python, Firebase, Supabase, Postgres, Pinecone
You Might Be a Fit If You
Have built or contributed to voice or AI-driven products with personalization or memory features
Understand the difference between "generic commands" and relationship-based coaching
Know how to ship fast, but keep latency and UX top of mind
Are excited to build tools that respect and elevate working-class users
Are obsessed with human-first AI design
Benefits
At VORTO we are committed to developing our employees and providing them exciting opportunities to grow and prosper in their careers. We encourage you to visit our careers page and read this blog post to learn more about our culture.
We offer a competitive benefits package as well as numerous additional perks including:
Competitive compensation package
Health, Dental and Vision Insurance
401k with matching
Company paid life and short-term disability insurance
Company paid parking or transit pass
Relocation offered when applicable
Modern office space in downtown Denver
Daily coffee, tea, drinks & snacks
Team happy hours
VORTO is an Equal Opportunity Employer.
This position will be posted until a qualified candidate is hired.
Disclaimer: This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee. Other duties, responsibilities and activities may change or be assigned.
Apply to this Job