If you want to build voice agents without having a mental breakdown over audio transcription internals, Deepgram makes it ridiculously approachable. I used their stack to get real-time STT + TTS working fast, and the free credit is enough to actually prototype and iterate for a while.
You'll be taken to Dpgr to complete your purchase.