Overcast Maker Marco Arment Talks Supporting Transcripts, Accessibility, More In New Interview
“Oh shit.”
That’s what Marco Arment said to me this week over email when asked about his reaction to Apple’s announcement in March of last year it was going to add transcripts to Podcasts. Arment’s own podcast app, Overcast, was updated this week to add transcripts after the feature was announced last month by Arment in the app’s Reddit.
Why was Arment filled with existential dread over Apple’s embrace of transcripts?
“I knew it would be a huge competitive advantage for them that I wouldn’t be able to match for a long time, if ever, due to the immense cost and complexity of generating transcripts for millions of podcasts at that time,” he said.
Arment’s quip about transcripts’ cost and complexity is well taken. Indeed, a big reason why most podcasts, particularly the popular shows, don’t have transcripts is because they’re expensive and time-consuming to produce. This is the main reason why Apple’s announcement was so newsworthy; they have the engineering resources and financial wherewithal to do transcripts and do them well. As to the industry writ large, Arment explained most big shows use dynamically-inserted ads to each download, meaning one’s transcript mechanics could produce a different result than what the user sees on their device(s). For Overcast’s purposes, Arment told me he spent “months” developing a so-called “audio-fingerprinting algorithm” that “[detected] any audio insertions or removals and synchronize the transcript to match.”
Arment’s algorithm “still isn’t perfect” to this day, he added.
Besides the algorithm, Arment explained he deployed a whopping 48 Mac minis—“46 more than I’d ever had,” he said—as a means for scaling transcript generation “for most podcasts.” Arment had never used a datacenter before, and described the whole experience “a huge undertaking” but acknowledged it was “a lot of fun too.” You can hear him talk much more, and in much nerdier detail, about adding transcripts in Overcast in Episode 683 of the Accidental Tech Podcast, released on March 19.
Really, though, the biggest technological breakthrough circles back to Apple.
“The breakthrough that finally made [transcripts] feasible at my scale was the new Apple speech-recognition API in the 26-series OSes last year, which produced transcripts extremely quickly on consumer-level hardware with decent enough quality to be usable as a navigation aid,” Arment said of Apple’s gift to developers like him.
The API Arment refers to is the SpeechAnalyzer framework new to iOS 26, et al.
As to Overcast and accessibility writ large, Arment said transcripts that are computer-generated actually have errors rates “far higher” than human-created transcripts. (As a corollary, this is also why captioning heavyweights like VITAC use human captioners as a buffer for computers doing the transcription.) Because of this, Arment believes reading transcripts ism’t a viable replacement for listening to the audio—at least not yet, anyway. He hopes someday they “can be,” but for now, computer-generated transcripts is a “small step in [the] direction” towards making podcasts accessible to those who can’t hear them. (I can say anecdotally, there are many people in the Deaf and hard-of-hearing community who won’t listen to podcasts if there’s no transcript.)
“While these transcripts today aren’t ready to be complete audio replacements, they do serve as an excellent augmentation to the audio,” Arment said. “This can help people better understand what was spoken, review anything they may have missed, navigate between topics more easily, and—soon—search and share more effectively.”
At their core, transcripts are about accessibility and making podcasts more inclusive.
“Good accessibility isn’t just the right thing to do, but it’s also good business, and it makes the app better for everyone,” Arment said of his years-long embrace of supporting accessibility features in Overcast. “Accessibility means making an app that can be used and loved by more people, more effectively and easily, in more circumstances. I can’t imagine why any app developer wouldn’t want that.”
For his part, he conceded there remains “a lot” of UI polish and other work to iterate on in Overcast now that the foundation is laid for transcripts. The groundwork portends what Arment called “amazing potential improvements to navigation, search, and sharing that I’m very excited to build.” Likewise, he hopes Apple’s API will be improved so as to see “further increases in quality in the new OSes this summer and beyond.”
When asked about feedback during the beta period, Arment called it “amazing.”
“I most loved watching the comments roll in as people started navigating their podcasts by scrolling and tapping the transcript instead of the seek buttons,” he said.
Overcast’s 2026.4 update with transcripts is now available in the App Store.