Transcribe Audio + 10 more blocks now live! 🎉

dannielle · June 4, 2025, 9:11pm

Now live: Audio Transcription →

The new “Transcribe Audio” block lets your agents pull transcripts from any audio URL — using GPT-4o, GPT-4o-mini, or Whisper.

Need users to upload their own files? Use the new “Upload Audio” user input and set Processing = Transcribe.

Plus, 10 new blocks for transforming text

Everything you actually need to do with text — now built in.
No prompt required. Just drop the block in and go.

Here’s what’s new:

Summarize Text – Get the TL;DR

Improve Writing – Fix spelling, grammar, clarity

Translate Text – Any language, instantly

Rewrite for Audience – Adapt for different personas

Convert Reading Level – Make it simpler or more advanced

Merge Text – Combine multiple inputs into one

Adjust Tone – Casual, formal, fun — your call

Align to Brand – Stay on voice, every time

Rewrite Text – Give it a rule, and it’ll rewrite at scale

Convert to Format – Repurpose content into a blog, email, post — whatever you need

All of this is available now in the menu block.

jerry-mindstudio · June 4, 2025, 11:18pm

@Kuane speech to text

Kuane · June 5, 2025, 12:22am

Looks like all 3 models cost the same for audio transcription. How does the pricing work? Is it by tokens generated in the transcribed transcript?

Topic		Replies	Views
Add ElevenLabs tagging of speakers to Speech to Text Block Feature Requests	2	72	August 12, 2025
Please add Whisper Large V3 Model Feature Requests	0	15	October 23, 2025
Speech to Text Block Feature Requests	6	102	August 8, 2025
Add Full ElevenLabs API Support for Voice Selection via API Key in External Integrations Feature Requests	5	130	August 11, 2025
Generate Podcast Agents	1	114	April 8, 2025