Transcribe Audio + 10 more blocks now live! 🎉

Now live: Audio Transcription :speaker_high_volume::writing_hand:

The new “Transcribe Audio” block lets your agents pull transcripts from any audio URL — using GPT-4o, GPT-4o-mini, or Whisper.

Need users to upload their own files? Use the new “Upload Audio” user input and set Processing = Transcribe.

Plus, 10 new blocks for transforming text :repeat_button:

Everything you actually need to do with text — now built in.
No prompt required. Just drop the block in and go.

Here’s what’s new:

:memo: Summarize Text – Get the TL;DR

:brain: Improve Writing – Fix spelling, grammar, clarity

:globe_showing_europe_africa: Translate Text – Any language, instantly

:busts_in_silhouette: Rewrite for Audience – Adapt for different personas

:graduation_cap: Convert Reading Level – Make it simpler or more advanced

:shuffle_tracks_button: Merge Text – Combine multiple inputs into one

:speech_balloon: Adjust Tone – Casual, formal, fun — your call

:white_check_mark: Align to Brand – Stay on voice, every time

:pencil: Rewrite Text – Give it a rule, and it’ll rewrite at scale

:brick: Convert to Format – Repurpose content into a blog, email, post — whatever you need

All of this is available now in the menu block.

1 Like

@Kuane speech to text

1 Like

Looks like all 3 models cost the same for audio transcription. How does the pricing work? Is it by tokens generated in the transcribed transcript?

1 Like