how to send images along with my text prompt?

Wouldn’t it be better to use an analyze image block (vision model)?