What are you trying to achieve?
I want to send an image containing text to a Telegram bot and have ChatGPT analyze it and return the extracted text as a plain text message.
Steps taken so far
I want to send an image containing text to a Telegram bot and have ChatGPT analyze it and return the extracted text as a plain text message.
When I provide the image as a URL in the ChatGPT - Analyze Images (Vision) module, ChatGPT processes it correctly and extracts the text. However, when I change the input method to downloading the image from the Telegram bot, I start facing issues.
ChatGPT either says it cannot process the request (which means it receives something but refuses to extract the text) or it outputs completely unrelated content, as if it is making up information instead of recognizing the actual text from the image.
I suspect that ChatGPT is receiving a lower-resolution version of the image, making the text unreadable, which causes it either to fail or to hallucinate a response (but this is just my guess—I don’t know how to verify it).
How can I solve this issue?
In the first two screenshots, I provided an example where I simply use the image URL in ChatGPT, and everything works correctly.
The other two screenshots show the case where I take the image from the Telegram bot (download it from there), and in this case, there is an issue with the output result.