Hello ladies and gents,
I’m creating a service for long-audio transcription. I want to use OpenAI Whisper, which has a limit of 25 MB per file, so I’m splitting the audio with the 1001fx module (which unfortunately supports only mp3, not m4a). Its output (binary data) is then used as the OpenAI module’s input.
Here’s the issue: the OpenAI module returns an error:
[400] Unrecognized file format. Supported formats: ['flac', 'm4a', 'mp3', 'mp4', 'mpeg', 'mpga', 'oga', 'ogg', 'wav', 'webm']
JSON output from 1001fx:
[
  {
    "data": "IMTBuffer(149, binary, e87d1455a4927388b58fc0863c24b30a576058fa): 7b22726573756c74223a7b2275726c223a2268747470733a2f2f70726f642d3130303166782d7075626c69632e622d63646e2e6e65742f74656d706173736574732f32303234313230352f30314a4539305451303151395142573546584a565433425642",
    "fileName": "filename",
    "fileSize": "1",
    "statusCode": 200
  }
]
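One observation that may help narrow this down: the hex preview in the `data` field is plain text, not mp3 bytes. Decoding it (sketch below, using the exact preview string from the output above; the buffer is 149 bytes, so the preview and the URL inside it are truncated) shows the module handed over a JSON body containing a CDN URL rather than the audio itself, which would explain why Whisper rejects the file format:

```python
# Hex preview copied verbatim from the 1001fx "data" field
# (truncated, as in the module output above).
hex_preview = (
    "7b22726573756c74223a7b2275726c223a22"
    "68747470733a2f2f70726f642d3130303166782d7075626c69632e622d63646e2e6e6574"
    "2f74656d706173736574732f32303234313230352f"
    "30314a4539305451303151395142573546584a565433425642"
)

# Decode the hex into readable text to see what the buffer really holds.
decoded = bytes.fromhex(hex_preview).decode("utf-8")
print(decoded)
# prints: {"result":{"url":"https://prod-1001fx-public.b-cdn.net/tempassets/...
```

So the binary output appears to be the module’s JSON response (with a link to the generated file), not the mp3 data Whisper expects.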
Has anyone had a similar issue, or an idea how to do this differently?