Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

client.audio.transcriptions.create receive different results depending on OS #1705

Open
1 task done
Marincor opened this issue Sep 12, 2024 · 2 comments
Open
1 task done
Labels
bug Something isn't working

Comments

@Marincor
Copy link

Confirm this is an issue with the Python library and not an underlying OpenAI API

  • This is an issue with the Python library

Describe the bug

I'm making a transcription of audio using the transcriptions create method, but even running with the same audio, the same configuration, and the same Python version, I have different results depending on the OS, and to be more precise, when I am running into a windows OS the result is correct, but when running into Linux or Mac, the results is wrong. Some examples of what happened with the transcriptions:

Windows:
ok, I will do that, thank you for your support!

Linux and Mac
ok, I will do that, do that, do that, do that, do that, do that, do that

Information about the OS that runs the tests:
Windows: Windows 11 Home Single Language (version: 23H2) (running inside wsl2 with ubuntu 22.04 distro) (running local)
MAC: macOS (version: 14.4.1) (running local)
Linux: Amazon Linux 2 - (running inside ECS)

Please, let me know if need more information than that.

To Reproduce

  1. open local audio in format .ogg as binary ("rb"):
   with open('audio.ogg', 'rb') as opened_audio:
  1. file the params with the following params:
           transcription_params = {
            'model': 'whisper-1',
            'file': opened_audio,
            'temperature': 0.0,
            'language': 'pt',
            'response_format': 'verbose_json'
        }
  1. run as below in different's OS a couple of audios and compare the results:
  from openai import OpenAI

   with open('audio.ogg', 'rb') as opened_audio:
        transcription_params = {
            'model': 'whisper-1',
            'file': opened_audio,
            'temperature': 0.0,
            'language': 'pt',
            'response_format': 'verbose_json'
        }

        OpenAI().audio.transcriptions.create(**transcription_params)

Code snippets

from openai import OpenAI

   with open('audio.ogg', 'rb') as opened_audio:
        transcription_params = {
            'model': 'whisper-1',
            'file': opened_audio,
            'temperature': 0.0,
            'language': 'pt',
            'response_format': 'verbose_json'
        }

        OpenAI().audio.transcriptions.create(**transcription_params)

OS

MACOS

Python version

3.12.5

Library version

1.42.0

@Marincor Marincor added the bug Something isn't working label Sep 12, 2024
@RobertCraigie
Copy link
Collaborator

Thanks for reporting, we'll investigate. I'm curious if you tried making the same request without using the SDK?

@Marincor
Copy link
Author

@RobertCraigie

Thanks for reporting, we'll investigate. I'm curious if you tried making the same request without using the SDK?

Hello, Robert, thanks for your return... I tested it now via the endpoint via request instead of the SDK and the behavior is the same (tried both in python and postman).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants