Class: LLM::OpenAI::Audio

Inherits: Object

Defined in: lib/llm/providers/openai/audio.rb

Overview

The LLM::OpenAI::Audio class provides an audio object for interacting with OpenAI’s audio API.

Examples:

llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_speech(input: "A dog on a rocket to the moon")
File.binwrite("rocket.mp3", res.audio.string)

Instance Method Summary

  • #create_speech ⇒ LLM::Response::Audio
  • #create_transcription ⇒ LLM::Response::AudioTranscription
  • #create_translation ⇒ LLM::Response::AudioTranslation

Constructor Details

#initialize(provider) ⇒ LLM::OpenAI::Audio

Returns a new Audio object

Parameters:

  • provider (LLM::Provider)

    The provider instance
# File 'lib/llm/providers/openai/audio.rb', line 18

def initialize(provider)
  @provider = provider
end
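
Note: as the examples in this document show, an Audio instance is normally obtained through the provider rather than constructed directly. A minimal sketch:

llm = LLM.openai(ENV["KEY"])
audio = llm.audio # an LLM::OpenAI::Audio instance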

Instance Method Details

#create_speech(input:, voice: "alloy", model: "gpt-4o-mini-tts", response_format: "mp3", **params) ⇒ LLM::Response::Audio

Create an audio track from text input (text-to-speech)

Examples:

llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_speech(input: "A dog on a rocket to the moon")
File.binwrite("rocket.mp3", res.audio.string)

Parameters:

  • input (String)

    The text input

  • voice (String) (defaults to: "alloy")

    The voice to use

  • model (String) (defaults to: "gpt-4o-mini-tts")

    The model to use

  • response_format (String) (defaults to: "mp3")

    The response format

  • params (Hash)

    Other parameters (see OpenAI docs)

Returns:

  • (LLM::Response::Audio)

Raises:

See Also:

  • OpenAI docs: https://platform.openai.com/docs/api-reference/audio
# File 'lib/llm/providers/openai/audio.rb', line 36

def create_speech(input:, voice: "alloy", model: "gpt-4o-mini-tts", response_format: "mp3", **params)
  req = Net::HTTP::Post.new("/v1/audio/speech", headers)
  req.body = JSON.dump({input:, voice:, model:, response_format:}.merge!(params))
  io = StringIO.new("".b)
  res = request(http, req) { _1.read_body { |chunk| io << chunk } }
  LLM::Response::Audio.new(res).tap { _1.audio = io }
end
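
Because extra keyword arguments are forwarded to the API through params, other options from the OpenAI docs can be passed as well. A sketch, assuming OpenAI's "nova" voice, WAV output, and speed option (none of which appear elsewhere in this document):

llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_speech(
  input: "A dog on a rocket to the moon",
  voice: "nova",          # an alternative voice (see OpenAI docs)
  response_format: "wav", # request WAV instead of the default MP3
  speed: 1.25             # forwarded through params (see OpenAI docs)
)
File.binwrite("rocket.wav", res.audio.string)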

#create_transcription(file:, model: "whisper-1", **params) ⇒ LLM::Response::AudioTranscription

Create an audio transcription

Examples:

llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_transcription(file: LLM::File("/rocket.mp3"))
res.text # => "A dog on a rocket to the moon"

Parameters:

  • file (LLM::File)

    The input audio

  • model (String) (defaults to: "whisper-1")

    The model to use

  • params (Hash)

    Other parameters (see OpenAI docs)

Returns:

  • (LLM::Response::AudioTranscription)

Raises:

See Also:

  • OpenAI docs: https://platform.openai.com/docs/api-reference/audio
# File 'lib/llm/providers/openai/audio.rb', line 56

def create_transcription(file:, model: "whisper-1", **params)
  multi = LLM::Multipart.new(params.merge!(file:, model:))
  req = Net::HTTP::Post.new("/v1/audio/transcriptions", headers)
  req["content-type"] = multi.content_type
  req.body_stream = multi.body
  res = request(http, req)
  LLM::Response::AudioTranscription.new(res).tap { _1.text = _1.body["text"] }
end
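
As with create_speech, additional options from the OpenAI docs pass through params. A sketch, assuming OpenAI's language and temperature options (not part of this method's signature):

llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_transcription(
  file: LLM::File("/rocket.mp3"),
  language: "en",  # ISO-639-1 hint, forwarded through params (see OpenAI docs)
  temperature: 0.0 # forwarded through params (see OpenAI docs)
)
res.text # => "A dog on a rocket to the moon"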

#create_translation(file:, model: "whisper-1", **params) ⇒ LLM::Response::AudioTranslation

Create an audio translation (into English)

Examples:

# Arabic => English
llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_translation(file: LLM::File("/bismillah.mp3"))
res.text # => "In the name of Allah, the Beneficent, the Merciful."

Parameters:

  • file (LLM::File)

    The input audio

  • model (String) (defaults to: "whisper-1")

    The model to use

  • params (Hash)

    Other parameters (see OpenAI docs)

Returns:

  • (LLM::Response::AudioTranslation)

Raises:

See Also:

  • OpenAI docs: https://platform.openai.com/docs/api-reference/audio
# File 'lib/llm/providers/openai/audio.rb', line 78

def create_translation(file:, model: "whisper-1", **params)
  multi = LLM::Multipart.new(params.merge!(file:, model:))
  req = Net::HTTP::Post.new("/v1/audio/translations", headers)
  req["content-type"] = multi.content_type
  req.body_stream = multi.body
  res = request(http, req)
  LLM::Response::AudioTranslation.new(res).tap { _1.text = _1.body["text"] }
end
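
Here too params forwards extra options. A sketch, assuming OpenAI's prompt option, which can guide the translation's style (an assumption based on the OpenAI docs, not this library's documentation):

# Arabic => English
llm = LLM.openai(ENV["KEY"])
res = llm.audio.create_translation(
  file: LLM::File("/bismillah.mp3"),
  prompt: "A Quranic recitation" # forwarded through params (see OpenAI docs)
)
res.text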