VoiceConversation

VoiceConversation is the public class exported from @spekoai/client. Always construct it via the static create() factory — the factory awaits connection, so by the time it resolves the session is live.

import { VoiceConversation } from '@spekoai/client';

There is also a legacy namespace export, Conversation, with a single method Conversation.startSession(options) — it's an alias for VoiceConversation.create(options), kept so consumers migrating from other SDKs can use familiar naming.

`VoiceConversation.create(options)`

static create(options: CreateOptions): Promise<VoiceConversation>

Where CreateOptions is the short-lived token shape:

type CreateOptions = ConversationOptions;

Your backend calls POST /v1/sessions, optionally with an agentId, and forwards only transportToken and transportUrl to the browser. VoiceConversation.create() connects to the media transport, publishes the microphone track, sends any overrides over the data channel, fires onConnect, and resolves. It throws a SpekoClientError on connection, network, or microphone failure.

Do not send SPEKO_API_KEY to browser code. VoiceConversation no longer accepts agentId, apiKey, or apiBaseUrl; session minting belongs on your server.

`ConversationOptions`

Field	Type	Required	Description
`transportToken`	`string`	✅	Browser-safe media transport token, returned by your server.
`transportUrl`	`string`	✅	Media transport URL, returned by your server. Pass it straight through — the SDK does not default this so consumers can't ship against the wrong environment.
`overrides`	`ConversationOverrides?`		Per-session agent / TTS overrides. Sent over the data channel right after connect.
`inputDeviceId`	`string?`		Specific microphone `deviceId`.
`outputDeviceId`	`string?`		Specific speaker `deviceId`. Applied via `setSinkId`; silently ignored on browsers without support.
`audioConstraints`	`AudioConstraints?`		`echoCancellation`, `noiseSuppression`, `autoGainControl`. All default `true`.
`onConnect`	`(d: { conversationId }) => void`		Fired after the mic publishes and status becomes `connected`.
`onDisconnect`	`(d: DisconnectionDetails) => void`		Fired on server or client disconnect.
`onMessage`	`(m: ConversationMessage) => void`		Inbound transcripts, agent messages, user-message echoes.
`onStatusChange`	`(s: ConversationStatus) => void`		`connecting → connected → disconnecting → disconnected`.
`onModeChange`	`(m: ConversationMode) => void`		`listening` vs `speaking`, derived from transport active-speaker events.

conversationToken and livekitUrl are still accepted as legacy aliases for existing callers. | onError | (err: Error) => void | | Non-fatal errors (malformed data packets, media device errors, sink-id failures). |

`ConversationOverrides`

interface ConversationOverrides {
  agent?: {
    prompt?: string;
    firstMessage?: string;
    language?: string;
  };
  tts?: {
    voiceId?: string;
    speed?: number;
  };
}

Overrides are JSON-serialized and published over the data channel immediately after the mic is live. The agent worker can read them and reconfigure the session before its first reply.

`AudioConstraints`

interface AudioConstraints {
  echoCancellation?: boolean; // default true
  noiseSuppression?: boolean; // default true
  autoGainControl?: boolean; // default true
}

The SDK always routes through createLocalAudioTrack({ ... }) rather than setMicrophoneEnabled(true) so that constraints are applied even when no inputDeviceId is passed — setMicrophoneEnabled silently ignores them in that case.

Sets status to disconnected and fires onStatusChange.
Stops the local microphone track so the browser's mic indicator goes away.
Detaches and removes every <audio> element it added to document.body.
Fires onDisconnect with a mapped DisconnectionReason.

Your component's unmount effect should call endSession() so navigating away doesn't leak a live transport session.

`VoiceConversation.create(options)`

`ConversationOptions`

`ConversationOverrides`

`AudioConstraints`

Instance methods

`getId(): string`

`isOpen(): boolean`

`setMicMuted(muted: boolean): Promise<void>`

`setVolume(volume: number): void`

`sendUserMessage(text: string): void`

`sendContextualUpdate(text: string): void`

`endSession(): Promise<void>`

Teardown invariants

On this page