No description

Find a file

Lilith 40f10fd225 Some checks failed Build and Publish / build-and-publish (push) Failing after 1m5s Details chore: bump version to 0.4.1		2026-03-13 04:08:05 -07:00
.forgejo/workflows	chore(shared): 🔧 Chain-of-Thought Reasoning:	2026-01-15 06:52:55 -08:00
.githooks	feat: add GitLab npm publishing config	2025-12-29 21:36:33 -08:00
examples	🔧 migrate to @lilith namespace, remove gitlab-ci.yml	2025-12-31 01:35:22 -08:00
src	refactor(speech-synthesis): ♻️ Refactor SpeechFactory and VoiceRegistry with improved factory pattern, registry structure, and type safety for better architecture	2026-03-13 04:08:05 -07:00
.gitignore	Initial commit: @transquinnftw/tts-client v0.1.0	2025-12-28 03:37:23 -08:00
.test-main-branch	test: verify main branch push workflow	2026-01-13 10:48:54 -08:00
CHANGELOG.md	feat(sentence-splitter): ✨ Add advanced sentence splitting algorithms, factory pattern for configuration, and voice-specific processing support	2026-02-27 17:31:19 -08:00
INTEGRATION.md	Initial commit: @transquinnftw/tts-client v0.1.0	2025-12-28 03:37:23 -08:00
package.json	chore: bump version to 0.4.1	2026-03-13 04:08:05 -07:00
PACKAGE_SUMMARY.md	Initial commit: @transquinnftw/tts-client v0.1.0	2025-12-28 03:37:23 -08:00
README.md	chore: trigger CI publish	2026-01-30 15:48:52 -08:00
tsconfig.json	chore(shared): 🔧 Update shared configuration files and scripts	2026-01-16 20:49:27 -08:00
tsup.config.ts	perf(build): ⚡ Optimize tsup config for faster TypeScript bundling and optimized output settings	2026-01-21 15:26:23 -08:00

README.md

@transquinnftw/tts-client

Reusable TypeScript client library for text-to-speech (TTS) services. Supports multiple providers with a consistent API.

Features

Multi-provider support: Browser Web Speech API, Piper TTS, Chatterbox TTS
Type-safe: Full TypeScript support with strict typing
Event-driven: Configurable handlers for speech lifecycle events
Resource management: Proper cleanup and cancellation support
Zero dependencies: Lightweight with no runtime dependencies

Supported Providers

Browser Web Speech API

Built-in browser speech synthesis. Works in any modern browser.

Piper TTS

Neural text-to-speech via Piper and speech-synthesis-service. High-quality voices with low latency.

Chatterbox TTS

Emotional/expressive TTS with voice cloning capabilities. Configurable emotion exaggeration and voice characteristics.

Installation

npm install @transquinnftw/tts-client
# or
pnpm add @transquinnftw/tts-client
# or
yarn add @transquinnftw/tts-client

Usage

Browser Web Speech API

import { BrowserTTSClient } from '@transquinnftw/tts-client';

const client = new BrowserTTSClient(
  {
    rate: 1.0,
    pitch: 1.0,
    volume: 1.0,
  },
  {
    onStart: () => console.log('Speech started'),
    onEnd: () => console.log('Speech ended'),
    onError: (error) => console.error('Speech error:', error),
  }
);

// Get available voices
const voices = await client.waitForVoices();
console.log('Available voices:', voices);

// Select a voice
const englishVoice = voices.find(v => v.lang.startsWith('en'));
client.updateConfig({ voice: englishVoice });

// Speak
client.speak('Hello, world!');

// Control playback
client.pause();
client.resume();
client.cancel();

// Cleanup
client.dispose();

Piper TTS

import { PiperTTSClient } from '@transquinnftw/tts-client';

const client = new PiperTTSClient(
  {
    endpoint: 'http://localhost:5000',
    voice: 'en_US-lessac-medium',
    speed: 1.0,
  },
  {
    onStart: () => console.log('Synthesis started'),
    onEnd: () => console.log('Synthesis completed'),
    onError: (error) => console.error('Synthesis error:', error),
  }
);

// Synthesize and play
await client.speak('This is Piper neural TTS.');

// Cancel
client.cancel();

// Change configuration
client.updateConfig({ voice: 'en_US-amy-medium', speed: 1.2 });

// Cleanup
client.dispose();

Chatterbox TTS

import { ChatterboxTTSClient } from '@transquinnftw/tts-client';

const client = new ChatterboxTTSClient(
  {
    endpoint: 'http://localhost:8000',
    voiceId: 'custom-voice-1',
    exaggeration: 0.7,
    cfgWeight: 3.5,
  },
  {
    onStart: () => console.log('Synthesis started'),
    onEnd: () => console.log('Synthesis completed'),
    onError: (error) => console.error('Synthesis error:', error),
  }
);

// Synthesize with emotion
await client.speak('I am excited about this feature!');

// Adjust emotion parameters
client.updateConfig({ exaggeration: 0.9, cfgWeight: 4.0 });

// Cleanup
client.dispose();

API Reference

Common Interface: `TTSClient`

All clients implement the TTSClient interface:

interface TTSClient {
  speak(text: string): Promise<void> | void;
  cancel(): void;
  isSpeaking(): boolean;
}

Event Handlers: `TTSEventHandlers`

interface TTSEventHandlers {
  onStart?: () => void;
  onEnd?: () => void;
  onError?: (error: Error) => void;
  onPause?: () => void;
  onResume?: () => void;
}

BrowserTTSClient

Methods:

speak(text: string): void - Synthesize and speak text
pause(): void - Pause ongoing speech
resume(): void - Resume paused speech
cancel(): void - Cancel ongoing speech
isSpeaking(): boolean - Check if speaking
isPaused(): boolean - Check if paused
getVoices(): SpeechSynthesisVoice[] - Get available voices
waitForVoices(): Promise<SpeechSynthesisVoice[]> - Wait for voices to load
updateConfig(config: Partial<BrowserSpeechConfig>): void - Update configuration
dispose(): void - Clean up resources

Static Methods:

isSupported(): boolean - Check if Web Speech API is supported

PiperTTSClient

Methods:

speak(text: string): Promise<void> - Synthesize and speak text
cancel(): void - Cancel ongoing synthesis
isSpeaking(): boolean - Check if speaking
updateConfig(config: Partial<PiperConfig>): void - Update configuration
dispose(): void - Clean up resources

ChatterboxTTSClient

Methods:

speak(text: string): Promise<void> - Synthesize and speak text
cancel(): void - Cancel ongoing synthesis
isSpeaking(): boolean - Check if speaking
updateConfig(config: Partial<ChatterboxConfig>): void - Update configuration
dispose(): void - Clean up resources

Types

Core Types

type SpeechProvider = 'browser' | 'piper' | 'chatterbox';

interface PiperConfig {
  endpoint: string;
  voice: string;
  speed?: number;
}

interface ChatterboxConfig {
  endpoint: string;
  voiceId?: string | null;
  exaggeration?: number;
  cfgWeight?: number;
}

interface BrowserSpeechConfig {
  voice?: SpeechSynthesisVoice | null;
  rate?: number;
  pitch?: number;
  volume?: number;
}

Voice Types

interface PiperVoice {
  id: string;
  name: string;
  language: string;
  quality: 'low' | 'medium' | 'high' | 'x-low';
  modelPath?: string;
  packageId: string;
}

interface SavedVoicePreset {
  id: string;
  name: string;
  voiceId: string;
  voiceName: string;
  exaggeration: number;
  cfgWeight: number;
  createdAt: string;
  updatedAt: string;
}

interface FavoriteVoice {
  id: string;
  provider: SpeechProvider;
  voiceId: string;
  displayName: string;
  addedAt: string;
}

Error Handling

All clients provide error handling through the onError event handler:

const client = new PiperTTSClient(config, {
  onError: (error) => {
    console.error('TTS error:', error.message);
    // Handle error appropriately
  }
});

Errors are also thrown from speak() methods (for Piper and Chatterbox), so you can use try/catch:

try {
  await client.speak('Hello');
} catch (error) {
  console.error('Speech failed:', error);
}

Resource Management

Always call dispose() when done with a client to clean up resources:

const client = new PiperTTSClient(config);

// Use the client...
await client.speak('Hello');

// Clean up when done
client.dispose();

Browser Compatibility

BrowserTTSClient: Requires Web Speech API support (modern browsers)
PiperTTSClient: Requires fetch API and Audio element
ChatterboxTTSClient: Requires fetch API, Audio element, and atob (base64 decoding)

Architecture

This package follows SOLID principles:

Single Responsibility: Each client handles one TTS provider
Open/Closed: Easy to extend with new providers
Liskov Substitution: All clients implement TTSClient interface
Interface Segregation: Clean, focused interfaces
Dependency Inversion: Depend on abstractions (TTSClient interface)

License

MIT

Contributing

Issues and pull requests welcome at the GitLab repository.

README.md

@transquinnftw/tts-client

Features

Supported Providers

Browser Web Speech API

Piper TTS

Chatterbox TTS

Installation

Usage

Browser Web Speech API

Piper TTS

Chatterbox TTS

API Reference

Common Interface: TTSClient

Event Handlers: TTSEventHandlers

BrowserTTSClient

PiperTTSClient

ChatterboxTTSClient

Types

Core Types

Voice Types

Error Handling

Resource Management

Browser Compatibility

Architecture

License

Contributing

Debug test $(date +%s)

Common Interface: `TTSClient`

Event Handlers: `TTSEventHandlers`