DeepSpeech
v0.7.4
Introduction
Using a Pre-trained Model
CUDA dependency
Getting the pre-trained model
Model compatibility
Using the Python package
Create a DeepSpeech virtual environment
Activating the environment
Installing DeepSpeech Python bindings
Using the Node.JS / Electron.JS package
Using the command-line client
Installing bindings from source
Dockerfile for building from source
Third party bindings
Training Your Own Model
Prerequisites for training a model
Getting the training code
Creating a virtual environment
Activating the environment
Installing DeepSpeech Training Code and its dependencies
Recommendations
Basic Dockerfile for training
Common Voice training data
Training a model
Training with automatic mixed precision
Checkpointing
Exporting a model for inference
Exporting a model for TFLite
Making a mmap-able model for inference
Continuing training from a release model
Fine-Tuning (same alphabet)
Transfer-Learning (new alphabet)
UTF-8 mode
Augmentation
Sample domain augmentations
Spectrogram domain augmentations
Multi domain augmentations
Supported platforms for inference
Linux / AMD64 without GPU
Linux / AMD64 with GPU
Linux / ARMv7
Linux / Aarch64
Android / ARMv7
Android / Aarch64
macOS / AMD64
Windows / AMD64 without GPU
Windows / AMD64 with GPU
Decoder and scorer
CTC beam search decoder
Introduction
External scorer
Decoding modes
Default mode (alphabet based)
UTF-8 mode
Implementation
External scorer scripts
Reproducing our external scorer
Building your own scorer
Architecture and training
DeepSpeech Model
Geometric Constants
n_input
n_context
n_hidden_1, n_hidden_2, n_hidden_5
n_cell_dim
n_hidden_3
n_hidden_6
Parallel Optimization
Asynchronous Parallel Optimization
Synchronous Optimization
Hybrid Parallel Optimization
Adam Optimization
API Reference
Error codes
C API
Data structures
Metadata
CandidateTranscript
TokenMetadata
.NET Framework
DeepSpeech Class
DeepSpeechStream Class
ErrorCodes
Metadata
CandidateTranscript
TokenMetadata
DeepSpeech Interface
Java
DeepSpeechModel
Metadata
CandidateTranscript
TokenMetadata
JavaScript (NodeJS / ElectronJS)
Model
Stream
Module exported methods
Metadata
CandidateTranscript
TokenMetadata
Python
Model
Stream
Metadata
CandidateTranscript
TokenMetadata
Examples
C API Usage example
Creating a model instance and loading model
Performing inference
Full source code
.NET API Usage example
Creating a model instance and loading model
Performing inference
Full source code
Java API Usage example
Creating a model instance and loading model
Performing inference
Full source code
JavaScript API Usage example
Creating a model instance and loading model
Performing inference
Full source code
Python API Usage example
Creating a model instance and loading model
Performing inference
Full source code
User contributed examples
DeepSpeech
Docs
»
Overview: module code
All modules for which code is available
native_client.python
Read the Docs
v: v0.7.4
Versions
master
latest
v0.7.3
v0.7.2
v0.7.1
v0.7.0
v0.6.1
v0.6.0
Downloads
On Read the Docs
Project Home
Builds
Free document hosting provided by
Read the Docs
.