A single transcript computed by the model, including a confidence value and the metadata for its constituent tokens.
const unsigned int
Size of the tokens array
Approximated confidence value for this transcript. This is roughly the sum of the acoustic model logit values for each timestep/character that contributed to the creation of this transcript.
- const unsigned int
Stores text of an individual token, along with its timing information.