vosk

package
v0.3.43 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jul 6, 2022 License: Apache-2.0, Apache-2.0 Imports: 1 Imported by: 0

Documentation

Overview

Go bindings for Vosk speech recognition toolkit. Vosk is an offline open source speech to text API for Android, iOS, Raspberry Pi and servers. It enables speech recognition models for 18 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian.

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func GPUInit

func GPUInit()

GPUInit automatically selects a CUDA device and allows multithreading.

func GPUThreadInit

func GPUThreadInit()

GPUThreadInit inits CUDA device in a multi-threaded environment.

func SetLogLevel

func SetLogLevel(logLevel int)

SetLogLevel sets the log level for Kaldi messages.

Types

type VoskModel

type VoskModel struct {
	// contains filtered or unexported fields
}

VoskModel contains a reference to the C VoskModel

func NewModel

func NewModel(modelPath string) (*VoskModel, error)

NewModel creates a new VoskModel instance

func (*VoskModel) FindWord

func (m *VoskModel) FindWord(word []byte) int

FindWord checks if a word can be recognized by the model. Returns the word symbol if the word exists inside the model or -1 otherwise.

func (*VoskModel) Free

func (m *VoskModel) Free()

type VoskRecognizer

type VoskRecognizer struct {
	// contains filtered or unexported fields
}

VoskRecognizer contains a reference to the C VoskRecognizer

func NewRecognizer

func NewRecognizer(model *VoskModel, sampleRate float64) (*VoskRecognizer, error)

NewRecognizer creates a new VoskRecognizer instance

func NewRecognizerGrm

func NewRecognizerGrm(model *VoskModel, sampleRate float64, grammer []byte) (*VoskRecognizer, error)

NewRecognizerGrm creates a new VoskRecognizer instance with the phrase list.

func NewRecognizerSpk

func NewRecognizerSpk(model *VoskModel, sampleRate float64, spkModel *VoskSpkModel) (*VoskRecognizer, error)

NewRecognizerSpk creates a new VoskRecognizer instance with a speaker model.

func (*VoskRecognizer) AcceptWaveform

func (r *VoskRecognizer) AcceptWaveform(buffer []byte) int

AcceptWaveform accepts and processes a new chunk of the voice data.

func (*VoskRecognizer) FinalResult

func (r *VoskRecognizer) FinalResult() []byte

FinalResult returns a speech recognition result. Same as result, but doesn't wait for silence.

func (*VoskRecognizer) Free

func (r *VoskRecognizer) Free()

func (*VoskRecognizer) PartialResult

func (r *VoskRecognizer) PartialResult() []byte

PartialResult returns a partial speech recognition result.

func (*VoskRecognizer) Reset

func (r *VoskRecognizer) Reset()

Reset resets the recognizer.

func (*VoskRecognizer) Result

func (r *VoskRecognizer) Result() []byte

Result returns a speech recognition result.

func (*VoskRecognizer) SetMaxAlternatives

func (r *VoskRecognizer) SetMaxAlternatives(maxAlternatives int)

SetMaxAlternatives configures the recognizer to output n-best results.

func (*VoskRecognizer) SetSpkModel

func (r *VoskRecognizer) SetSpkModel(spkModel *VoskSpkModel)

SetSpkModel adds a speaker model to an already initialized recognizer.

func (*VoskRecognizer) SetWords

func (r *VoskRecognizer) SetWords(words int)

SetWords enables words with times in the ouput.

type VoskSpkModel

type VoskSpkModel struct {
	// contains filtered or unexported fields
}

VoskSpkModel contains a reference to the C VoskSpkModel

func NewSpkModel

func NewSpkModel(spkModelPath string) (*VoskSpkModel, error)

NewSpkModel creates a new VoskSpkModel instance

func (*VoskSpkModel) Free

func (s *VoskSpkModel) Free()

Directories

Path Synopsis

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL