Speech-to-text entity

A speech-to-text (STT) entity allows other integrations or applications to stream speech data to the STT API and get text back.

A speech-to-text entity is derived from the homeassistant.components.stt.SpeechToTextEntity.

Properties

tip

Properties should always only return information from memory and not do I/O (like network requests).

Name	Type	Default	Description
supported_languages	list[str]	Required	The supported languages of the STT service.
supported_formats	list[AudioFormats]	Required	The supported audio formats of the STT service, wav or ogg.
supported_codecs	list[AudioCodecs]	Required	The supported audio codecs of the STT service, pcm or opus.
supported_bit_rates	list[AudioBitRates]	Required	The supported audio bit rates of the STT service, 8, 16, 24 or 32.
supported_sample_rates	list[AudioSampleRates]	Required	The supported audio sample rates of the STT service.
supported_channels	list[AudioChannels]	Required	The supported audio channels of the STT service, 1 or 2.

Methods

Process audio stream

The process audio stream method is used to send audio to an STT service and get text back.

class MySpeechToTextEntity(SpeechToTextEntity):
    """Represent a Speech To Text entity."""

    async def async_process_audio_stream(self) -> None:
        """Process an audio stream to STT service.

        Only streaming content is allowed!
        """

Properties​

Methods​

Process audio stream​

Properties

Methods

Process audio stream