SpeechRecognition: start() method
Limited availability
This feature is not Baseline because it does not work in some of the most widely-used browsers.
The start() method of the Web Speech API starts the speech recognition service listening to incoming audio (from a microphone or an audio track) with the intent to recognize it and return the results of that recognition.
Syntax
start()
start(audioTrack)
Parameters
audioTrack Optional
A MediaStreamTrack instance to perform speech recognition on. If this argument is not specified, the service attempts to recognize audio input from the user's microphone instead.
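For example, you could capture a microphone track with getUserMedia() and pass it in explicitly (a minimal sketch, assuming the user grants microphone permission; in effect this is broadly equivalent to calling start() with no arguments):

const recognition = new SpeechRecognition();

// Minimal sketch: explicitly pass a microphone track to start().
// Assumes the user grants microphone permission.
navigator.mediaDevices.getUserMedia({ audio: true }).then((stream) => {
  const [micTrack] = stream.getAudioTracks();
  recognition.start(micTrack);
});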
Return value
None (undefined).
Exceptions
InvalidStateError DOMException
Thrown if an audioTrack is specified and:
- Its kind property is not audio.
- Its readyState property is not live.
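One way to avoid this exception is to check the track before calling start(), or to catch the resulting DOMException. A minimal sketch, where someTrack is a hypothetical MediaStreamTrack obtained elsewhere:

// someTrack is a hypothetical MediaStreamTrack obtained elsewhere.
if (someTrack.kind === "audio" && someTrack.readyState === "live") {
  recognition.start(someTrack);
} else {
  console.error("Expected a live audio track.");
}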
Examples
Recognizing speech from a microphone
In our Speech color changer example, we create a new SpeechRecognition object instance using the SpeechRecognition() constructor. Later on, we create a click event handler on a <button> so that when it is clicked, we start the speech recognition service and await audio input from the user's microphone:
const recognition = new SpeechRecognition();
const diagnostic = document.querySelector(".output");
const bg = document.querySelector("html");
const startBtn = document.querySelector("button");
startBtn.onclick = () => {
  recognition.start();
  console.log("Ready to receive a color command.");
};
When a result has been successfully recognized, the result event fires. We extract the color that was spoken from the event object by grabbing the transcript of the first SpeechRecognitionAlternative of the first SpeechRecognitionResult in the returned results list. We then set the background color of the <html> element to that color.
recognition.onresult = (event) => {
  const color = event.results[0][0].transcript;
  diagnostic.textContent = `Result received: ${color}`;
  bg.style.backgroundColor = color;
};
Recognizing speech from an audio track
This code (excerpted from our audio track recognition demo) shows how to recognize speech from an audio track. To begin with, we create a new SpeechRecognition instance and set its lang to en-US. We then grab a reference to a <button> element and a <p> element to output results and diagnostic information to.
const recognition = new SpeechRecognition();
recognition.lang = "en-US";
const startBtn = document.querySelector("button");
const diagnostic = document.querySelector(".output");
Next, we add a click event handler to the <button>. When it is clicked, we create a new <audio> element using the Audio() constructor, loading an MP3 file into it. Once the MP3 is ready to play (determined by the canplay event), we capture it as a MediaStream using the captureStream() method, then extract the audio MediaStreamTrack from it using getAudioTracks().
We then play the audio (required for the recognition to take place), and pass the MediaStreamTrack into the start() method to start the recognition.
startBtn.addEventListener("click", () => {
  diagnostic.textContent = "";
  console.log("Loading audio track");
  const audioElement = new Audio("cheese-on-toast.mp3");
  audioElement.addEventListener("canplay", () => {
    const stream = audioElement.captureStream();
    const audioTrack = stream.getAudioTracks()[0];
    audioElement.play();
    recognition.start(audioTrack);
    console.log("Recognition started");
  });
});
To output the recognized speech, we listen for the result event. When it fires, we grab the transcript of the first SpeechRecognitionAlternative of the first SpeechRecognitionResult in the returned results list. We output the transcript itself to the output <p>, and log its confidence rating to the console.
recognition.addEventListener("result", (event) => {
  const speech = event.results[0][0].transcript;
  diagnostic.textContent = `Speech recognized: ${speech}.`;
  console.log(`Confidence: ${event.results[0][0].confidence}`);
});
Specifications
| Specification |
| --- |
| Web Speech API # dom-speechrecognition-start |
Browser compatibility