Tips for Building an Alexa Skill with SLU in Mind

Tia Francis Nov 13, 2020
Share:
Build
Blog_Header_Post_Img

To create a successful Alexa skill, something to consider is how Spoken Language Understanding (SLU) is interpreted by Alexa. This blog will give you a brief overview of SLU and tips on how you can create proper example phrases, build robust sample utterances, and not duplicate sample utterances or slot values within intents. 

How SLU Works
SLU is the process of how Alexa recognizes a user interaction. This interaction begins with the skill’s wake word.
The wake word for an Echo device can be one of the following preset names: Alexa, Amazon, Echo, and Computer. To initiate interaction with an Echo device, a user must first say the wake word. From there, users can then tell Alexa what they would like to do next. 

Here is an example of a user utterance that would begin a skill interaction:
“Alexa, ask Horoscope Reader my horoscope for Leo today.”

In this example, “Alexa” is the wake word, “ask” is the launch phrase, “Horoscope Reader” is the skill name, and “my horoscope for Leo today” is the request or utterance. 

If the device does not understand the wake word, the user cannot complete their interaction. 

From the wake word, the next step in the SLU process is Automatic Speech Recognition (ASR). ASR is when speech from the user is converted into text. 

After the speech recognition stage, the top result is passed to Natural Language Understanding (NLU). 
NLU is how Alexa interprets the users’ command. If Alexa does not understand the launch phrase or request, she might suggest something similar or come back with an error response like, “Sorry, I didn’t understand that.” For the best user experience, the phrase should be predictable.

The speech invokes the TTS (text-to-speech) service which converts text (or more specifically SSML – the Speech Synthesis Markup Language) into audio (i.e. Alexa’s voice). The speech sends this audio to the device (e.g. Echo Dot), which renders it through its speakers.

How to build a successful skill with SLU in mind 
Now that you have an understanding of how SLU works, here are some tips and resources for how you can create proper example phrases, building robust sample utterances, and not duplicate sample utterances or slot values within intents in order to help improve the user experience for your skill. 

Example Phrases:
Example phrases are the phrases displayed with the skill in the skills store. They act as suggestions on how to interact with the skill and highlight its key functionalities. 

The basic structure of how an example phrase works can be found in our documentation, Understanding How Users Invoke Custom Skills, and can be briefly explained with the following:

Wake word: This is “Alexa” by default on Alexa devices, but can be adjusted by customers based on their preferences. 
Launch word: As specified in our documentation, this includes a number of starting phrases including "open," "ask," "start," and more. 
Invocation name: This is the invocation name you assigned to your skill under Skill Information within the skill-creation workflow.
Connecting word: These are words used to connect the launch word to utterances and include "and," "to," "for,” “when,” and more. For a full list, please see Understanding How Users Invoke Custom Skills
Utterance: These are required and should be modeled based on the sample utterances within your interaction model.

It is important to Review and Test Example Phrases so the user experience with the skill is positive. After you have created example phrases, make sure to test each one to ensure the skill can be launched successfully. 

Sample Utterances:
While building your skill, you’ll likely have created sample utterances. The usability of a skill is dependent on how well the sample utterances and custom slot values represent natural conversation sentence structure. To dive deeper, see our documentation Best Practices for Sample Utterances and Custom Slot Type Values. Some key points are listed below.

An example of this would be if a user wanted to interact with the Daily Horoscope skill. The developer would have needed to create as many sample utterances they could think of so the interaction is smooth. 

For example, users might say one of the following to request a horoscope:

  • "Alexa, ask Daily Horoscopes for the horoscope for Gemini"

For each intent, include as many variations of the phrases as you expect users to speak. For example, for the utterance "what is my horoscope", include variations such as:

  • "what is…"
  • "what's…"
  • "tell me…"

It is better to provide too many samples than to provide too few, so test different phrases and add additional phrases as needed.

Custom Slot Types: 
When using custom slot types, it is important to ensure the skill covers any expected dialogue from the user. Tips from our documentation, Best Practices for Sample Utterances and Custom Slot Type Values, include:

  • Start with data sources that reflect real usage. 
  • The total number of custom slot type values depends on the overall size of your interaction model, including the total number of slot types and values across all custom slots.
  • If your list of custom values does not contain all the values your skill expects, provide a representative set of custom values with representative word counts. 
  • When you edit a custom slot type in the developer console, suggested values that might be useful for your slot type are displayed at the top. See Use suggested slot values.


Test your utterances, check for conflicts, and revise
As you create your utterances and slot values, it’s possible to duplicate utterances that map to more than one intent in your model. To make sure everything is in order, you can use the developer console to find these utterance conflicts and make any appropriate updates. The utterance profiler and NLU evaluation tool are also great resources that you can use to test your utterances and measure the accuracy of your model.

Some examples of what these utterance conflicts may look like can be found at our documentation Find Utterance Conflicts in Your Model and are outlined below:

  • Conflict between custom intents
    • Occurs when an utterance maps to two or more of your custom intents. Typically this happens when multiple intents have the same sample utterances, or the sample utterances use slot types with duplicate slot values. If you want to ensure that a given utterance always resolves to a particular intent, change your interaction model to fix the conflict.
  • Conflict between a built-in intent and custom intent
    • Occurs when an utterance maps to both your custom intent and one of the standard built-in intents. For example, a custom intent with the utterance "help" conflicts with the built-in AMAZON.HelpIntent. Alexa prioritizes your custom intent over the built-in, so in this example, "help" always resolves to your intent instead of the built-in. If you want the utterance to map to your custom intent rather than the built-in, leave this conflict in place.
  • Conflict between a built-in intent and multiple custom intents
    • Occurs when an utterance maps to both a built-in intent, and multiple custom intents. Alexa ignores the built-in intent and attempts to resolve the correct custom intent.
  • Conflict between built-in intents extended with custom utterances

Note that conflicts between utterances that use the AMAZON.SearchQuery slot types are not included.

Utterance conflicts are generated after you successfully do a full model build. If any conflicts are detected, the developer console displays an alert. On the Build page, navigate to Custom > Interaction Model > Utterance. From there you will be able to make any necessary updates.

SLU is an integral aspect of understanding how to build a successful Alexa skill. By using the above guides and resources, users will be able to more easily navigate and interact with your skill.

Related Articles

Improving Natural Language Understanding Accuracy of Your Alexa Skills

Alexa’s NLU technology helps map customer utterances to the correct response, making it the most critical ingredient in the recipe for your skill’s success.

 

Subscribe