It is simple mathematics: the more customers you reach, the more likely they are to discover and engage with your skill. That is why we always suggest developers localize and publish skills in as many languages as possible. Extending your code to other languages can be time-consuming, albeit worthwhile: translation takes time, you may need to hire external translators if you don't speak multiple languages, and, depending on how you architect your code, it can add significant overhead in terms of back-end development.
In today's post, I share some tips on how to streamline the localization process for your Alexa skills using the Alexa Skills Kit (ASK) Software Development Kit for Node.js. I'll teach you how to simplify your back-end development so that you can keep a single AWS Lambda function, regardless of how many languages you end up supporting. I also share some options on how to manage localizable content. This process is called internationalization, often abbreviated to "i18n" because there are 18 letters between the "i" and the "n". Note that "internationalization," "i18n," and "localization" will be used interchangeably in this post.
To keep our code maintainable and easily extensible with future languages, we want to keep our logic language-agnostic and use wildcards in our strings, so that a string like 'Hello %s!' combined with a value like 'Andrea' will turn into 'Hello Andrea!'.
First off, separate your strings from your logic, even in terms of files. Your working directory should look something like:
/
├── i18n/ // your language strings are here
│ ├── en.js
│ ├── de.js
│ ├── fr.js
│ ├── it.js
│ └── es.js
├── lib/
│ └── ... // your other logic
├── node_modules/
│ └── ... // your npm modules
└── index.js // your lambda entry point
We structure our language files as follows. You'll notice string IDs can contain either a string or an array of strings. If you choose to use an array of strings, the localization library will automatically pick a random value from the array, helping you give variety to your skill responses.
You will also notice we can add wildcards to the strings in the form of '%s' (for string-like variables) or '%d' (for number-like variables). How they work is simple: you just need to pass in an additional argument for every wildcard you have in your string, like: requestAttributes.t('GREETING_WITH_NAME', 'Andrea').
// en.js
module.exports = {
    translation: {
        'SKILL_NAME': 'Super Welcome', // <- can either be a string...
        'GREETING': [                  // <- or an array of strings.
            'Hello there',
            'Hey',
            'Hi!'
        ],
        'GREETING_WITH_NAME': [
            'Hey %s',       // --> That %s is a wildcard. It will
            'Hi there, %s', //     get turned into a name in our code.
            'Hello, %s'     //     e.g. requestAttributes.t('GREETING_WITH_NAME', 'Andrea')
        ],
        // ...more...
    }
}
Similarly, another locale would have the same keys, but different values, like this:
// it.js
module.exports = {
    translation: {
        'SKILL_NAME': 'Iper Benvenuto',
        'GREETING': [
            'Ciao!',
            'Ehilà!',
            'Buongiorno'
        ],
        'GREETING_WITH_NAME': [
            'Ciao %s',        // --> That %s is a wildcard. It will
            'Ehilà %s',       //     get turned into a name in our code.
            'Buongiorno, %s'  //     e.g. requestAttributes.t('GREETING_WITH_NAME', 'Andrea')
        ],
        // ...more...
    }
}
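To make the wildcard behavior concrete, here is a simplified sketch of the substitution the sprintf post-processor performs for us (an illustration only, with a hypothetical helper name, not the library's actual code):

```javascript
// Illustration only: a simplified stand-in for the sprintf
// post-processor's wildcard substitution (hypothetical helper).
function sprintfLite(template, ...values) {
    let i = 0;
    // replace each %s or %d, in order, with the next provided value
    return template.replace(/%[sd]/g, () => String(values[i++]));
}

console.log(sprintfLite('Hello, %s!', 'Andrea'));
// -> 'Hello, Andrea!'
console.log(sprintfLite('You have %d points, %s.', 42, 'Andrea'));
// -> 'You have 42 points, Andrea.'
```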
In order to make the above work, we need two node modules: i18next and i18next-sprintf-postprocessor.
npm i --save i18next i18next-sprintf-postprocessor
In our main index.js, we require the two node modules:
// in the index.js file, we add i18next and
// i18next-sprintf-postprocessor as dependencies
const i18n = require('i18next');
const sprintf = require('i18next-sprintf-postprocessor');
We also need to aggregate those language files inside the index.js file.
// further down the index.js
const languageStrings = {
    'en': require('./i18n/en'),
    'it': require('./i18n/it'),
    // ... etc
}
Now we need a little bit of code to adapt the generic (and open-source) i18next localization framework and make it work nicely with the SDK. Add the updated LocalizationInterceptor below. The interceptor automatically parses the incoming request, detects the user's locale, and picks the right language strings to use. It also combines the power of i18next with the sprintf functionality of i18next-sprintf-postprocessor, and automatically picks a response at random if a specific key (like 'GREETING') has an array of possible responses.
// inside the index.js
const LocalizationInterceptor = {
    process(handlerInput) {
        const localizationClient = i18n.use(sprintf).init({
            lng: handlerInput.requestEnvelope.request.locale,
            fallbackLng: 'en', // fall back to EN if the locale doesn't exist
            resources: languageStrings
        });
        localizationClient.localize = function () {
            const args = arguments;
            const values = [];
            for (let i = 1; i < args.length; i++) {
                values.push(args[i]);
            }
            const value = i18n.t(args[0], {
                returnObjects: true,
                postProcess: 'sprintf',
                sprintf: values
            });
            if (Array.isArray(value)) {
                return value[Math.floor(Math.random() * value.length)];
            }
            return value;
        };
        const attributes = handlerInput.attributesManager.getRequestAttributes();
        attributes.t = function (...args) { // pass on arguments to the localizationClient
            return localizationClient.localize(...args);
        };
    },
};
Don't forget to register the LocalizationInterceptor when you instantiate your Skill Builder object:
// at the bottom of the index.js
const skillBuilder = Alexa.SkillBuilders.standard();

exports.handler = skillBuilder
    .addRequestHandlers(
        // your intent handlers here
    )
    .addRequestInterceptors(LocalizationInterceptor) // <-- ADD THIS LINE
    .addErrorHandlers(ErrorHandler)
    .lambda();
We register this as a request interceptor so that the SDK runs it every time a request comes in, before any handler does its work, pre-filling our request attributes with the localization strings for the incoming locale.
Now that the setup is complete, using it in your handlers is a breeze! Here are some examples:
// IN THE CASE OF A SIMPLE GREETING, WITHOUT THE NAME
const LaunchRequestHandler = {
    canHandle(handlerInput) {
        const request = handlerInput.requestEnvelope.request;
        return request.type === 'LaunchRequest';
    },
    async handle(handlerInput) {
        // we get the translator 't' function from the request attributes
        const requestAttributes = handlerInput.attributesManager.getRequestAttributes();
        // we call it via requestAttributes.t, passing the string key we want as the argument
        const speechOutput = requestAttributes.t('GREETING');
        // -> speechOutput now contains a 'GREETING' at random, such as 'Hello there'
        return handlerInput.responseBuilder
            .speak(speechOutput)
            .getResponse();
    },
};
// IN THE CASE WE HAVE THE USER'S NAME
const LaunchRequestHandler = {
    canHandle(handlerInput) {
        const request = handlerInput.requestEnvelope.request;
        return request.type === 'LaunchRequest';
    },
    async handle(handlerInput) {
        const requestAttributes = handlerInput.attributesManager.getRequestAttributes();
        const sessionAttributes = handlerInput.attributesManager.getSessionAttributes();
        const username = sessionAttributes.username; // <-- let's assume = 'Andrea'
        const speechOutput = requestAttributes.t('GREETING_WITH_NAME', username); // <-- note the second argument
        // -> speechOutput now contains a 'GREETING_WITH_NAME' at random, such as 'Hello, %s',
        //    filled with the value we provided as the second argument, i.e. 'Hello, Andrea'.
        return handlerInput.responseBuilder
            .speak(speechOutput)
            .getResponse();
    },
};
The great thing about the i18next framework is that you don't need to use arrays; you can use plain static strings if you want, and likewise you don't need to use the '%s' wildcard feature.
In some cases, we want one language file to cover multiple languages: for example, en.js might cover en-US, en-GB, and en-IN. In other cases, we might want to override a specific string to make it more culturally relevant (e.g. sneakers vs. trainers, brilliant vs. awesome, etc.). How would we do this?
Let's say, for example, that we want to override en.js with a custom en-GB greeting for our UK users. Simple! We would have a dedicated en-GB.js file containing only the key/value pairs that change. i18next automatically picks the most specific language string available (similar to CSS selectors): if an "en-GB" string is available, it is picked over the "en" equivalent.
/
├── i18n/
│ ├── en.js
│ ├── en-GB.js // <-- we add a special en-GB file
...
└── index.js
Our language files would look like this:
// en.js
module.exports = {
    translation: {
        // ... other strings
        'SNEAKER_COMPLIMENT': 'Those are awesome sneakers!', // <-- default
        // ... other strings
    }
}
Then we would add an en-GB file that ONLY overrides strings we want to change for en-GB.
// en-GB.js
module.exports = {
    translation: {
        'SNEAKER_COMPLIMENT': 'Those are sick trainers!' // <-- en-GB override
    }
}
Then, in our index.js file, we will make sure to add it to the languageStrings object:
// inside the index.js
const languageStrings = {
    'en': require('./i18n/en'),
    'en-GB': require('./i18n/en-GB'),
    // ... etc
}
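To see why the override works, here is a simplified sketch of the most-specific-first lookup that i18next performs for us (an illustration only, with a hypothetical helper name, not the library's actual resolution code):

```javascript
// Illustration only: a simplified stand-in for i18next's
// most-specific-first key resolution (hypothetical helper).
const resources = {
    'en': { translation: { SNEAKER_COMPLIMENT: 'Those are awesome sneakers!' } },
    'en-GB': { translation: { SNEAKER_COMPLIMENT: 'Those are sick trainers!' } }
};

function resolveString(locale, key, fallbackLng) {
    // try the full locale first (e.g. 'en-GB'), then the base
    // language (e.g. 'en'), then the fallback language
    const candidates = [locale, locale.split('-')[0], fallbackLng];
    for (const lng of candidates) {
        const table = resources[lng] && resources[lng].translation;
        if (table && table[key] !== undefined) return table[key];
    }
    return key; // nothing matched: return the key itself
}

console.log(resolveString('en-GB', 'SNEAKER_COMPLIMENT', 'en'));
// -> 'Those are sick trainers!'
console.log(resolveString('en-US', 'SNEAKER_COMPLIMENT', 'en'));
// -> 'Those are awesome sneakers!' (falls back to 'en')
```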
We're done!
While this method is great for simple skills, as your skill grows more complex, your list of strings will grow, and so will the effort required to maintain and localize them. If that happens, it could make sense to ramp up your localization strategy with any of the options below.
Sometimes your back end already uses hardcoded strings for speech output all over the place, and sometimes there are just too many to process manually. Fortunately, there are plenty of localization support libraries for Node.js (even an i18next scanner) that can scan your code, extract translation keys/values, and merge them into i18n resource files ready to use in the format described above. To automate the process, you could even use gulp to run a string-extraction task and generate the string resources programmatically.
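As a rough sketch, assuming the i18next-scanner package and a standard gulp setup (the globs, task name, and option values below are assumptions to adjust for your project), such an extraction task could look like this:

```javascript
// gulpfile.js -- sketch of a string-extraction task using
// i18next-scanner (check the package docs for the exact options
// your version supports)
const gulp = require('gulp');
const scanner = require('i18next-scanner');

gulp.task('extract-strings', function () {
    return gulp.src(['index.js', 'lib/**/*.js'])   // files to scan (assumed layout)
        .pipe(scanner({
            lngs: ['en', 'de', 'fr', 'it', 'es'],  // languages to generate
            func: { list: ['t', 'requestAttributes.t'] }, // translation calls to look for
            resource: {
                loadPath: 'i18n/{{lng}}.json',     // existing resources to merge with
                savePath: 'i18n/{{lng}}.json'      // where to write the result
            }
        }))
        .pipe(gulp.dest('./'));
});
```

Running the task (e.g. `npx gulp extract-strings`) would then regenerate one resource file per language, pre-filled with every key found in your handlers.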
It might be worth using a CMS or proper localization framework that allows external (non-technical) people to collaborate and provide translations for individual strings without giving access to your production environment. You would then export those strings and generate the language string files that your skill will read.
Some skill developers use cloud-based user friendly databases and ask their beta testers to contribute string resources and translations. Others prefer more i18n specific services where you can crowdsource the translation of strings to then manage them via API. Whatever you choose, the more strings your skill is dealing with the more necessary it becomes to centralize the management of these resources.
A fallback strategy for when your back end gets incoming requests in unsupported locales is to use a machine-learning-based translation service like Amazon Translate. In a Node.js-based AWS Lambda function, you would use the AWS SDK to get an AWS.Translate instance, passing parameters like the source language, the destination language, and the text to translate. Fortunately, AWS Lambda makes it very easy to connect to other AWS services: it already includes the AWS SDK as part of the execution environment, so you don't have to add it manually as a dependency. Additionally, it automatically sets the credentials required by the SDK to those of the IAM role associated with your function, so you don't need to take any additional steps.
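A minimal sketch of such a fallback, assuming your Lambda role has the translate:TranslateText permission (buildTranslateParams and translateFallback are hypothetical helper names):

```javascript
// Sketch: fall back to Amazon Translate when a locale isn't supported.
// buildTranslateParams and translateFallback are hypothetical helpers.
function buildTranslateParams(text, targetLocale, sourceLang = 'en') {
    return {
        SourceLanguageCode: sourceLang,
        TargetLanguageCode: targetLocale.split('-')[0], // e.g. 'pt-BR' -> 'pt'
        Text: text
    };
}

async function translateFallback(text, targetLocale) {
    // the AWS SDK is already part of the Lambda execution environment,
    // so we can require it without adding it as a dependency
    const AWS = require('aws-sdk');
    const translate = new AWS.Translate();
    const data = await translate
        .translateText(buildTranslateParams(text, targetLocale))
        .promise();
    return data.TranslatedText;
}
```

You could call translateFallback from your handler whenever the incoming locale has no entry in languageStrings, keeping in mind that machine translation is best treated as a stopgap, not a replacement for reviewed translations.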