Get Started
Add Visuals and Audio to Your Skill
- Tutorial: Add Your First Visual Response to a Custom Skill
- What Makes Up an APL Visual Response?
- Test APL Examples with the Code Sandbox
What's New in APL
APL Known Issues
Create Visual Experiences for Your Skill
Add APL Support to Your Skill
- Configure Your Skill with the APL Interface
- Add APL Support to Your Skill Code
- Test APL Skills in the Developer Console Simulator
Build Documents in the Developer Console
- Create and Edit an APL Document
- Import a Lottie Animation
- Import a Scalable Vector Graphic (SVG) (Beta)
- Import and Export APL Documents
- Preview an APL Document
- Experiment with APL Examples in the Authoring Tool
Support Different Types of Devices
- Build Responsive APL Documents
- Support Tablets and Other Devices that Can Change Size
- Select the Viewport Profiles Your Skill Supports
- Support Devices with Character Displays
Display Content on the Screen
- Display Text on the Screen
- Combine Content with Backgrounds, Borders, and Headers
- APL Support for Item Selection
Combine Visual Content with Alexa Speech and Audio
- Synchronize Spoken Text with Text on the Screen
- Integrate Visual and Audio Responses
Host Layouts, Graphics, and Other Resources in an APL Package
Enable User Interactions in Your Visual Content
Display a Widget
About Widgets
Add a Widget to Your Skill
- Create and Manage Widgets
- Test a Widget
Widgets Reference
Use Pre-built Templates and Components
Alexa Design System for APL
- Viewport Profiles
- Styles and Resources
- Responsive Components and Templates
- Alexa Icon Package
  - Alexa Icon
  - Alexa Icon Library Reference
APL Best Practices
APL Best Practices for Developers
- Plan Your APL Experience
- Build Your APL Templates
APL Accessibility Guide
- Build APL Visuals that Support Screen Readers
- Best Practices for Screen Reader Support
Reduce Latency for your APL Documents
APL Cheat Sheets
APL Reference
APL for Screen Devices
- Documents and Packages
- Styles
  - Style Definition and Evaluation
  - Styled Properties
- Data Sources and Data Binding
- Components
- Commands
- Bound Variables
- Filters
- Vector Graphics Format (AVG)
- Keyboard Events and Handlers
- Tick Handlers
- Visibility Change Event Handlers
- Gestures
  - DoublePress
  - LongPress
  - SwipeAway
  - Tap
- Extensions
APL for Audio
- Documents
- Data Sources and Data Binding
- Components
- Filters
APL for Character Displays
- Document
- Inter-Segment Characters
- Viewport Information
- Data Types
- Components
  - Component
  - Container
  - Pager
  - Text
  - TimeText
- Standard Commands
APL Interface Reference
- Alexa.Presentation.APL Interface (Screen Devices)
- Alexa.Presentation.APLA Interface Reference (Audio)
- Alexa.Presentation.APLT Interface (Character Displays)
- Alexa.DataStore Interface Reference
- Alexa.DataStore.PackageManager Interface
- APL Visual Context in the Skill Request
- Widget Information in the Skill Request
APL Package Reference
Data Store REST API Reference

Add Visuals and Audio to Your Skill

Note: Learn how to improve your skills with APL with Build visually rich experiences using APL at the Alexa Learning Lab.

Create a visual experience for your skill with graphics, images, slideshows, video, and animations using Alexa Presentation Language (APL). APL is a responsive layout language that lets you build visuals to render on Alexa-enabled multimodal devices. You can also build audio responses that mix and layer multiple Alexa voices, sound effects, and background music with APL for audio. You can combine audio and visual responses.

Visual and audio responses work on devices with screens, such as the Echo Show, TVs, and Alexa-enabled tablets. Audio responses also work on speaker devices such as the Amazon Echo and Echo Dot.

The APL content is part of the skill response

Custom voice model skills use a request and response interface. Alexa sends your AWS Lambda function or web service a request, such as a LaunchRequest or IntentRequest. Your skill handles this request and returns a response.

APL works within this framework. When your skill returns a response, you include a directive to display a visual response or play an audio response. You pass the directive two items:

An APL document, which is a JSON object that defines either a visual or audio template. The document provides the structure and layout for the response. Conditional logic in the document lets the template adapt to different devices and situations.
An APL data source, which is a JSON object you define. The data source provides the content to populate the template. You use a data source for the content that might change when the user invokes your skill. This approach lets you separate the visual or audio presentation from the data.

User: Alexa, open Hello World and say hello.

Alexa sends your skill an IntentRequest. Your skill returns a response with speech and visual content.
Alexa: Hello World! (Alexa speaks this response and displays the visual content on the screen at the same time.)

The following example shows an APL document that displays a "Hello World" visual response

The following example shows an APL for audio document that plays an audio response.

Users interact with the APL response in different ways

Users can interact with Alexa-enabled devices with screens. For example, users can tap buttons on Echo Show devices, or use a remote to navigate the screen and select items on Fire TV devices. Users can also speak their requests to the skill, as they would with any Alexa device.

A visual response you build with APL can take advantage of these input modes. You define buttons and other touchable items in your APL document. This items run commands. A command can change the presentation on the screen, such as by changing the text the user sees on the screen. A command can also send a message to your skill in a request. You write handlers for these requests, similar to the intent handlers you write for voice requests like IntentRequest.

For speech interactions, you define intents to capture spoken requests and intent handlers to handle those requests in your code. When Alexa sends your skill an IntentRequest, the request includes information about the APL content displayed on the screen. Your handler can use this information to provide a relevant response.

APL works on different types of devices

You can use APL present both audio and visual content:

Play rich audio content on all Alexa devices with APL for audio.
Display content on devices with screens, such as the Echo Show, Fire Tablet, and Fire TV. APL provides full support for user interaction and rich content, such as images, video, and animation.

Devices with screens come in different shapes and sizes. You can use conditional logic to adapt your design to the device. For example, you might display a horizontal list on a landscape device, but a vertical list on a portrait device.
Display content on devices with alphanumeric clock displays, such as the Echo Dot with clock. You can use a smaller set of APL features to display content on these devices.

APL supports showing alphanumeric data on the display. These devices also support unique features like the ability to marquee text and show timers and countdowns. For details, see Understand Alexa Presentation Language and Character Displays.

The APL concepts are the same regardless of the device you target.

Learn more about the parts of APL

For more about all the different parts of APL you use when building an audio or visual response, see What Makes Up an APL Visual Response?.

Get started with a tutorial or training course

To get started with a short tutorial that introduces APL, see Tutorial: Add Your First Visual Response to a Custom Skill.

To learn from an online course on APL, see the Alexa Learning Lab. Start with the Build visually rich experiences using APL curriculum.

Was this page helpful?

Provide feedback

Last updated: Nov 28, 2023