Vielen Dank für Ihren Besuch. Diese Seite ist momentan nur auf Englisch verfügbar. Wir arbeiten an der deutschen Version. Vielen Dank für Ihr Verständnis.

Understand the Video Skill API

The Video Skill API is a set of APIs that enable the far-field control of video devices and streaming services using an Alexa device. The Video Skill API provides both customers and developers with a consistent experience and interaction model.

Introduction

The Video Skill API is designed to enable customers to easily find and consume video content without invoking a specific skill. For example, a customer could say, "Alexa, play Manchester by the Sea" without specifying a provider or device. Through the Video Skill API, Alexa is aware of video devices and services a user has or subscribes to, and enables the user to control experiences across these devices and services by voice. The Video Skill API intelligently routes users to the content they want. In contrast, custom skills require users to invoke a skill by name, and result in individual skills that aren’t aware of the user’s broader set of devices and services.

For example, a video provider produces custom content, a new television series called "The Adventures of Sam". If that provider uses the Video Skill API to create an Alexa skill, that content can be ingested into Alexa’s content catalog. Alexa then understands the provider for the content. When a customer says, "Alexa, play The Adventures of Sam" , Alexa identifies the content provider, and walks the customer through the process of enabling the appropriate skill. Without the Video Skill API, a customer would have to remember that "The Adventures of Sam" came from a specific provider, make sure they had enabled that provider’s custom skill, and then ask the skill for the content.

The Video Skill API focuses on the video experience. Developers can take advantage of a streamlined setup process for customers and a set of APIs that are designed specifically for integrating video functionality. One of the main goals of the Video Skill API is to reduce developer friction and skill implementation time. To help with this, the Video Skill API team provides documentation and code examples for getting started quickly.

The Video Skill API exposes a variety of functionality including:

  • Searching and playing content
  • Playback controls such as pause and rewind
  • Launching an app or GUI shortcut
  • Recording video content
  • Channel navigation
  • Video catalog ingestion

Understand How a Video Skill Works

An Alexa video ecosystem contains the following:

  • Customer: The person interacting with the Alexa device and the subscriber to your video service
  • The Video Skill API: A service that understands the voice commands and converts them to directives (JSON messages) that are sent to video skills
  • AWS Lambda: A compute service offered by Amazon Web Services (AWS) that hosts the video skill code
  • Video skill: Code and configuration that interpret directives and sends messages to a video app or device to complete the customer request
Video Skill API

Who Can Develop Video Skills for Alexa

Anyone can develop a video skill. Developing video skills is relatively easy because the voice interactions are defined by Amazon. The Alexa service knows how to interpret the user’s speech and what messages to send to your video skills.

Generally, video skill development falls into two categories:

  • Developers who represent device manufacturers that want to enable voice interactions with their devices
  • Developers who represent video service providers that want to enable voice interactions with their video services

Prerequisites to Video Skill Development

In order to develop a Video Skill you must have the following:

  • An Amazon developer account. Sign up is free
  • A connected device with a cloud API or cloud-enabled video service
  • An Alexa device such as Amazon Echo
  • An AWS account. You host your skill code as an AWS Lambda function
  • Knowledge of Java, Node.js, Python or C# as Lambda functions can be written in any of these languages
  • An understanding of OAuth 2.0

Documentation Guide

Topic Description
Steps to Create a Video Skill Step-by-step guide to create a video skill
Message and Data Formats Describes the structure of messages to and from Alexa and how to format dates and other data.
State Reporting for Video Skills Describes when and how to send change reports to Alexa when the device state changes
Device Discovery Describes Alexa device discovery interface and discovery format.
Alexa.ErrorResponse and Alexa.Video.ErrorResponse Describes messages for error conditions.
Alexa.ChannelController Describes messages to change the channel.
Alexa.Launcher Describes messages to launch an app or shortcut.
Alexa.RecordController Describes messages to record the video content currently playing.
Alexa.RemoteVideoPlayer Describes messages to search and play content or display search results.
Alexa.PlaybackController Describes messages to control playback of content.
Alexa.PlaybackStateReporter Provides events for reporting the state of playback endpoints.
Alexa.VideoRecorder Describes messages to schedule a video content recording.
Entity Types for Video Content Describes how video entities are represented in search and play messages.
Launch Target Reference Lists app and shortcut identifiers that could be sent in an Launcher request.
Content Catalog Ingestion Provides details on the catalog ingestion process, which helps speech recognition for media entities such as channels, TV shows and movies.