
Background

Automotive Grade Linux (AGL) is a collaborative open source project that brings together automakers, suppliers, and 
technology companies to accelerate the development and adoption of a fully open software stack for the connected car. 
As a member of the speech expert group, the Amazon (Alexa Automotive) team intends to collaborate on defining the voice 
service APIs in the AGL platform.

...

     <Yet to be standardized>

  }

}

3) Configuration

  • Provides a mechanism for OEMs to configure its functionality. OEMs should be able to configure:
    • List of active agents
    • Roles and responsibilities of each agent
    • Language setting
    • Default Agent
    • Enable/Disable Fallback Invocation mode
    • Enable/Disable Agent Switching during multi turn dialog
    • ... more
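
As an illustration of the options listed above, the following is a minimal sketch of how such a configuration might be represented. The struct name, field names, and the validation rule (the default agent must be one of the active agents) are assumptions made for this example; the actual configuration format is not defined in this section.

```cpp
#include <algorithm>
#include <map>
#include <string>
#include <vector>

// Hypothetical container for the OEM-configurable options.
// All names here are illustrative, not part of the AGL specification.
struct VoiceServiceConfig {
    std::vector<std::string> activeAgents;                         // list of active agents
    std::map<std::string, std::vector<std::string>> agentRoles;    // roles per agent
    std::string language;                                          // language setting, e.g. "en-US"
    std::string defaultAgent;                                      // default agent
    bool fallbackInvocation = false;                               // fallback invocation mode
    bool agentSwitchingInDialog = false;                           // agent switching during multi-turn dialog
};

// Assumed consistency rule: at least one agent is active and the
// default agent is one of the active agents.
inline bool isValid(const VoiceServiceConfig& cfg) {
    if (cfg.activeAgents.empty()) {
        return false;
    }
    return std::find(cfg.activeAgents.begin(), cfg.activeAgents.end(),
                     cfg.defaultAgent) != cfg.activeAgents.end();
}
```

A binding could run a check like `isValid` once at startup and refuse to load an inconsistent configuration, but that policy is a design choice rather than something this document mandates.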

...

vshl/setActive

Activate or deactivate a voice agent.

"permission": "urn:AGL:permission:speech:public:accesscontrol"


Request:
{
  "agent_id": "integer",
  "is_active": "boolean"
}


Response:
{
  "jtype": "afb-reply",
  "request":
  {
    "status": "string" // success, bad-state, or bad-request
  }
}
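
The request/response contract of vshl/setActive can be sketched as follows. This is only an illustration: the `AgentRegistry` class and its methods are hypothetical, and the mapping of conditions to status strings (an unknown agent yields bad-request, a request that does not change the current state yields bad-state) is an assumption, since this document does not define when each status is returned.

```cpp
#include <map>
#include <string>

// Hypothetical in-memory registry of voice agents, keyed by agent_id.
class AgentRegistry {
public:
    // Registers an agent in the inactive state.
    void addAgent(int agentId) { active_[agentId] = false; }

    // Mirrors the vshl/setActive request fields and returns the
    // "status" string of the reply.
    std::string setActive(int agentId, bool isActive) {
        auto it = active_.find(agentId);
        if (it == active_.end()) {
            return "bad-request";   // assumed: unknown agent_id
        }
        if (it->second == isActive) {
            return "bad-state";     // assumed: agent already in the requested state
        }
        it->second = isActive;
        return "success";
    }

private:
    std::map<int, bool> active_;
};
```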

...

afb-voiceservice-wakeword-detector

...

  • Provides an interface primarily for the core afb-voiceservice-highlevel to listen for wakeword detection events and make request routing decisions.

  • This binding will internally talk to, or host, the wake word solutions specific to each voice assistant vendor to enable wake word detection.


Voice Agent Vendor Software

1) voice-agent-binding

  • The API specification of the voice agent is defined in this document. All vendor-specific voice agent bindings will follow the same specification to integrate with the high level voice service.
  • Voice Agent will listen to audio input when instructed by the high level voice service.
  • Voice Agent will run its own automatic speech recognition and natural language processing, and generate intents to perform the requested action.
  • Voice Agent will have its own authentication, connection, and dialog management flows, and will generate events to notify the high level voice service of its state transitions.
  • Voice Agent will use the high level voice service's interaction manager to command system applications to perform tasks, such as routing to a specific geocode, dialing a number, or playing music.
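
The event-notification behaviour described above might look roughly like the sketch below, in which a voice agent notifies subscribers (for example, the high level voice service) whenever its dialog state changes. The state names, the `VoiceAgent` class, and the callback signature are all hypothetical and chosen only for illustration; a real binding would publish events through the AGL application framework instead of direct callbacks.

```cpp
#include <functional>
#include <utility>
#include <vector>

// Hypothetical dialog states; the actual set is vendor specific.
enum class DialogState { Idle, Listening, Thinking, Speaking };

class VoiceAgent {
public:
    using Listener = std::function<void(DialogState from, DialogState to)>;

    // The high level voice service would register a listener here.
    void subscribe(Listener listener) { listeners_.push_back(std::move(listener)); }

    // Called when the high level voice service instructs the agent to listen.
    void startListening() { transition(DialogState::Listening); }

    // Moves to a new state and notifies every listener of the transition.
    // A request for the current state is ignored and produces no event.
    void transition(DialogState next) {
        if (next == state_) {
            return;
        }
        const DialogState previous = state_;
        state_ = next;
        for (auto& listener : listeners_) {
            listener(previous, next);
        }
    }

    DialogState state() const { return state_; }

private:
    DialogState state_ = DialogState::Idle;
    std::vector<Listener> listeners_;
};
```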

...