Xmos Vocalfusion xvf3510 User manual

VocalFusion®XVF3510 USER GUIDE

V4.0

XM-014232-PC 2

CONTENTS

1. vocalFusion®VF3510 User Guide.......................................................................5

1.1. Scope of document...................................................................................................... 5

1.2. XVF3510 Far-field Voice processors............................................................................ 5

1.3. System Block Diagrams ............................................................................................... 6

1.3.1. XVF3510-INT Configuration .......................................................................... 6

1.3.2. XVF3510-UA Configuration........................................................................... 6

1.4. Device firmware and configuration .............................................................................. 7

2. XVF3510 Voice Processor Architecture ..............................................................8

2.1. Overview ...................................................................................................................... 8

2.2. Audio Processing Pipeline ........................................................................................... 8

2.3. ASR and Communication Processing .......................................................................... 9

2.4. XVF3510-INT - For integrated voice interface applications ......................................... 9

2.5. XVF3510-UA - For USB accessory voice interface applications ............................... 10

3. Principles of configuration, control and usage .................................................11

3.1. Firmware release package......................................................................................... 11

3.1.1. “bin” Directory............................................................................................. 12

3.1.2. “data-partition” Directory ............................................................................ 12

3.1.3. “host” Directory........................................................................................... 12

3.2. Required Tools ........................................................................................................... 13

3.2.1. xTIMEcomposer.......................................................................................... 13

3.2.2. Python 3 ...................................................................................................... 14

3.2.3. Host build tools ........................................................................................... 14

3.3. Command-line interface (vfctrl).................................................................................. 14

3.4. Configuration via Control interface............................................................................. 15

3.5. Control operation........................................................................................................ 15

3.5.1. Host Application ......................................................................................... 16

3.5.2. Device Application...................................................................................... 16

3.6. Configuration via Data Partition.................................................................................. 16

3.7. XVF3510 Development kits ........................................................................................ 16

3.8. Updating the firmware................................................................................................ 18

3.9. Operation ................................................................................................................... 19

3.9.1. XVF3510-INT Amazon AVS demonstration................................................. 19

3.9.2. XVF3510-UA USB connected demonstration ............................................. 20

3.9.3. Host Utilities ................................................................................................ 22

3.10. Default operation........................................................................................................ 23

XM-014232-PC 3

4. XVF3510 Features and Configuration ...............................................................24

4.1. Booting ....................................................................................................................... 24

4.1.1. Flash storage structure ............................................................................... 24

4.1.2. Programming the Factory Boot image and Data Partition .......................... 25

4.1.3. Upgrade Images and Data Partitions ......................................................... 25

4.1.4. Factory restore ............................................................................................ 27

4.1.5. Boot Image and Data Partition Compatibility checks ................................. 27

4.1.6. Custom flash memory devices.................................................................... 28

4.1.7. SPI Slave Boot ............................................................................................ 29

4.2. Configuration and the Data Partition .......................................................................... 30

4.2.1. Data Partition definition............................................................................... 30

4.2.2. Generating a Data Partition for custom applications.................................. 31

4.2.3. Serial Number ............................................................................................. 32

4.3. Device Interfaces, ...................................................................................................... 32

4.3.1. USB Interface.............................................................................................. 33

4.3.2. USB HID...................................................................................................... 34

4.3.3. I2C Slave Control interface (XVF3510-INT only)......................................... 35

4.3.4. General Purpose Input and Output and Peripheral Bridging ..................... 35

4.3.5. General purpose I/O pINS .......................................................................... 36

4.3.6. General purpose inputs .............................................................................. 36

4.3.7. General purpose outputs ............................................................................ 38

4.3.8. I2C Master peripheral interface (XVF3510-UA Only)................................... 40

4.3.9. SPI Master................................................................................................... 42

4.4. Audio Routing and Filtering........................................................................................ 45

4.4.1. Signal Routing and Scaling......................................................................... 46

4.4.2. General Purpose Filter ................................................................................ 49

4.5. Far-Field Voice Processing ........................................................................................ 51

4.5.1. PDM microphone interface ......................................................................... 51

4.5.2. Automatic Echo Cancellation (AEC) ........................................................... 52

4.5.3. Automatic Delay Estimation & Correction (ADEC)...................................... 53

4.5.4. Interference canceller................................................................................. 57

4.5.5. Noise Suppressor (NS) ............................................................................... 58

4.5.6. Automatic Gain Control (AGC) and Loss Control ....................................... 58

4.5.7. Alternative Architecture mode (ALT_ARCH)............................................... 60

XM-014232-PC 4

5. Additional Information .......................................................................................63

5.1. Documentation ........................................................................................................... 63

5.2. Device firmware and drivers ...................................................................................... 63

6. Revision History ................................................................................................63

Appendices ...............................................................................................................64

APPENDIX A: PARAMETER SUMMARY .................................................................................... 65

APPENDIX B: BOOT STATUS CODES (RUN_STATUS) ............................................................ 75

APPENDIX C: EXAMPLE .SPISPEC FILE FORMAT ................................................................... 76

APPENDIX D: SPI BOOT CUSTOM CONNECTION .................................................................. 77

APPENDIX E: USB ENUMERATION .......................................................................................... 78

APPENDIX F: USB HID - EXAMPLE USING THE DEVELOPMENT KIT ..................................... 79

APPENDIX G: GENERAL PURPOSE FILTER EXAMPLE............................................................ 80

APPENDIX H: COMMAND TRANSPORT PROTOCOL .............................................................. 81

APPENDIX J: CAPTURING PACKED SAMPLES ....................................................................... 84

XM-014232-PC 5

1. VOCALFUSION®XVF3510 USER GUIDE

1.1. SCOPE OF DOCUMENT

The VocalFusion® XVF3510 User guide is written for system architects and engineers designing far-

field voice systems using the XVF3510 Voice processor. The document describes typical usage

models, the processor architecture, key feature operation, and interface definitions. In conjunction with

the product datasheet, these two documents provide all the information required for system design,

from concept to production testing and verification.

It is expected that this document is read in conjunction with the relevant datasheet and that the user

is familiar with basic voice processing terminology.

NOTE: This issue of the user guide covers the functionality supported by version 4.0 of the XVF3510

application firmware.

1.2. XVF3510 FAR-FIELD VOICE PROCESSORS

The XMOS XVF3510 range of processors use microphone array processing to capture clear, high-

quality voice from anywhere in the room. XVF3510 processors use highly optimised digital signal

processing algorithms implementing ‘barge-in', point noise and ambient noise reduction to increase

the Signal-to-Noise Ratio (SNR) achieving a reliable voice interface whatever the environment.

The XVF3510 processor is designed for seamless integration into consumer electronic products

requiring voice interfaces for Automatic Speech Recognition (ASR), communications or conferencing.

In addition to its class-leading voice processing, the XVF3510 Voice processor provides a

comprehensive set of interfaces and configuration options to simplify the integration of a voice

interface into a wide range of system architectures. This includes specific features required in TV and

set-top box applications, including audio switching and digital inputs and outputs that support

switches and LED indicators.

The XVF3510 Voice processor is initially configured with data stored from a flash memory device or

sent from a host processor. The Device Firmware Upgrade function of the processor allows in field

upgrade ensuring all products can benefit from the latest releases. While the Voice processor is

running, this configuration can be modified by the host system over the XVF3510 control interface. The

control interface also allows the host system to control peripheral devices and obtain status information

from the device and its digital inputs.

Two variants of the XVF3510 are available which have been optimised for different application use

cases. These two variants require different firmware to be loaded onto the device.

Table 1-1 XVF3510 variants

PRODUCT

KEY FEATURES

TARGET APPLICATION

XVF3510-INT

Far-field voice interface

Audio interface: I2S (Slave)

Control interface: I2C (Slave)

Device Firmware Upgrade: I2C (Slave)

Voice interface integrated into the product

XVF3510-UA

Far-field voice interface

Audio interfaces: USB UAC1.0 (and optionally

I2S Master)

Control interface: USB2.0 Full Speed

Device Firmware Upgrade: USB

USB plug-in voice accessory, and integrated

products using USB

XM-014232-PC 6

These application use cases are described in more detail in the following sections.

1.3. SYSTEM BLOCK DIAGRAMS

1.3.1. XVF3510-INT CONFIGURATION

The XVF3510-INT device has been optimized for integration on a system board. A standard I2C

interface is provided to enable the main processor on the system board to configure and monitor the

XVF3510-INT. The processed voice signal is output over an I2S bus to the host system and the

XVF3510 receives its I2S audio reference signal for the Acoustic Echo Cancellation function.

Figure 1-1 XVF3510-INT Integrated configuration

1.3.2. XVF3510-UA CONFIGURATION

The XVF3510-UA device replaces the I2C interface of the XVF3510-INT with a USB2.0 compliant PHY

which supports a UAC1.0 audio device for both reference signal input and processed audio output.

In addition, the USB device supports a standard USB Endpoint 0 for device control and a standard

USB HID for status events. An optional I2S master interface is also available on the device to output

an audio signal to an external audio device.

The following block diagram illustrates the typical configuration.

Figure 1-2 XVF3510-UA Configuration for USB-only use case

XM-014232-PC 7

In addition to the standard USB configuration shown above, the XVF3510-UA also supports an

alternative configuration in which the AEC reference signal is supplied over an I2S bus.

Figure 1-3 XVF3510-UA Configuration using I2S audio reference

1.4. DEVICE FIRMWARE AND CONFIGURATION

The operation of the XVF3510 device is controlled through a firmware image that is loaded onto the

device when it is powered up. Two modes of operation are supported:

} The firmware image can either be stored in a QSPI Flash device which is read by the XFV3510

processor automatically, or

} The firmware image is downloaded to the XVF3510 processor over the SPI interface by the host

processor on the system board.

Selection of the boot mode is made via setting the QSPI_D1/BOOTSEL pin on the device as described

in the datasheet.

The firmware image configures the XVF3510 into a standard, default operational mode. This mode can

be modified at startup via a set of configuration parameters that are stored in the flash device along

with the firmware in the XVF3510 Data Partition. These commands can be used to reconfigure the

device during startup, and also initialise other devices attached to it.

If the device firmware is downloaded from the host, then the data partition is not required and the

device is configured directly over the control interface.

XM-014232-PC 8

2. XVF3510 VOICE PROCESSOR ARCHITECTURE

2.1. OVERVIEW

The core of the XVF3510 Voice processor is a high-performance audio processing pipeline that takes

its input from a pair of the microphone and executes a series of signal processing algorithms to extract

a voice signal from a complex soundscape. The audio pipeline can accept a reference signal from a

host system which is used to perform Acoustic Echo Cancellation (AEC) to remove audio being played

by the host. The audio pipeline provides two different output channels - one that is optimized for

Automatic Speech Recognition systems and the other for voice communications.

Flexible audio signal routing infrastructure and a range of digital inputs and outputs enable the

XVF3510 to be integrated into a wide range of system configurations, that can be configured at start

up and during operation through a set of control registers.

In addition, the XVF3510-UA variant supports a standard USB PHY interface which supports a UAC

audio device and device control over USB. The following sections describe the voice pipeline and the

surrounding infrastructure in more detail.

2.2. AUDIO PROCESSING PIPELINE

The audio processing pipeline is common to both the XVF3510-UA and XVF3510-INT firmware

variants. The signal processing chain is described below, with individual blocks and usage described

in more detail in subsequent sections.

The XVF3510 audio processing pipeline takes inputs from a pair of MEMS Pulse Density Modulation

(PDM) microphones and uses advanced signal processing to create audio streams suitable for use in

Automatic Speech Recognition (ASR) and voice communication applications. The pipeline enhances

the captured audio stream using a set of complementary signal enhancement and noise reduction

processes.

Figure 2-1 The XVF3510 audio pipeline

The pipeline takes its input from a pair of low-cost PDM microphones and converts this signal to PCM

for further processing:

}

Acoustic Echo Cancellation (AEC):

Continuously modelling the room acoustics allows the AEC

to remove audio being played into the room by the product which the XVF3510 is a component

XM-014232-PC 9

of. A reference copy of the audio is provided to the AEC in order for it to accurately estimate the

echo.

}

Automatic Delay Estimation & Control (ADEC):

Automatically monitors and automatically

compensates for the delay between the reference audio and the echo received by the

microphone.

Following echo cancellation, the ASR and communications paths diverge to permit parameter tuning

appropriate for the individual audio output use cases.

}

Interference Cancellation (IC):

Suppresses static noise from point sources such as cooker

hoods, washing machines, or radios for which there is no reference audio signal available.

}

Voice Activity Detection (VAD):

Controls adaption the IC and AGC to optimise output for near-

end speech.

}

Noise Suppression (NS):

Suppresses diffuse noise from sources whose frequency

characteristics do not change rapidly over time (i.e., diffuse stationary noise).

}

Automatic Gain Control (AGC):

Controls the audio output level via separate AGC channels for

Automatic Speech Recognition (ASR) and communications output. The VAD is used to prevent

gain changes during speech to improve speech recognition performance.

The pipeline has been designed to minimise the need to tune and modify these functions. However, if

required for specific use cases, these later sections of this document provide details of the relevant

parameters and processes.

2.3. ASR AND COMMUNICATION PROCESSING

The audio pipeline discussed above produces two separate audio streams, one specifically tuned for

integration with keyword and ASR services and the other designed for conferencing and

communication applications. Both processed audio streams are available to be output at the same

using the left and right channels of USB and I2S. The default configuration is as follows:

Table 2-1 Default channel mapping (both USB and I2S)

CHANNEL

DEFAULT

[0] - Left

Automatic Speech Recognition (ASR) optimised

[1] - Right

Communications

In situations where an ASR is used to invoke a call it may be necessary to continually monitor the ASR

channel for a ‘end call’ intent. The parallel output of both ASR and Communications processed streams

allow the combination of high-quality calling audio with the tuned ASR capability.

The IO_MAP configuration parameter (see

Signal flow and processing

section) allows users to also

configure both channels to be ASR or Communications if required.

2.4. XVF3510-INT - FOR INTEGRATED VOICE INTERFACE APPLICATIONS

The XVF3510-INT product embeds the core audio processing pipeline in an audio infrastructure that

supports rate conversion, filtering and signal routing. This infrastructure is controllable by the host

system via a set of control registers. In addition, the XVF3510-INT provides a set of peripheral

interfaces to the host system to other devices, eg digital inputs, LEDs, SPI peripherals etc.

The peripheral interfaces supported include an interface to an optional QSPI Flash device containing

the XVF3510 firmware and configuration information that is loaded by the processor on start-up.

XM-014232-PC 10

The system architecture of the XVF3510-INT is shown below.

Figure 2-2 XVF3510-INT System architecture

2.5. XVF3510-UA - FOR USB ACCESSORY VOICE INTERFACE APPLICATIONS

The XVF3510-UA variant includes the same audio infrastructure as the XFV3510-INT, but it includes a

USB interface that implements a UAC1.0 audio device to interface to the host system. The USB

interface also supports an Endpoint 0 control channel and a USB HID to signal input events to the

host.

The system architecture of the XVF3510-UA is shown below.

Figure 2-3 XVF3510-UA System architecture

NOTE: The XVF3510-UA product also supports a hybrid mode of operation where the reference signal

is delivered via I2S rather than USB. This mode is selected via modification of the configuration data

stored in the flash device.

This manual suits for next models

Table of contents

Other XMOS Computer Hardware manuals

XMOS

XMOS XK-XMP-64 User manual

XMOS

XMOS MultiUART User manual

XMOS

XMOS xTAG v3.0 User manual

XMOS

XMOS xCORE-200 Multi-channel Audio board Guide

XMOS

XMOS XS1-L2 User manual

XMOS

XMOS SliceKit GPIO User guide

XMOS

XMOS XTAG-2 User manual

XMOS

XMOS XC-1 User manual

XMOS

XMOS XVF3800 User manual

XMOS

XMOS XC-1A User manual

XMOS

XMOS XS1-L1 User manual

XMOS

XMOS xCORE-Analog sliceKIT User manual

XMOS

XMOS XK-XMP-64 User manual

XMOS

XMOS SliceKit User manual

Popular Computer Hardware manuals by other brands

Toshiba

Toshiba TOSVERT VF-MB1/S15 IPE002Z Function manual

Shenzhen

Shenzhen MEITRACK MVT380 user guide

TRENDnet

TRENDnet TEW-601PC - SUPER G MIMO WRLS PC CARD user guide

StarTech.com

StarTech.com CF2IDE18 instruction manual

Texas Instruments

Texas Instruments LMH0318 Programmer's guide

Gateway

Gateway 8510946 user guide

Devon IT

Devon IT TC2D Quick setup guide

Krüger & Matz

Krüger & Matz Air Shair2 owner's manual

Crystalio

Crystalio VPS-2300 quick guide

MYiR

MYiR FZ3 user manual

Protech Systems

Protech Systems BC-K200 Quick reference guide

Miranda

Miranda DENSITE series DAP-1781 Guide to installation and operation

Sierra Wireless

Sierra Wireless Sierra Wireless AirCard 890 quick start guide

Leadtek

Leadtek Killer Xeno Pro Quick installation guide

Star Cooperation

Star Cooperation FlexTiny 3 Series Instructions for use

Hotone

Hotone Ampero user manual

Connect Tech

Connect Tech Xtreme/104-Express user manual

Yealink

Yealink WF50 user guide

XMOS VocalFusion XVF3510 User manual

Popular Computer Hardware manuals by other brands