Wanson asr demo
1 Functional Overview
Based on the local speech recognition library provided by the third-party company
Huazhen
, the local offline voice wake-up word and command word recognition functions are realized.
2 Code Path
demo path:
\components\demos\media\audio\audio_asr
Wanson local speech recognition library (floating point library) path:
\components\bk_thirdparty\asr\wanson
For detailed description of Wanson’s local speech recognition API interface, please refer to the source file:
\components\bk_thirdparty\asr\wanson\include\asr.h
3 Cli command introduction
The commands supported by the demo are as follows:
Command |
Description |
---|---|
asr_file_test {xxx.pcm} |
Identify recordings |
aud_intf_asr_test {start|stop} |
real-time recognition |
DEMO Compilation instruction: make bk7256 PROJECT=thirdparty/wanson_asr
4. Demonstration introduction
The steps performed to identify recordings are as follows:
Mount TF card
Uart sends AT command
fatfstest M 1
to mount TF card
Identify audio files
The demo board sends the command
asr_file_test xxx.pcm
to recognize the wake-up word and command word in the audio file
Note
The format of the audio file required to identify the recording file is:
mono, 16K sampling rate, 16 bit width
Recognition of recording files does not require CPU computing power, which is convenient for verifying that the function of the speech recognition library is normal
The steps performed by real-time recognition are as follows:
Turn on real-time speech recognition
The demo board sends the command
aud_intf_asr_test start
to open real-time speech recognition
Test the wake-up words and command words, the supported wake-up words and command words are as follows:
Little Bee Steward
recognizes successfully and prints the logxiao feng guan jia
on the serial port
armino
recognizes successfully and prints the loga er mi nuo
on the serial port
Visitor mode
recognize successful serial port print loghui ke mo shi
dining mode
recognizes successfully and prints the logyong can mo shi
on the serial port
Leave mode
recognize successful serial port print logli kai mo shi
home mode
identification success serial port print loghui jia mo shi
Turn off real-time speech recognition
5. Reference link
6. Shanghai Huazhen Electronic Technology Co., Ltd.
Official website: http://www.wanson.cn/Headquarters Address: Room 307-308, Huigaoguang Innovation Park, No. 789 Shenwang Road, Minhang District, Shanghai | Shenzhen Office Address: Room 2215-16, East Block, Building 1A, Huiyi City One Center, Xixiang, Baoan District, ShenzhenTel: 021-61557858Mobile: 1352485917613296017858