Hi, I am working on the Voice related app. I want to detect the presence of Voice (speech) activity. I have tried open source projects like web-RTC, pockect spinx, Speex library and others, but the results are not satisfactory. Some will give better results in high SNR but fails to detect the same in low SNR. Is there any frame work form hexagon DSP form which we can check the captured/recorded audio (in Android MSM8974) is speech/voice or noise.?
Voice(speech) Activity detection
Posted: Mon, 2014-10-13 06:05
Hi,
There is no such framework for VAD for hexagon SDK users.
Thanks,
Haseeb
Did you find information on how to specify a sound model?
Some Android devices support this feature, for example Google Pixel reacts to keyphrase "Ok Google". In the source code of the android, I found the code responsible for loading the keyphrase into the DSP processor (The Hexagon DSP processor is built into the Qualcomm processor):
https://android.googlesource.com/platform/hardware/libhardware/+/master/modules/soundtrigger/sound_trigger_hw.chttps://android.googlesource.com/platform/system/media/+/master/audio/include/system/sound_trigger.h
The sound model description structure sound_trigger_sound_model pass in the method stdev_load_sound_model. Sound model structure:
Does anyone know how to generate a binary data of the sound model or where to find information about it?
You can download the sound model for the keyphrase ''Ok Google" by link:https://drive.google.com/open?id=0B9jcQJRmjR0yaDJhOXN2M2ZLYm8. I loaded it into the DSP processor and it works.
Helpfull Android classes:https://android.googlesource.com/platform/frameworks/base/+/master/services/voiceinteraction/java/com/android/server/soundtrigger/SoundTriggerHelper.javahttps://android.googlesource.com/platform/frameworks/base/+/master/core/java/android/hardware/soundtrigger/SoundTriggerModule.javahttps://android.googlesource.com/platform/frameworks/base/+/master/core/jni/android_hardware_SoundTrigger.cpphttps://android.googlesource.com/platform/frameworks/av/+/master/soundtrigger/SoundTrigger.cpphttps://android.googlesource.com/platform/frameworks/av/+/master/soundtrigger/ISoundTrigger.cpp