First problem is the Arabic speech recognition engine.
Please see:
http://international.sakhr.com/arabic-speech-recognition-and-arabic-TTS.html[
^],
http://arabic1.software.informer.com/download-arabic-speech-recognition/[
^],
http://softwaretopic.informer.com/dragon-arabic-speech-recognition/[
^].
Try to find some more:
http://lmgtfy.com/?q=speech+recognition+arabic[
^].
Now, let's come to the main thing: do you really want to match audio to audio?
Forget it. Urgent? It will probably take you the lifetime. I never heard that such problem was resolved anywhere even remotely. Apparently you have no idea of the complexity of the task, but I do. Who knows the criteria for two audio clips to be close? And the criteria is the most complex. Do you want to recognize the personal voice characteristics or just the phrase? Oh, why am I even ask you.
Just forget it.
However, I can offer you the reasonable work around. When you're done with the speech recognition using the engine, do the following: in your database, store the text. Don't even play with the idea of storing audio and audio matching. Store the text, perform speech recognition with the engine and match recognized text. This is the most you will be able to do; and this is quite feasible.
—SA