Abstract:
In this paper, we describe a spoken term detection (STD) method for a spoken document information retrieval. In general, STD method detects a query term from the spoken documents which are translated from acoustic signal data to text data by the automatic speech recognition system. Because the automatic speech recognition systems are able to output some types of recognition results, we are available for various types of the translated text data for STD. In this paper, we focus on the syllable-based transcriptions and the word-based transcriptions. Because of detecting the query term from a large size of these transcriptions, a rapid STD method is required. Therefore, we have proposed the rapid STD method using a bit-sequence representation and the suffix array. Our method, first, extracts the sub-sequences from the syllable-based transcriptions, and then converts them into the bit-sequence using a hash function. The STD candidates are retrieved using these bit-sequences. Finally, the distance between the query term and these candidates represented as bit-sequences is calculated by using Dynamic Programing (DP) matching. At the same time, our method searches the query term from the word-based transcription using a suffix array method. Then, our method detects the query term by combining these results. In the workshop of NTCIR10, our method has achieved the best performance in STD task. In this workshop, we have submitted the results under the limit conditions of our method. Hence, in this paper, we conduct STD experiments using NTCIR10 SpokenDoc2 Task under the other conditions and evaluate our method. In this experiment, we investigate the STD performances as a function of the number of the candidates of speech recognition and type of candidates of speech recognition. Experimental results show that our method significantly improve the STD method using each transcription. Therefore, we conclude that our method is useful for the STD.