In:
The Journal of the Acoustical Society of America, Acoustical Society of America (ASA), Vol. 120, No. 5_Supplement ( 2006-11-01), p. 3378-3379
Abstract:
In this paper, we introduce a spoken language interface for music information retrieval. In response to voice commands, the system searches for a song through an internet music shop or a ‘‘playlist’’ stored in the local PC; the system then plays it. To cope with the almost unlimited size of the vocabulary, a remote server program with which a user can customize their recognition grammar and dictionary is implemented. When a user selects favorite artists, the server program automatically generates a minimal set of recognition grammars and a dictionary. The system then sends them to the interface program. Therefore, on average, the vocabulary is less than 1000 words for each user. To perform a field test of the system, we implemented a speech collection capability, whereby speech utterances are compressed in free lossless audio codec (FLAC) format and are sent back to the server program with dialogue logs. Currently, the system is available to the public for experimental use. More than 100 users are involved in field testing. In our presentation, we will report details of the system and the results of field tests, which include motorcycle environments. a)Currently at Graduate School of Computer and Information Sciences, Hosei University, Tokyo 184-8584, Japan.
Type of Medium:
Online Resource
ISSN:
0001-4966
,
1520-8524
Language:
English
Publisher:
Acoustical Society of America (ASA)
Publication Date:
2006
detail.hit.zdb_id:
1461063-2
Permalink