An investigation and comparison of speech recognition software for determining if bird song recordings contain legible human voices

Hunt, Tim D. and Ryan, Grant and Ryan-Pears, Cameron (2017) An investigation and comparison of speech recognition software for determining if bird song recordings contain legible human voices. Journal of Applied Computing and Information Technology, 2017 (21(1)). ISSN 2230-4398


Official URL: http://www.citrenz.ac.nz/jacit/index.html

Abstract or Summary

The purpose of this work was to test how effectively readily available speech recognition API services can determine whether recordings of bird song have inadvertently captured human voices. A mobile phone was used to record a person speaking at increasing distances from the phone in an outdoor setting, with bird song in the background. One of the services was trained with sample recordings, and the services were compared on their ability to return recognized words. The services from Google and IBM performed similarly, and the Microsoft service, which allowed training, performed slightly better. However, none of the three services performed well enough to allow recordings containing recognizable human speech to be reliably identified and deleted, as full privacy protection would require.
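
The screening idea described in the abstract can be illustrated with a minimal sketch. It assumes each service has already returned a plain-text transcript for a recording; the function names, the `min_words` threshold, and the word-recall measure are hypothetical illustrations and are not taken from the paper, and no actual service API is called.

```python
import re


def recognized_words(transcript: str) -> list[str]:
    """Split a transcript into lowercase word tokens."""
    return re.findall(r"[a-z']+", transcript.lower())


def flag_for_deletion(transcripts: dict[str, str], min_words: int = 1) -> bool:
    """Return True if any service returned at least `min_words` words.

    `transcripts` maps a service name to the text it returned
    (hypothetical structure, assumed for this sketch).
    """
    return any(
        len(recognized_words(text)) >= min_words
        for text in transcripts.values()
    )


def word_recall(spoken: str, transcript: str) -> float:
    """Fraction of the distinct spoken words that appear in the transcript."""
    reference = set(recognized_words(spoken))
    if not reference:
        return 0.0
    return len(reference & set(recognized_words(transcript))) / len(reference)
```

A rule of this kind only protects privacy if the services reliably return words whenever speech is legible, which is the condition the study found was not met.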

Item Type: Journal article
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: Schools > Centre for Business, Information Technology and Enterprise > School of Information Technology
ID Code: 5391
Deposited By:
Deposited On: 28 Jul 2017 01:38
Last Modified: 31 Jul 2017 00:34
