Automated Classification of Avian Vocal Activity Using Acoustic Indices in Regional and Heterogeneous Datasets

Authors
Daniel Yip
Lisa Mahon
Alexander MacPhail
Erin Bayne
Resource Date:
2021

Acoustic indices combined with clustering and classification approaches have been increasingly used to automate identification of the presence of vocalizing taxa or acoustic events of interest. While most studies using this approach standardize data collection and study design parameters at the project or study level, recent trends in ecological research are to investigate patterns at regional or continental scales. Large-scale studies often require collaboration between research groups and integration of data from multiple sources to fulfil objectives, which can lead to variation in recording equipment and data collection protocols.


Our objectives were to determine how analytical approaches and variation in data collection and processing that is typical of regional acoustic monitoring programmes influences accuracy when identifying vocal activity in breeding birds. We used data from three regional datasets in Northern Alberta, Northern British Columbia, and Southern and Central Yukon, Canada to investigate the effect of analytical framework, sample size, local species richness and data collection variables on classification accuracy.


We found supervised classification approaches to be the most effective, with boosted regression trees identifying vocalizations of breeding birds in audio recordings with a 92.0% accuracy and easily able to accommodate variation in data collection and processing parameters. We also provide recommendations on effectively processing large and heterogeneous datasets including sufficient sample size, accommodating potentially confounding variables and selecting suitable model training data.


The results presented in this study can help inform decisions in data collection, data processing, and study design and analysis, maximize performance and accuracy during analysis, and efficiently process large, heterogeneous acoustic datasets to answer questions at scales previously difficult to investigate.