Document Type
Article
Publication Date
2024
DOI
10.1371/journal.pone.0305708
Publication Title
PLoS One
Volume
19
Issue
8
Pages
e0305708
Abstract
The classification of bird species is of significant importance in the field of ornithology, as it plays an important role in assessing and monitoring environmental dynamics, including habitat modifications, migratory behaviors, levels of pollution, and disease occurrences. Traditional methods of bird classification, such as visual identification, were time-intensive and required a high level of expertise. However, audio-based bird species classification is a promising approach that can be used to automate bird species identification. This study aims to establish an audio-based bird species classification system for 264 Eastern African bird species employing modified deep transfer learning. In particular, the pre-trained EfficientNet technique was utilized for the investigation. The study adapts the fine-tune model to learn the pertinent patterns from mel spectrogram images specific to this bird species classification task. The fine-tuned EfficientNet model combined with a type of Recurrent Neural Networks (RNNs) namely Gated Recurrent Unit (GRU) and Long short-term memory (LSTM). RNNs are employed to capture the temporal dependencies in audio signals, thereby enhancing bird species classification accuracy. The dataset utilized in this work contains nearly 17,000 bird sound recordings across a diverse range of species. The experiment was conducted with several combinations of EfficientNet and RNNs, and EfficientNet-B7 with GRU surpasses other experimental models with an accuracy of 84.03% and a macro-average precision score of 0.8342.
Rights
© 2024 Baowaly et al.
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability
Article states: "The datasets utilized in this article were obtained from "BirdCLEF 2023: Identify bird calls in soundscapes" webpage, which is freely accessible for all scientists and investigators to conduct experiments and can be accessed through the website: https://www.kaggle.com/competitions/birdclef-2023/data.
Original Publication Citation
Baowaly, M. K., Sarkar, B. C., Walid, M. A. A., Ahamad, M. M., Singh, B. C., Alvarado, E. S., Ashraf, I., & Samad, M. A. (2024). Deep transfer learning-based bird species classification using mel spectrogram images. PLoS One, 19(8), 1-16, Article e0305708. https://doi.org/10.1371/journal.pone.0305708
Repository Citation
Baowaly, M. K., Sarkar, B. C., Walid, M. A. A., Ahamad, M. M., Singh, B. C., Alvarado, E. S., Ashraf, I., & Samad, M. A. (2024). Deep transfer learning-based bird species classification using mel spectrogram images. PLoS One, 19(8), 1-16, Article e0305708. https://doi.org/10.1371/journal.pone.0305708
ORCID
0000-0002-2870-8137 (Singh)
Included in
Artificial Intelligence and Robotics Commons, Poultry or Avian Science Commons, Zoology Commons