REAL

Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the collared flycatcher (Ficedula albicollis)

Zsebők, Sándor and Nagy-Egri, Máté Ferenc and Barnaföldi, Gergely Gábor and Laczi, Miklós and Nagy, Gergely and Vaskuti, Éva and Garamszegi, László Zsolt (2019) Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the collared flycatcher (Ficedula albicollis). ORNIS HUNGARICA, 27 (2). pp. 59-66. ISSN 1215-1610

[img]
Preview
Text
Automatic-bird-song-and-syllable-segmentation-with-an-opensource-deeplearning-object-detection-method-a-case-study-in-the-Collared-Flycatcher-Ficedula-albicollis.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview

Abstract

The bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in the evolution of acoustic signals. During the last 20 years, we obtained hundreds of hours of recordings of bird songs collected in natural environment, and there is a permanent need for the automatic process of recordings. In this study, we chose an open-source, deep-learning image detection system to (1) find the species-specific songs of the Collared Flycatcher on the recordings and (2) to detect the small, discrete elements so-called syllables within the song. For these tasks, we first transformed the acoustic data into spectrogram images, then we trained two deep-learning models separately on our manually segmented database. The resulted models detect the songs with an intersection of union higher than 0.8 and the syllables higher than 0.7. This technique anticipates an order of magnitude less human effort in the acoustic processing than the manual method used before. Thanks to the new technique, we are able to address new biological questions that need large amount of acoustic data.

Item Type: Article
Additional Information: Behavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, Pázmány Péter sétány 1/C, Budapest, 1117, Hungary Wigner Research Centre for Physics, Konkoly-Thege Miklós út 29-33, Budapest, 1121, Hungary The Barn Owl Foundation, Temesvári út 8., Orosztony, 8744, Hungary MTA-ELTE, Theoretical Biology and Evolutionary Ecology Research Group, Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány 1/C, Budapest, 1117, Hungary Evolutionary Ecology Group, Centre for Ecological Research, Institute of Ecology and Botany, Alkotmány utca 2-4, Vácrátót, 2163, Hungary Cited By :3 Export Date: 21 April 2022 Correspondence Address: Zsebők, S.; Behavioural Ecology Group, Pázmány Péter sétány 1/C, Hungary; email: zsebok.s@gmail.com
Uncontrolled Keywords: bird song, deep-learning, object detection, Collared Flycatcher, automatic segmentation
Subjects: Q Science / természettudomány > QL Zoology / állattan
Z Bibliography. Library Science. Information Resources / könyvtártudomány > ZA Information resources / információforrások > ZA4450 Databases / adatbázisok
SWORD Depositor: MTMT SWORD
Depositing User: MTMT SWORD
Date Deposited: 04 Oct 2024 06:16
Last Modified: 04 Oct 2024 06:16
URI: https://real.mtak.hu/id/eprint/206849

Actions (login required)

Edit Item Edit Item