A COLLABORATIVE SYSTEM FOR GEOREFERENCED MONITORING OF NOISE POLLUTION DATA CLASSIFIED BY MACHINE LEARNING MODEL
DOI:
https://doi.org/10.56238/revgeov17n2-138Keywords:
Noise Pollution, Urban Sound Classification, Machine Learning, Collaborative Systems, GeoreferencingAbstract
This paper proposes a collaborative system for the georeferenced monitoring of noise pollution in urban environments, utilizing a machine learning model for automatic sound classification. The system adopts a modular architecture composed of three main applications: a Client Application (collection), a Server Application (processing), and a Client Application (visualization), which allows for scalability and ease of maintenance. In the collection Client Application, data capture is performed via a chatbot integrated with the Telegram application, enabling users to submit audio recordings along with their locations. These data are processed by a Server Application, which extracts acoustic features, classifies the sounds using the UrbanSound8K_ECAPA model, and stores the results in a NoSQL database. The visualization Client Application consists of a web page that requests data from the Server Application and presents the captured data interactively, allowing for the exploration of classified sounds and their corresponding geographical locations. A comparative analysis of audio classification models was conducted to support the choice of the most suitable model. Preliminary results demonstrate the viability of the proposed solution, which proves to be a promising, low-cost, accessible, and scalable tool for mapping and analyzing urban noise pollution, with potential applications in public policy and smart city initiatives.
Downloads
References
AHMED, A. A.; PRADHAN, B.; CHAKRABORTY, S. et al. Developing vehicular traffic noise prediction model through ensemble machine learning algorithms with GIS. Arabian Journal of Geosciences, v. 14, n. 16, p. 1–15, 2021. DOI: 10.1007/s12517-021-08114-y.
ARINAITWE, Chris; MUGISHA, Samuel; OPIO, Daniel. Review of Techniques used in Speech Signal Processing. Journal of Signal Processing Systems, v. 3, n. 1, p. 63–70, 2024. DOI: 10.59568/KJSET-2024-3-1-07.
BELTRAN, C. A. R. et al. Plataforma de aprendizado de máquina para detecção e monitoramento de alunos com risco de evasão. Anais do CBIE/SBIE, v. 30, n. 1, p. 1591, 2019. DOI: 10.5753/cbie.sbie.2019.1591.
BEZERRA, E. A. A. C. et al. Instrumentos construídos/validados ao processo de enfermagem em saúde mental: revisão integrativa. Revista Eletrônica Acervo Saúde, v. 25, p. e19772, 2025. DOI: 10.25248/reas.e19772.2025.
BEZERRA, E. A. A. C.; FERREIRA, A. A.; PORTELA, L. C. Diagnóstico de enfermagem em saúde mental infantojuvenil. Miscellaneous, v. 24, n. 1, p. e13700, 2024. DOI: 10.25248/reas.e13700.2024.
BOUMCHICH, A. A Smartphone-Based Crowd-Sourced Database for Environmental Noise Assessment: from data quality assessment to the production of relevant noise maps. 2023. 122 f. Tese (Doutorado em Física) – Le Mans Université. Disponível em: https://theses.hal.science/tel-04444620/
CAN, A.; et al. Framework for urban sound assessment at the city scale based on citizen action, with the smartphone application NoiseCapture as a lever for participation. Noise Mapping, v. 10, n. 1, p. 20220166, 2023. DOI: 10.1515/noise-2022-0166.
CERIOTTI, V. C. Classificação de sons ambientais utilizando redes neurais convolucionais para aplicações em hardwares com recursos limitados. 2023. Trabalho de Conclusão de Curso (Graduação) – Universidade Federal do Rio Grande do Sul. Disponível em: http://hdl.handle.net/10183/267582.
CHEN, Kai; KONG, Qiuqiang; IQBAL, Turab; WANG, Yuxuan; WANG, Wenwu; PLUMBY, Mark D. HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. IEEE, p. 646–650. DOI: 10.1109/ICASSP43922.2022.9746667.
CHEN, Shuai; PANDA, Asish; HSU, Wei-Ning; MANOCHA, Pranay; SYNNAEVE, Gabriel; MOHAMED Abdelrahman. BEATs: Audio Pre-Training with Acoustic Tokenizers. arXiv preprint, arXiv:2212.09058, 2022. Disponível em: https://arxiv.org/abs/2212.09058.
DUBEY, Rakesh; BHARDWAJ, Shruti; SHARMA, Vinamra B.; BHATT, Arpit; BISWAS, Shusham. Smartphone-based traffic noise mapping system. ISPRS Archives, XLIII-B4-2022, p. 613–620, 2022. DOI: 10.5194/isprs-archives-XLIII-B4-2022-613-2022.
GONG, Yuan; CHUNG, Yu-An; GLASS, James. AST: Audio Spectrogram Transformer. arXiv preprint, arXiv:2104.01778, v. 3, p. 1–5, 2021. Disponível em: https://arxiv.org/abs/2104.01778.
KONG, Qiuqiang; CAO, Yin; IQBAL, Turab; WANG, Yuxuan; WANG, Wenwu; PLUMBLEY, Mark D. PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, v. 28, p. 2880–2894, 2020. DOI: 10.1109/TASLP.2020.3030497.
KOUTINI, Khaled; SCHLÜTER, Jan; EGHBAL-ZADEH, Hamid; WIDMER, Gerhard. Efficient Training of Audio Transformers with Patchout. In: Interspeech 2022. ISCA, 2022. DOI: 10.21437/Interspeech.2022-227.
LARANJA, Andrea C.; SAITER, Júlia C. Análise de poluição sonora urbana em Vitória–ES a partir do uso de smartphones. ENCAC, v. 17, p. 1–9, 2023. DOI: 10.46421/encac.v17i1.3885.
LIU, Shuai; YUAN, Yi; LIU, Xiulong; LIU, Yan. Audio self-supervised learning: A survey. Patterns, v. 3, n. 12, p. 1–18, 2022. DOI: 10.1016/j.patter.2022.100616.
MACEDO, Alessandra; PERSONA, Leandro; MELONI, Fernando. Recognition of Emotions through Facial Geometry with Normalized Landmarks. In: Proceedings of the 30th Brazilian Symposium on Multimedia and the Web, Juiz de Fora/MG. Porto Alegre: SBC, 2024. p. 257–266. DOI: 10.5753/webmedia.2024.243252.
MEHRISH, Amit; RAJ, Rahul; KUMAR, Sushil; ROY, Partha P. A review of deep learning techniques for speech processing. Information Fusion, v. 99, p. 1–72, 2023. Disponível em: https://arxiv.org/abs/2305.00359.
NERY, Sannyer C. C.; BATISTA, Fani T. S.; BEZERRA, Sergio A. C. ColabSomGeo: Ferramenta de monitoramento de poluição sonora subsidiada por modelo de aprendizagem. In: SIMPÓSIO BRASILEIRO DE SISTEMAS MULTIMÍDIA E WEB, 31º, 2025, Rio de Janeiro. Anais. Local: Sociedade Brasileira de Computação, 2025.
PURWINS, Hendrik et al. Deep Learning for Audio Signal Processing. IEEE Journal of Selected Topics in Signal Processing, v. 13, n. 2, p. 206–219, 2019. DOI: 10.1109/JSTSP.2019.2908700.
RIBEIRO, Murilo B.; BRAGHETTO, Kelly R. A Scalable Data Integration Architecture for Smart Cities: Implementation and Evaluation. Journal of Information and Data Management, v. 13, n. 2, p. 207–223, 2022. DOI: 10.5753/dm.2022.2485.
SANTOS, Alessandro; FREITAS, Leandro de; TEIXEIRA, Igor; VAGNER, Gava; TAIRA, Gustavo; QUILE, Rosa E.; BRAGHETO, Kelly. Desafios e oportunidades da aplicação de Sistemas Ciberfísicos no monitoramento da poluição urbana. 2020. p. 276–289. DOI: 10.5753/courb.2020.12369.
SANTOS, Carlos J.; SOUZA, Luiz E.; PAIVA, Benedito G. M. Análise da influência de diferentes codificadores de voz (CODECs) em redes convergentes. Revista Científica e-Locução, v. 1, n. 17, p. 26–26, 2020. DOI: 10.57209/e-locucao.v1i17.249.
SANTOS, Luís Conde; LEÃO, Cristina; FIGUEIREDO, João; MELO, Afonso; COVAS, Guilherme; CARVALHO, André. PROJETO MIRA – Desenvolvimento de uma Plataforma Inteligente de Monitorização de Ruído Ambiental. 2023. Disponível em: https://www.dbwave.pt/uploads/media_items/mira-article-tecniacustica-20230915-lcs.original.pdf.
SARKER, Iqbal H.; KAYES, A. S. M.; WATTERS, Paul. Effectiveness analysis of machine learning classification models for predicting personalized context-aware smartphone usage. Journal of Big Data, v. 6, n. 57, p. 2–28, 2019. DOI: 10.1186/s40537-019-0219-y.
SILVA, Igor S. C.; CUNHA, Felipe. Monitoramento da poluição sonora em Belo Horizonte utilizando redes de sensoriamento participativo. 2022. Disponível em: https://bib.pucminas.br/pergamumweb/vinculos/00000b/00000bc6.pdf.
SOFIANOPOULOS, Sofianos et al. Citizens as Environmental Sensors: Noise Mapping and Assessment on Lemnos Island, Greece, Using VGI and Web Technologies. European Journal of Geography, v. 15, n. 2, p. 106–119, 2024. Disponível em: https://www.eurogeojournal.eu/index.php/egj/article/view/476.
STRAUSS, Edilberto; VILLAS BÔAS JÚNIOR, Manoel; FERREIRA, Wagner L. L. A importância de utilizar métricas adequadas de avaliação de performance em modelos preditivos de machine learning. Projectus, v. 7, n. 2, p. 52–62, 2022. DOI: 10.15202/25254146.2022v7n2p52.
TRICCO, A. C. et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Annals of Internal Medicine, v. 169, n. 7, p. 467-473, 2018. Disponível em: https://doi.org/10.7326/M18-0850. Acesso em: 1 nov. 2025.
ZAMAN, Khalid et al. A survey of audio classification using deep learning. IEEE Access, v. 11, p. 106620–106649, 2023. DOI: 10.1109/ACCESS.2023.3318015.
ZHAO, Baishan; LIANG, Jiwen. Hierarchical-Concatenate Fusion TDNN for sound event classification. PLOS ONE, v. 19, n. 10, p. 1–17, 2024. DOI: 10.1371/journal.pone.0312998.