TY - GEN
T1 - A Machine Learning and SUMO-Based Framework for CO2 Emission Prediction in Urban Areas with Web Application Deployment
AU - Casa-Vaca, David
AU - Lemus-Cárdenas, Leticia
AU - Sánchez-Balseca, Joseph
AU - Astudillo-León, Juan Pablo
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - The accelerated growth of vehicle fleets in Latin American cities, coupled with high altitudes and heavy traffic congestion, has substantially increased the environmental impact of carbon dioxide (CO2) emissions. This work presents a practical methodology to predict CO2 emissions in urban areas, avoiding the need for computationally expensive traffic simulations. To achieve this, a dataset was generated by performing extensive microscopic traffic simulations with SUMO, using OpenStreetMap data to extract the urban road network and configuring vehicle flows based on official registration statistics from Ecuador. Multiple scenarios with varying vehicle densities and fleet compositions were simulated to build a diverse dataset. Three machine learning models, Linear Regression, Random Forest, and Neural Networks, were trained on this data set to predict CO2 emissions as a function of input traffic parameters. The Random Forest model outperformed the others, achieving R2=0.9875 and MAPE = 3.61%. This trained model was then deployed in a web application using Streamlit, allowing users to estimate emissions in real time by inputting simple traffic parameters, thereby eliminating the need for running new extensive SUMO simulations for each scenario. This framework offers an efficient decision support tool for urban planning and environmental assessment in high-altitude, traffic-congested cities like Quito.
AB - The accelerated growth of vehicle fleets in Latin American cities, coupled with high altitudes and heavy traffic congestion, has substantially increased the environmental impact of carbon dioxide (CO2) emissions. This work presents a practical methodology to predict CO2 emissions in urban areas, avoiding the need for computationally expensive traffic simulations. To achieve this, a dataset was generated by performing extensive microscopic traffic simulations with SUMO, using OpenStreetMap data to extract the urban road network and configuring vehicle flows based on official registration statistics from Ecuador. Multiple scenarios with varying vehicle densities and fleet compositions were simulated to build a diverse dataset. Three machine learning models, Linear Regression, Random Forest, and Neural Networks, were trained on this data set to predict CO2 emissions as a function of input traffic parameters. The Random Forest model outperformed the others, achieving R2=0.9875 and MAPE = 3.61%. This trained model was then deployed in a web application using Streamlit, allowing users to estimate emissions in real time by inputting simple traffic parameters, thereby eliminating the need for running new extensive SUMO simulations for each scenario. This framework offers an efficient decision support tool for urban planning and environmental assessment in high-altitude, traffic-congested cities like Quito.
KW - CO emissions
KW - Machine learning
KW - OpenStreetMap
KW - SUMO
KW - Web application
UR - https://www.scopus.com/pages/publications/105020769907
U2 - 10.1007/978-3-032-08366-1_17
DO - 10.1007/978-3-032-08366-1_17
M3 - Contribución a la conferencia
AN - SCOPUS:105020769907
SN - 9783032083654
T3 - Communications in Computer and Information Science
SP - 253
EP - 268
BT - Information and Communication Technologies - 13th Ecuadorian Conference, TICEC 2025, Proceedings
A2 - Berrezueta, Santiago
A2 - Gualotuña, Tatiana
A2 - Fonseca C., Efrain R.
A2 - Rodriguez Morales, Germania
A2 - Maldonado-Mahauad, Jorge
PB - Springer Science and Business Media Deutschland GmbH
T2 - 13th Ecuadorian Conference on Information and Communication Technologies, TICEC 2025
Y2 - 16 October 2025 through 17 October 2025
ER -