Trajectory analysis using data mining techniques

Authors

  • Gary Reyes Universidad de Guayaquil

DOI:

https://doi.org/10.24215/16666038.25.e06

Keywords:

Clustering algorithm, Congestion detection, GPS trajectory, Traffic flow, Trajectory clusterin

Abstract

This study presents an innovative method for identifying variability in vehicular flow, designed for dynamic
urban environments with fluctuating traffic conditions. The proposed approach integrates real-time data flow
processing with a two-level clustering strategy to detect and analyze vehicular density patterns. The first level
performs dynamic clustering of GPS locations, forming microclusters that represent spatially homogeneous
traffic zones. Each microcluster is continuously updated based on similarity criteria and a forgetting mechanism
that ensures data relevance. Periodic snapshots capture the temporal evolution of the traffic distribution, which
serves as input for the second level of clustering. The second level aggregates microclusters based on proximity,
taking advantage of historical density data to classify traffic variability. By comparing current and baseline
densities, the method identifies congestion-prone areas and dynamically adjusts cluster formations. This twolevel
approach improves traffic management and provides a robust framework for detecting congestion trends.
Through validation in three urban case studies, San Francisco, Rome and Guayaquil, the methodology
successfully captured the spatial and temporal variability of traffic, identifying congestion hotspots and
uncovering patterns of flow evolution over time.

Downloads

Download data is not yet available.

References

S. Y. Soumia Goumiri and S. Djahel, “Smart Mobility

in Smart Cities: Emerging challenges, recent advances

and future directions,” Journal of Intelligent Transportation

Systems, vol. 0, no. 0, pp. 1–37, 2023.

R. Zhou, H. Chen, H. Chen, E. Liu, and S. Jiang,

“Research on traffic situation analysis for urban road

network through spatiotemporal data mining: A case

study of xi’an, china,” IEEE Access, vol. 9, pp. 75 553–

567, 2021.

G. R. Zambrano and R. N. H. Veliz, “Aplicaciones

de algoritmos de trayectorias GPS en gadgets/[GPS

trajectories algorithms applications in gadgets],” International

Journal of Innovation and Applied Studies,

vol. 16, no. 3, p. 549, 2016.

G. Reyes, L. Lanzarini, W. Hasperu´e, and A. F. Bariviera,

“GPS trajectory clustering method for decision

making on intelligent transportation systems,” Journal

of Intelligent & Fuzzy Systems, vol. 38, no. 5, pp.

–5535, 5 2020.

W. Hasperu´e, C. Estrebou, G. Camele, P. L´opez, P. J.

Santana, G. R. Zambrano, L. Lanzarini, and A. F. Bariviera,

“Procesamiento inteligente de grandes vol´umenes

de informaci´on y de flujos de datos,” 2021.

G. Reyes, V. Estrada, R. Tolozano-Benites, and

V. Maquil´on, “Batch Simplification Algorithm for Trajectories

over Road Networks,” ISPRS International

Journal of Geo-Information, vol. 12, no. 10, p. 399, 9

G. Reyes, R. Tolozano-Benites, L. Lanzarini, C. Estrebou,

A. F. Bariviera, and J. Barzola-Monteses,

“Methodology for the Identification of Vehicle Congestion

Based on Dynamic Clustering,” Sustainability,

vol. 15, no. 24, p. 16575, 12 2023.

J. D. Mazimpaka and S. Timpf, “Trajectory data mining:

A review of methods and applications,” Journal

of Spatial Information Science, no. 13, pp. 61–99, 12

Jimmy Sornoza Moreira, Christopher Crespo Le´on,

Gary Reyes Zambrano, and Cortez Mercado Julio Joel,

“Parameters influencing in the vehicular overcrowding,”

International Journal of Innovation and Applied Studies,

vol. 24, no. 4, pp. 1440–1455, 2018.

J. Kim and H. S. Mahmassani, “Spatial and temporal

characterization of travel patterns in a traffic network

using vehicle trajectories,” Transportation Research

Procedia, vol. 9, pp. 164–184, 2015. [Online].

Available: https://www.sciencedirect.com/science/

article/pii/S2352146515001702

L. Ochoa and G. Reyes, “Arquitectura de un sistema

inteligente de transportaci´on (ITS) que permita mejorar

la operaci´on y seguridad del transporte terrestre de

Ecuador,” Universidad de Guayaquil, 2015.

S. J. Kamble and M. R. Kounte, “Machine Learning

Approach on Traffic Congestion Monitoring System in

Internet of Vehicles,” Procedia Computer Science, vol.

, pp. 2235–2241, 2020.

S. Sun, J. Chen, and J. Sun, “Traffic congestion

prediction based on GPS trajectory data,” International

Journal of Distributed Sensor Networks, vol. 15, no. 5,

2019, publisher: SAGE Publications. [Online].

Available: https://doi.org/10.1177/1550147719847440

A. Shahraki, M. Abbasi, A. Taherkordi, and A. D. Jurcut,

“A comparative study on online machine learning

techniques for network traffic streams analysis,” Computer

Networks, vol. 207, p. 108836, 2022.

L. C. Lanzarini, W. Hasperu´e, A. Villa Monte,

P. Jimbo Santana, G. Reyes Zambrano, J. P. Corvi,

A. Fern´andez Bariviera, and J.

´A. Olivas Varela,

“Miner´ıa de datos, miner´ıa de textos y Big Data,” in

XXI Workshop de Investigadores En Ciencias de La

Computaci´on (WICC 2019, Universidad Nacional de

San Juan)., 2019.

M. Azimi and Y. Zhang, “Categorizing Freeway Flow

Conditions by Using Clustering Methods,” Transportation

Research Record: Journal of the Transportation

Research Board, vol. 2173, no. 1, pp. 105–114, 1 2010.

F. Rempe, G. Huber, and K. Bogenberger, “SpatioTemporal

Congestion Patterns in Urban Traffic Networks,”

Transportation Research Procedia, vol. 15, pp.

–524, 2016.

Q. Shang, Y. Yu, and T. Xie, “A Hybrid Method for

Traffic State Classification Using K-Medoids Clustering

and Self-Tuning Spectral Clustering,” Sustainability,

vol. 14, no. 17, p. 11068, 7 2022.

G. Reyes Zambrano, “GPS trajectory compression

algorithm,” in Computer and Communication Engineering:

First International Conference, ICCCE 2018,

Guayaquil, Ecuador, October 25–27, 2018, Proceedings

Springer, 2019, pp. 57–69.

Gary Reyes Zambrano and Lissette Ochoa Vera, “Reference

architecture for an intelligent transportation system,”

International Journal of Innovation and Applied

Studies, vol. 15, no. 1, pp. 175–182, 2016.

Y. Zhang, K. Tangwongsan, and S. Tirthapura, “Streaming

k-Means Clustering with Fast Queries,” in 2017

IEEE 33rd International Conference on Data Engineering

(ICDE). San Diego, CA, USA: IEEE, 4 2017, pp.

–460.

C. Mu, Y. Hou, J. Zhao, S. Wei, and Y. Wu, “StreamDBSCAN:

A Streaming Distributed Clustering Model

for Water Quality Monitoring,” Applied Sciences,

vol. 13, no. 9, p. 5408, 4 2023.

A. Anil Meera and M. Wisse, “Dynamic Expectation

Maximization Algorithm for Estimation of Linear Systems

with Colored Noise,” Entropy, vol. 23, no. 10, p.

, 10 2021.

G. Reyes, L. Lanzarini, W. Hasperu´e, and A. F. Bariviera,

“Proposal for a Pivot-Based Vehicle Trajectory

Clustering Method,” Transportation Research Record,

vol. 2676, no. 4, pp. 281–295, 4 2022.

Y. Zhang, N. Ye, R. Wang, and R. Malekian,“A method

for traffic congestion clustering judgment based on

grey relational analysis,” ISPRS International Journal

of Geo-Information, vol. 5, no. 5, 2016. [Online].

Available: https://www.mdpi.com/2220-9964/5/5/71

T. Erdeli´c, T. Cari´c, M. Erdeli´c, L. Tiˇsljari´c,

A. Turkovi´c, and N. Jeluˇsi´c, “Estimating congestion

zones and travel time indexes based on the floating

car data,” Computers, Environment and Urban

Systems, vol. 87, p. 101604, 2021. [Online].

Available: https://www.sciencedirect.com/science/

article/pii/S0198971521000119

G. Reyes, L. C. Lanzarini, C. A. Estrebou, and

V. Maquil´on, “Vehicular Flow Analysis Using Clusters,”

in XXVII Congreso Argentino de Ciencias de La

Computaci´on (CACIC) (Modalidad Virtual, 4 al 8 de

Octubre de 2021), 2021.

G. Reyes, L. Lanzarini, C. Estrebou, A. Bariviera, and

V. Maquil´on, “Evaluation of a Grid for the Identification

of Traffic Congestion Patterns,” in Technologies

and Innovation, ser. Communications in Computer and

Information Science, R. Valencia-Garc´ıa, M. BucaramLeverone,

J. Del Cioppo-Morstadt, N. Vera-Lucio, and

P. H. Centanaro-Quiroz, Eds. Cham: Springer Nature

Switzerland, 2023, pp. 277–290.

G. Reyes, C. Crespo, O. Le´on-Granizo, W. Baz´an,

and R. Horta, “Propuesta de m´etodo de extracci´on de

ubicaciones georreferenciales de una red de carreteras

para el an´alisis de trayectorias GPS,” Investigaci´on,

Tecnolog´ıa e Innovaci´on, vol. 14, no. 16, pp. 1–15, 7

G. Reyes, L. Lanzarini, C. Estrebou, and A. Fernandez

Bariviera, “Dynamic grouping of vehicle trajectories,”

Journal of Computer Science and Technology,

vol. 22, no. 2, p. e11, 10 2022.

J. Liu, D. Wu, H. Mohammed, and R. Seidu, “A Novel

Method for Anomaly Detection and Signal Calibration

in Water Quality Monitoring of an Urban Water Supply

System,” Water, vol. 16, no. 9, p. 1238, 04 2024.

M. Kossakov, A. Mukasheva, G. Balbayev, S. Seidazimov,

D. Mukammejanova, and M. Sydybayeva, “Quantitative

Comparison of Machine Learning Clustering

Methods for Tuberculosis Data Analysis,” in CIEES

MDPI, 01 2024, p. 20.

D. Moulavi, P. A. Jaskowiak, R. J. G. B. Campello,

A. Zimek, and J. Sander, “Density-Based Clustering

Validation,” in Proceedings of the 2014 SIAM International

Conference on Data Mining. Society for

Industrial and Applied Mathematics, 04 2014, pp. 839–

N. V. Thieu, “PerMetrics: A Framework of

Performance Metrics for Machine Learning Models,”

Journal of Open Source Software, vol. 9, no. 95,

p. 6143, Mar. 2024. [Online]. Available: https:

//joss.theoj.org/papers/10.21105/joss.06143

N. Van Thieu and S. Mirjalili, “Mealpy: An opensource

library for latest meta-heuristic algorithms in

python,” Journal of Systems Architecture, 2023.

Downloads

Published

2025-04-30

Issue

Section

Thesis Overview

How to Cite

[1]
“Trajectory analysis using data mining techniques”, JCS&T, vol. 25, no. 1, p. e06, Apr. 2025, doi: 10.24215/16666038.25.e06.

Similar Articles

1-10 of 171

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)