Streamlined Traffic Prognosis using Flexible Reservoir Sampling and Regression Methods
Received: 9 December 2024 | Revised: 12 January 2025 | Accepted: 15 January 2025 | Online: 3 April 2025
Corresponding author: V. R. Srividhya
Abstract
The increased use of Internet of Things (IoT) technologies has resulted in an exponential increase in real-time data streams, particularly in smart city applications, especially for traffic management. Accurate prediction of traffic parameters in such environments is critical for optimizing traffic flow, reducing congestion, and enabling efficient resource management. This study presents an approach to the prediction of traffic intensity and occupancy using IoT streaming data, time series analysis, and machine learning algorithms. The proposed method includes preprocessing steps such as data interpolation to handle missing values and temporal alignment, followed by feature extraction and model training using a combination of regression and sampling techniques. Experiments were carried out on a real-world IoT traffic dataset, and the results show significant improvements in the prediction accuracy in terms of MAPE values. It also predicts the complex event of congestion, using a rule-based algorithm. The proposed method can pave the way for smarter and more efficient urban infrastructure.
Keywords:
IoT streaming, traffic data, adaptive, reservoir sampling, regression, predictionDownloads
References
B. Yan and G. Huang, "Supply chain information transmission based on RFID and internet of things," in 2009 ISECS International Colloquium on Computing, Communication, Control, and Management, Sanya, China, Aug. 2009, pp. 166–169.
L. Xiao and Z. Wang, "Internet of things: A new application for intelligent traffic monitoring system," Journal of networks, vol. 6, no. 6, pp. 887–894, 2011.
A. Gupta, R. Birkner, M. Canini, N. Feamster, C. Mac-Stoker, and W. Willinger, "Network Monitoring as a Streaming Analytics Problem," in Proceedings of the 15th ACM Workshop on Hot Topics in Networks, Atlanta, GA USA, Nov. 2016, pp. 106–112.
Y. Vivek, V. Ravi, A. A. Mane, and L. R. Naidu, "ATM Fraud Detection using Streaming Data Analytics." arXiv, Mar. 08, 2023.
A. Moraru and D. Mladenić, "Complex event processing and data mining for smart cities," in 15th International Multiconference on Information Society, Ljubljana, Slovenia, 2012.
K. A. Alaghbari, M. H. M. Saad, A. Hussain, and M. R. Alam, "Complex event processing for physical and cyber security in datacentres - recent progress, challenges and recommendations," Journal of Cloud Computing, vol. 11, no. 1, Oct. 2022, Art. no. 65.
J. Y. C. Wen et al., "A complex event processing architecture for energy and operation management: industrial experience report," in Proceedings of the 5th ACM international conference on Distributed event-based system, New York, NY, USA, Jul. 2011, pp. 313–316.
S. Shetty, "Disaster Management (Early Information about the Occurrence of Tsunami)," International Journal for Research in Applied Science and Engineering Technology, vol. 7, no. 4, pp. 3960–3963, Apr. 2019.
K. A. B. Hamou, Z. Jarir, and S. Elfirdoussi, "Design of a Machine Learning-based Decision Support System for Product Scheduling on Non Identical Parallel Machines," Engineering, Technology & Applied Science Research, vol. 14, no. 5, pp. 16317–16325, Oct. 2024.
H. Cai, S. Shen, Q. Lin, X. Li, and H. Xiao, "Predicting the Energy Consumption of Residential Buildings for Regional Electricity Supply-Side and Demand-Side Management," IEEE Access, vol. 7, pp. 30386–30397, 2019.
O. Etzion and P. Niblett, Event processing in action. Manning Publications, 2011.
J. Dunkel, A. Fernández, R. Ortiz, and S. Ossowski, "Event-driven architecture for decision support in traffic management systems," Expert Systems with Applications, vol. 38, no. 6, pp. 6530–6539, Jun. 2011.
A. Khan, B. Lazzerini, G. Calabrese, and L. Serafini, "Soccer Event Detection," in Computer Science & Information Technology, Apr. 2018, pp. 119–129.
M. Lima, R. Lima, F. Lins, and M. Bonfim, "Beholder – A CEP-based intrusion detection and prevention systems for IoT environments," Computers & Security, vol. 120, Sep. 2022, Art. no. 102824.
Y. Wang and K. Cao, "A Proactive Complex Event Processing Method for Large-Scale Transportation Internet of Things," International Journal of Distributed Sensor Networks, vol. 10, no. 3, Mar. 2014, Art. no. 159052.
G. Lee, U. Yun, and K. H. Ryu, "Sliding window based weighted maximal frequent pattern mining over data streams," Expert Systems with Applications, vol. 41, no. 2, pp. 694–708, Feb. 2014.
N. F. Omran, S. F. Abd-el Ghany, H. Saleh, A. A. Ali, A. Gumaei, and M. Al-Rakhami, "Applying Deep Learning Methods on Time-Series Data for Forecasting COVID-19 in Egypt, Kuwait, and Saudi Arabia," Complexity, vol. 2021, no. 1, 2021, Art. no. 6686745.
P. Arumainayagam, E. Y. A. Charles, and S. R. Kodituwakku, "A Review on processing of data streams," Research Journal of Computer Systems Engineering - An International Journal, vol. 2, no. 2, pp. 73–77, Jun. 2011.
Q. Wang, B. Yan, H. Su, and H. Zheng, "Anomaly Detection for Time Series Data Stream," in 2021 IEEE 6th International Conference on Big Data Analytics (ICBDA), Xiamen, China, Mar. 2021, pp. 118–122.
A. Fais, G. Lettieri, G. Procissi, S. Giordano, and F. Oppedisano, "Data Stream Processing for Packet-Level Analytics," Sensors, vol. 21, no. 5, Jan. 2021, Art. no. 1735.
Q. Zhang, C. Pang, S. Mcbride, D. Hansen, Charles Cheung, and M. Steyn, "Towards Health Data Stream Analytics," in IEEE/ICME International Conference on Complex Medical Engineering, Gold Coast, Australia, Jul. 2010, pp. 282–287.
C. Q. Jin, W. N. Quan, and A. Y. Zhou, "Analysis and management of streaming data: A survey," Journal of Software, vol. 15, no. 8, pp. 1172–1181, Aug. 2004.
J. C. Hardin, "Introduction to Time Series Analysis," National Aeronautics and Space Administration, NAS 1.61:1145, Mar. 1986. [Online]. Available: https://ntrs.nasa.gov/citations/19860014920.
W. F. Velicer and J. L. Fava, "Time series analysis," Research methods in psychology, vol. 2, 2003.
A. S. Singh and M. B. Masuku, "Sampling techniques & determination of sample size in applied statistics research: An overview," International Journal of Economics, Commerce and Management, vol. 2, no. 11, 2014.
S. Ulas and U. M. Diwekar, "Role of sampling in process design, optimization and control," presented at the AIChE Annual Meeting, Nov. 2006.
J. S. Vitter, "Random sampling with a reservoir," ACM Transactions on Mathematical Software, vol. 11, no. 1, pp. 37–57, Mar. 1985.
Y. Tillé, "A general result for selecting balanced unequal probability samples from a stream," Information Processing Letters, vol. 152, Dec. 2019, Art. no. 105840.
Z. Wen, D. L. Quoc, P. Bhatotia, R. Chen, and M. Lee, "ApproxIoT: Approximate Analytics for Edge Computing," in 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS), Vienna, Jul. 2018, pp. 411–421.
"Traffic Monitoring XML Data," informo.madrid.es. https://informo.madrid.es/informo/tmadrid/pm.xml.
S. Mehrmolaei and M. R. Keyvanpour, "Time series forecasting using improved ARIMA," in 2016 Artificial Intelligence and Robotics (IRANOPEN), Qazvin, Iran, Apr. 2016, pp. 92–97.
A. J. Smola and B. Schölkopf, "A tutorial on support vector regression," Statistics and Computing, vol. 14, no. 3, pp. 199–222, Aug. 2004.
M. J. Sax, "Apache Kafka," in Encyclopedia of Big Data Technologies, S. Sakr and A. Zomaya, Eds. Springer International Publishing, 2018, pp. 1–8.
A. de Myttenaere, B. Golden, B. Le Grand, and F. Rossi, "Mean Absolute Percentage Error for regression models," Neurocomputing, vol. 192, pp. 38–48, Jun. 2016.
A. Y. Sun, Z. Zhong, H. Jeong, and Q. Yang, "Building complex event processing capability for intelligent environmental monitoring," Environmental Modelling & Software, vol. 116, pp. 1–6, Jun. 2019.
B. Khazael, H. T. Malazi, and S. Clarke, "Complex Event Processing in Smart City Monitoring Applications," IEEE Access, vol. 9, pp. 143150–143165, 2021.
T. Beckers, "An Introduction to Gaussian Process Models." arXiv, Feb. 10, 2021.
L. Bottou, "Stochastic Gradient Descent Tricks," in Neural Networks: Tricks of the Trade, vol. 7700, G. Montavon, G. B. Orr, and K. R. Müller, Eds. Springer Berlin Heidelberg, 2012, pp. 421–436.
R. G. Brereton and G. R. Lloyd, "Support Vector Machines for classification and regression," The Analyst, vol. 135, no. 2, pp. 230–267, 2010.
K. Thurnhofer-Hemsi, E. López-Rubio, M. A. Molina-Cabello, and K. Najarian, "Radial basis function kernel optimization for Support Vector Machine classifiers." arXiv, Jul. 16, 2020.
Downloads
How to Cite
License
Copyright (c) 2025 V. R. Srividhya, Kayarvizhy

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.