A Scalable Platform for Big Data Analysis in Public Transport

Concurrency and Computation: Practice and Experience

Summary: Any life event or action can be seen as a potential source of data to analyze. By analyzing such data, we can gain insights into the facts. The situation is no different in public transport. Researchers working in the fields of transport and traffic have stated that such an analysis would be invaluable in designing urban transport and particularly in adapting to current changes. In this study, a scalable public transport analysis platform named Cermoni is developed using the Apache Beam programming model. It can analyze in near-real-time smart card and vehicle location data collected, classified as big data with its high production speed. The performance of the platform was tested on Google Cloud Dataflow service using real-world data gathered from Konya, one of the largest metropolitan cities in Turkey, and the results are discussed in detail.