Urban traffic analysis through multi-modal sensing

Pers Ubiquit Comput DOI 10.1007/s00779-015-0833-4 ORIGINAL ARTICLE Urban traffic analysis through multi-modal sensing Mikko Perttunen • Vassilis Kos...

Author: Logan Adams

8 downloads 2 Views 1MB Size

Report

Download PDF

Recommend Documents

Urban Traffic Analysis Using HSL Dataset

Improving traffic behaviour and safety through urban design

Multimodal Oriented Discriminant Analysis

REMOTE SENSING EFFICIENCY FOR URBAN ANALYSIS OF MECCA AND SURROUNDS

URBANEARS URBAN ECOSYSTEM ANALYSIS SUPPORTED BY REMOTE SENSING

Urban Regional Traffic State Analysis Software System Emphasizing Pattern Transition

Traffic Analysis

Traffic maintained through all

Urban Building Inventory Development using Very High-Resolution Remote Sensing Data for Urban Risk Analysis

Selective Perception Policies for Guiding Sensing and Computation in Multimodal Systems: A Comparative Analysis

Traffic Signal Warrant Analysis

APPENDIX K. Traffic Analysis

Efficient Pedestrian Detection in Urban Traffic Scenes

Satellite Remote Sensing Analysis Platform

URBAN TRAFFIC STATE ESTIMATION & PREDICTION. Master Thesis

Optimal Multimodal Fusion for Multimedia Data Analysis

3D Gesture Recognition through RF Sensing

Analysis of Scheduled Airline Traffic

Analysis of traffic in Stockholm

Traffic Morphing: An Efficient Defense Against Statistical Traffic Analysis

Grounding Water: Building Conceptual Understanding through Multimodal Assessment

Information Fusion Based Learning for Frugal Traffic State Sensing

Traffic Morphing: An Efficient Defense Against Statistical Traffic Analysis

Pers Ubiquit Comput DOI 10.1007/s00779-015-0833-4

ORIGINAL ARTICLE

Urban traffic analysis through multi-modal sensing Mikko Perttunen • Vassilis Kostakos Jukka Riekki • Timo Ojala

•

Received: 30 September 2013 / Accepted: 31 January 2014 Ó Springer-Verlag London 2015

Abstract This paper makes contributions toward adopting a systemic view of city-wide ubiquitous systems. Here, we present methods and techniques for combining multiple sensing modalities to measure and model traffic patterns in urban environments. We show how noise in one modality can be reduced by considering another more reliable modality and how two modalities can be combined. While much work in the literature deals with simulated data or small data sets, our work focuses on analyzing data from a permanent data collection infrastructure in a downtown area. We present results using a 3-week data set containing data of two modalities: inductive loop traffic detectors and Bluetooth scanners. Keywords Bluetooth Inductive loop Detection Origin–destination

1 Introduction Our long-term research agenda is to develop methods and techniques for systematically analyzing urban traffic and flows from digital traces. In our work, we adopt a holistic view of the city, perceiving people, spaces and technologies as a system. We argue that building ubiquitous and M. Perttunen V. Kostakos (&) J. Riekki T. Ojala University of Oulu, 90014 Oulu, Finland e-mail: [email protected] M. Perttunen e-mail: [email protected] J. Riekki e-mail: [email protected] T. Ojala e-mail: [email protected]

pervasive systems on city scales can benefit from a systemic understanding of cities, which at the same time must be enabled by reproducible methods, techniques and metrics. A substantial amount of research in our discipline focuses on probes and interventions, studying their effects in particular urban contexts. Such research is aimed at understanding how to design experiences in a city setting. Similarly, substantial work is focused at understanding human behavior in the city with the purpose of better supporting it by designing novel applications and services. A characteristic of these approaches is that the work cannot be systematically generalized or compared across different urban environments in a systematic way. The reason is the lack of a basis of understanding of city-scale ubiquitous systems, but more importantly of techniques to systematically build, compare and assess city-scale ubiquitous systems. While some work has been done in identifying metrics and techniques in this way [13], our discipline is still maturing in this respect. Here, we present the development of methods, techniques and metrics for sensing and modeling of city-scale traffic. Sensing and modeling of city-scale traffic can enable researchers to capture trends and changes in the environment, infer context in urban settings and provide dynamic services to users on the move. An understanding of city-scale traffic is also crucial in describing human mobility in general, and how it may affect the various aspects of infrastructure in a particular city. Cities have a long history of traffic monitoring tools, ranging from traditional traffic surveys to analyzing tickets, traffic cameras and road surface inductive loops. For a large part, traffic analysis relies on knowing where vehicles (and people) reside at any given moment, that is, analysis of location data [17]. A challenge in measuring and

123

Pers Ubiquit Comput

modeling city-scale traffic is the rapid variability of the ‘‘digital urban landscape,’’ with particular wireless technologies and traffic monitoring techniques becoming more or less popular over time. Recently, technologies such as GPS, WiFi, Bluetooth and mobile communication networks have facilitated collecting data on urban mobility. In this paper, we argue that the constant evolution of the deployed technologies suggests that relying on any single urban technology for mapping traffic is likely to be short lived. For instance, while a lot of research has demonstrated the use of Bluetooth for mapping traffic [9, 18, 23], recent decisions of handset manufacturers to limit Bluetooth functionality suggest that this technique may soon be outdated. However, it is likely that location databases continue to grow with the emergence of new proximity technologies. Although proximity technologies offer opportunities for passive, infrastructure-centric monitoring, it should be noted that GPS data is valuable for accurate tracking in wide geographic areas and particularly when studying subjects that naturally lend themselves to continuous GPS tracking, such as taxis [16]. Therefore, we argue that instead of considering a single modality for capturing traffic, we require techniques for multi-modal traffic detection. In other words, tools are needed to systematically take advantage of multiple technologies, whatever those may be now or in the future, to effectively capture urban traffic. In this paper, we demonstrate how two traffic-sensing modalities, inductive loops and Bluetooth scanners, can be combined. While for pragmatic reasons, we are forced to rely on these two modalities due to their current popularity, our contributions generalize to other proximity-based technologies which may become popular in the future. In this paper, we make three contributions: –

–

–

we demonstrate a technique for filtering traffic ‘‘noise’’ in one modality by considering data from a second modality, we show how two modalities can be combined to estimate the volume of traffic at various points in the city center, and we explore the use of a constant conversion factor as well as a classifier model for doing so, we describe a technique for calculating the turning volume on road junctions (i.e., how many cars turn left or right at a specific junction) using one modality to calculate the ratio and another to calculate the volume.

Thus, the paper presents techniques for establishing comprehensive urban traffic volume data from multi-modal sources. This is an important prerequisite for trajectory analysis in geospatial studies. The work we present is grounded in a real-world deployment that took place in a northern European city.

123

2 Related work A number of projects have attempted to accurately reconstruct mobility patterns by exploiting people’s mobile devices. In the past, mobile phone tracking has been used as an approach to measure the flows of passengers between parts of a city and for estimating speeds and travel times [3, 8]. The results typically have low spatial resolution and are most effective for long-distance segments such as highways. Lu [17] categorizes past research in geospatial analysis to three groups; first, in a data-driven approach, spatiotemporal patterns are mined from trajectory data. Another direction of research aims to analyze and model dynamic interactions between people. Third, ‘‘urban study’’ focuses on studying human and vehicular flows in cities. The third category best describes the aim of this paper, as our focus is on techniques for establishing comprehensive urban traffic volume data from multi-modal sources. A very popular approach for mapping traffic has been the use of proximity-based technologies, such as Bluetooth or WiFi traces [1, 4, 6, 13, 19, 20, 25]. These studies suggest that due to its current popularity and widespread usage, Bluetooth technology is not only useful for capturing individual mobility traces, but can be also used to analyze the spatiotemporal behavior of masses. Gauging the popularity of a technology such as Bluetooth is challenging and is likely to be a moving target. It is important to note that Bluetooth devices may operate in non-discoverable mode, and hence not be detectable. This means that only a subset of existing Bluetooth devices is technically observable. Estimates show the ratio of observable Bluetooth devices to range between 2 % for Bremen, Germany to 7 % for Bath, UK [21, 22]. These results indicate that while potentially a great subset of the population has Bluetooth-capable devices, ceteris paribus only a small portion keeps their Bluetooth devices in discoverable mode. We note that while 7 % is not necessarily a big portion of the population, nevertheless it is potentially greater than the approximate 3 % of the population that traditional transport surveys cover in any particular region. However, mobile handset manufacturers have recently opted to substantially limit the functionality on Bluetooth on their handsets. For instance, iOS devices by Apple are typically not detectable by Bluetooth, while recent Android devices are by default limited to a small time window of a few seconds when they are detectable. Despite these developments, Bluetooth remains heavily used for traffic monitoring in the context of highways and major transport arteries, where the deployment of Bluetooth scanners at strategic locations allows for the approximation of macro-travel behavior [9, 18, 23]. Similarly, Barcelo et al. [2] made use of statistical methods

Pers Ubiquit Comput

(e.g., Kalman filtering) in order to estimate traveling time and origin–destination (OD) matrices in highways. In [5], opportunities for signal timing improvement were studied by identifying time periods with long travel times using Bluetooth-based vehicle re-identification, and in [12], Bluetooth was used to estimate when passengers get on and off a public transport bus. An important challenge that urban traffic sensor systems face is the threat of any single technology, such as Bluetooth, declining in popularity. For this reason, it is important that we have techniques and methods to complement multiple modalities and use each one’s strength for improving our overall understanding of traffic and mobility. Most previous work that has relied on proximity technologies such as Bluetooth has used it as the sole modality, and possibly relied on manual observations for verification, or post-hoc correlation with other aggregated traffic data for cross-validation. In our work, we demonstrate how two distinct real-time modalities can be used to improve our understanding and cancel out each other’s weaknesses.

3 Experimental setup In this section, we describe our experimental setup, located at the downtown area of a northern European city. Our setup consisted of inductive loop traffic detectors installed at traffic light intersections, and a network of Bluetooth access points deployed across the city center. For this study, we obtained a 3-week access to the live data from the inductive loop detectors and the Bluetooth access points, between September and October in 2011.

only able to obtain aggregated data from the inductive loops. The data that was made available to us was an aggregate of vehicle detection events over certain time periods, spanning between 2 and 15 min depending on the intersection. Overall, we obtained access to both downtown area sensors (which were dense) as well as main artery sensor (which were quite sparse). Figure 2 shows an extract of the downtown map schematic with intersections containing inductive loops annotated. 3.2 Bluetooth scanners For the purposes of this study, a total of 28 Bluetooth ‘‘scanners’’ in the city center were used, as shown in Fig. 3. These were hardware enclosures installed on top of traffic lights or lamp posts, and typically, they were placed at 3–4 m from the ground level. The enclosures contained a BlueGiga 2293 Bluetooth access server with three independent transceivers. The transceivers were programed to conduct continuous discovery cycles of 10.24 s, as specified in the Bluetooth protocol standard. From each scanning cycle, the data recorded was the time and unique ID of the discovered devices. The data was sent in real time to a central server that aggregated all data from the study. Figure 4 shows a photograph of one intersection with a Bluetooth scanner and a close-up snapshot of another location where the scanner is installed on a traffic light post. Traffic light posts are very convenient places to deploy such technology, because they are a source of constant power supply, they allow for protection from vandalism since the hardware can be placed out of reach of humans, and usually, provide an unobstructed environment with clear lines of sight and optimal conditions for proximity technologies to operate.

3.1 Inductive loop traffic detectors 3.3 Data collection Inductive loops are typically installed below the road surface, and using magnetic fields is able to detect the presence of vehicles above them. They are configured to operate in a binary state (car present/not present). While not the case in our study, some advanced hardware and setups (e.g., two consecutive loops) are able to detect the length of a vehicle, and therefore, separate buses and trucks from passenger cars. The principle of inductive loops is shown in Fig. 1. In our study, the city provided us access to sensors at 123 traffic light intersections. Depending on the size of the intersection and the total number of lanes, each intersection had up to 32 individual loop detectors, with each lane in each direction typically having two or more loop detectors. In this configuration, the sensors can identify static traffic waiting at traffic lights, and therefore, can prioritize traffic based on volume. Due to technical restrictions, we were

We now provide a brief summary of our data collection mechanisms, and how the sensed data was prepared for analysis. Raw data from the inductive loops was captured, processed and stored by a commercial traffic monitoring system purchased by city’s central administration. We were provided with API access to this data, and subsequently, we built a real-time polling mechanism to continuously fetch new data from the city administration and store it on our own server. Each data point we were able to retrieve contained a unique ID assigned to the particular inductive loop detector, two timestamps (start and end) and the number of vehicles detected between the two particular timestamps. Raw data from the Bluetooth access points was sent directly to our server for storage. In its raw format, the data included a timestamp, a unique identifier of the scanner and

123

Pers Ubiquit Comput Fig. 1 Inductive loops are installed below the road surface and rely on spikes in their electromagnetic fields to detect the presence of vehicles

Fig. 2 Excerpt from the downtown area of the study. The intersections with inductive loops are shown on the map as green dots. Each dot represents from 2 to 32 inductive loop sensors installed in the intersection (color figure online)

the unique identifier of the discovered device. To facilitate further analysis, the data was treated in two ways, following the work in [22]. First, the data was obfuscated by replacing the Bluetooth ID of the device by a unique, database-specific identifier. In addition, consecutive detection events below a certain temporal threshold (in this case, 5 min) were treated as a single discovery event and were assigned a duration value. These are referred to as ‘‘sessions.’’ This approach has the advantage of making it easy to separate static from moving devices simply by considering the duration of the session. As such, vehicles will record relatively shorter sessions when passing by a Bluetooth scanner, while pedestrians’ sessions will be relatively longer as pedestrians move slower than vehicles. In addition to the raw and processed data stored in real time by our server, we also stored the geographic coordinates of each inductive loop and Bluetooth scanners.

123

Fig. 3 The Bluetooth scanners shown on the map as markers

Fig. 4 Left: Aerial view of a junction where the Bluetooth scanner was installed. The location of the scanner is indicated by the red circle. Right: a close-up view of the Bluetooth scanner installed on a lamp post also hosting traffic lights (color figure online)

Pers Ubiquit Comput

There are certain intricacies of the two sensing modalities which we need to highlight. Since the data is unfiltered, we expect that in the Bluetooth data some devices belong to pedestrians and not cars. In the case of the intersection data, we expect that at each intersection a car may be registered more than once if, for example, a single lane has more than one inductive loop detector. As also suggested in prior work [15], a few busy locations and intersections generated orders of magnitude more data than others.

4 Analysis Our analysis first presents a comparison of the data sets we have collected via each modality. We then discuss ways in which these two modalities can be combined to estimate the volume of traffic at various points in the city center, and we explore the use of a constant conversion factor as well as a classifier model for doing so. Finally, we use these findings to develop a technique for calculating the turning ratios at intersections across the city center. 4.1 Correlating two modalities In Fig. 5, we show the variation of recorded data during the course our study. Here, the diagrams show weekly variations in the data and demonstrate the variability of the data. Weekdays were much busier than weekends in our study, a finding which was expected due to the nature of the city and its weekly patterns. We expect certain other types of

Daily volumes 1500

1000

500

11-09-24

11-09-25

11-09-26

11-09-27

11-09-28

11-09-29

11-09-30

11-10-01

11-09-24

11-09-25

11-09-26

11-09-27

11-09-28

11-09-29

11-09-30

11-10-01

11-10-04

11-09-23 11-09-23

11-10-04

11-09-22 11-09-22

11-10-03

11-09-21 11-09-21

11-10-03

11-09-20 11-09-20

11-10-02

11-09-19 11-09-19

11-10-02

11-09-18 11-09-18

11-09-17

11-09-16

11-09-15

11-09-14

11-09-13

0

4

10

x 10

8 6 4 2 Loop detectors 11-09-17

11-09-16

11-09-15

11-09-14

0 11-09-13

volume

Bluethooth scanner

volume

Fig. 5 Data collected during the course of 3 weeks from a Bluetooth scanner and inductive loop detectors in an intersection. The x-axis shows the date, and the y-axis the volume of data collected. Note the different scales in the top and bottom graphs

areas such as weekend getaway locations to have an inverted weekly pattern. In Figs. 6 and 7, we show a breakdown of the recorded activity per hour of day at the sample intersection shown earlier in Fig. 4 (left), for both modalities. In these figures, we show for the 3-week period each day as a line running across the graph, showing the volume of detections at that location as a function of time. The red lines represent the average volume of detected events. The correlation between the values of the two red lines (means of hourly volumes) is shown in Fig. 8. Note that the y-intercept was set to zero when fitting the regression, because both volumes should naturally be zero, when no vehicles pass by. Despite the fact that no filtering has been done on this data, we achieve a correlation of R2 ¼ 0:83 on the averages of the data. However, in Fig. 9, we see the correlation drops to R2 ¼ 0:63 when we consider the raw hourly data between the modalities. This discrepancy suggests that hourly comparisons of two modalities produces noisy results, while averaging the data over longer periods produces much more coherent results. It is also interesting to note that in our case the Bluetooth raw data fluctuated more than the inductive loop data, with the weekday and weekend data forming two distinct clusters in Figs. 6 and 7. We should also note that in both modalities we had occasional system outages resulting in no data being recorded for short periods of time, typically less than an hour. Outages were attributed to networking disconnections and software glitches. Incomplete data was removed.

123

Pers Ubiquit Comput Fig. 6 Mean of hourly Bluetooth detection volume over the study period. The red line represents the average values (color figure online)

Hourly Bluetooth detection volumes for scanner 1196 from 2011-09-13 to 2011-10-04 140 Saturday and Sunday Weekdays Mean 120

volume

100

80

60

40

Fig. 7 Mean of hourly inductive loop detection volume over the study period. The red line represents the average values (color figure online)

23:00

00:00

21:00

22:00

20:00

18:00

19:00

16:00

17:00

14:00

15:00

12:00

13:00

11:00

10:00

09:00

07:00

08:00

05:00

06:00

04:00

03:00

02:00

01:00

0

00:00

20

Hourly loop detection volume for intersection 16, from 2011-09-13 to 2011-10-04 9000 Saturday and Sunday Weekdays Mean

8000

7000

volume

6000

5000

4000

3000

2000

123

23:00

00:00

22:00

21:00

20:00

19:00

17:00

18:00

16:00

15:00

14:00

13:00

12:00

11:00

10:00

08:00

09:00

07:00

06:00

05:00

04:00

03:00

02:00

01:00

0

00:00

1000

Pers Ubiquit Comput Fig. 8 Correlation between the two modalities. Graph shows the number of Bluetooth devices and cars detected per hour for the same location during our study. Each point represents an average of 21 values (one per day of our study)

Correlation of mean hourly Bluetooth and loop detector volume from 2011-09-13 to 2011-10-04 7000 hourly data points 2

y = 73.90x, R = 0.8279

Hourly loop detection volume

6000

5000

4000

3000

2000

1000

0

0

10

20

30

40

50

60

70

80

90

100

Hourly Bluethooth detection volume

Fig. 9 Correlation between the two modalities. Graph shows the number of Bluetooth devices and cars detected per hour for the same location during our study. Each point represents a single 1-h slot during our study

Correlation of hourly Bluetooth and loop detector volume from 2011-09-13 to 2011-10-04 9000 hourly data points 2

y = 68.12x, R = 0.6316 8000

Hourly loop detection volume

7000

6000

5000

4000

3000

2000

1000

0

0

20

40

60

80

100

120

140

Hourly Bluethooth detection volume

123

Pers Ubiquit Comput

4.2 Multi-modal estimation of traffic volume Our approach to multi-modal traffic volume estimation is to use the less noisy modality to improve the reliability of the noisier modality. In our case, we use inductive loop detectors to improve the reliability of the Bluetooth modality. As shown in the previous section, raw multimodal data can be correlated, even when one modality may be noisier than the other. In our case, we believe the Bluetooth modality to be much noisier than inductive loops. Thus, we hypothesize that given appropriate noise filtering, we can estimate the (total) traffic flow through an intersection by multiplying the traffic volume detected by the noisy modality (e.g., Bluetooth devices) by a specific factor, i.e., TrafficFlow ¼ k TrafficFlowBT . In order to establish an estimate for k, accurate estimates of the vehicle traffic counts from both modalities are required. The first step in our analysis is to estimate traffic volume or the number of vehicles traveling through an intersection at the sites we are monitoring. We achieved this by handpicking a subset of inductive loops at each intersection, such that each vehicle is counted only once. For example, if a particular lane had three inductive loops leading up to the traffic lights, we only considered the loop farther away from the traffic lights. The city has installed multiple inductive loops in each traffic lane to determine the number of cars waiting at a traffic light, so unfortunately a moving vehicle will trigger all loops and will be registered more than once. Analysis of the data showed that the variation of the readings between inductive loops in the same lane is about one vehicle per hour, and therefore, we believe that the collected data from a single inductive loop is reliable. Subsequently, we filtered the data from the Bluetooth modality to closely match the ground truth obtained from the filtered inductive loop detector data. In this case, the filtering of the Bluetooth data was based on the session duration: By adapting the cutoff point we are able to selectively discard data that we believe is attributed to slow-moving pedestrians in the environment. The result of this process is shown in Fig. 11. The best correlation results were obtained by using a cutoff point of 30 s, giving a correlation between the per-hour number of Bluetooth devices detected and cars detected at R2 ¼ 0:6366 (Fig. 10), thus providing an approximate 0.5 % increase in the quality of the correlation reported on the right of Fig. 8, and resulting in k ¼ 75:95. We applied this technique to multiple locations across the city, including noisier environments where more pedestrians passed by our Bluetooth scanners. We found that the ideal cutoff point ranges between 20 and 40 s depending on the setting (Fig. 11).

123

We further hypothesized that while this correlation between the two modalities only considers the actual number of devices detected by the Bluetooth scanner, improvements in the correlation can be made by adding more information into the prediction. Using a tenfold crossvalidation with a rule-based classifier (M5Rules), we obtained R2 ¼ 0:68 when using only Bluetooth data as the predictor values. By incorporating the hour-of-day into the model, we were able to achieve R2 ¼ 0:88, and by also utilizing the day of week in the model, we obtained in R2 ¼ 0:95. 4.3 Multi-modal estimation of turning ratios Next, in our analysis, we attempt to estimate the number of cars that turn at a particular intersection. This so-called turning ratio is important in the sense that it can be used to understand and predict driver behavior [14] and can be used to extend the scope of a particular model. For example, due to the typical placements of inductive loops, on their own they are often not sufficient for calculating turning ratios. In many cases, a street may have inductive loops shortly before the traffic lights at an intersection, but not immediately after the intersection. Moreover, many inner-city intersections do not have separate lanes for turning traffic. In some cases, additional inductive loops may be located further down the road, for example, at the next traffic lights. In this case, it may be possible to derive a turning ratio, but this may be inaccurate depending on whether there are many possible exits for cars between the two inductive loop locations. Our technique for this kind of analysis across the whole city is to construct a variant of an origin–destination matrix using a modality that supports unique identifiers. In our case, the Bluetooth modality supports unique identifiers which we can use to reconstruct the journeys of the detected vehicles at various points in the city—which is not possible with inductive loops alone. Thus, we derive matrix M with dimensions n n, where n is the number of locations in the city where we have an observation point (in our case a Bluetooth scanner). In M, the elements ði; iÞ on the main diagonal represent the number of devices detected at location i. Typically, in traditional OD matrices, all other elements ðk; jÞ would represent the number of devices observed transitioning from location k to location j [27]. However, in our matrix M, these cells contain a richer structure describing all detected routes from k to j. These are represented as walks through the graph specified by the scanners (vertices) and their ordered pairwise connections (edges). We do not impose restrictions on the walks relative to the physical constrains on connections between the scanners (e.g., one-way streets), and thus, the underlying

y = 83.85x, R = 0.6333

5000

0

0

20

40 60 80 100 Hourly Bluetooth detection volume

120

Max session duration limit: 30s 10000 2

y = 75.95x, R = 0.6366

5000

0

0

20

40 60 80 100 Hourly Bluetooth detection volume

120

140

Max session duration limit: 120s 10000 2

y = 70.07x, R = 0.6287

5000

0

0

20

40 60 80 100 Hourly Bluetooth detection volume

120

140

Hourly loop detection volume

2

Hourly loop detection volume

Max session duration limit: 1s 10000

Hourly loop detection volume

Hourly loop detection volume

Hourly loop detection volume

Hourly loop detection volume

Pers Ubiquit Comput

Max session duration limit: 10s y = 83.85x, R2 = 0.6333

0

20

40 60 80 100 Hourly Bluetooth detection volume

120

Max session duration limit: 60s 2

y = 72.03x, R = 0.6353

0

20

40 60 80 100 Hourly Bluetooth detection volume

120

140

120

140

Max session duration limit: Infs 2

y = 68.12x, R = 0.6316

0

20

40 60 80 100 Hourly Bluetooth detection volume

Fig. 10 Correlations between the mean number of cars detected per hour and the number of Bluetooth devices detected per hour, filtered by enforcing different maximum Bluetooth session durations

Filtering out static devices in the environment for scanner 1196 from 2011-09-13 to 2011-10-04 2500 < 1s < 10s < 30s < 60s < 120s < Infs max-min

2000

1500

volume 1000

500

23:00

22:00

21:00

20:00

19:00

18:00

17:00

15:00

16:00

13:00

14:00

11:00

12:00

09:00

10:00

08:00

07:00

06:00

05:00

04:00

03:00

02:00

00:00

0 01:00

Fig. 11 The top group of lines indicates different thresholds for filtering static devices in the environment. Starting from the top line representing all data, we subsequently discard devices that were near the scanner from more than 120, 60, 30 and 10 s respectively. The line at the bottom shows the amount of devices removed in the most aggressive filtering approach (1 s)

123

Pers Ubiquit Comput

graph is undirected. The walks may also visit the intermediate vertices (scanners) multiple times. As a result, each cell in M may contain multiple walks through the street graph, indicating the volume and durations for each route that vehicles were observed taking from k to j. Visualization of a segment of the OD-matrix M is shown in Fig. 12. The thicker lines on the edges of the street network correspond to higher volumes of traffic on main streets, whereas the lines across the center of the figure represent routes through downtown with less volume, routes around downtown from detector to detector far apart or possibly to some degree detection misses at some detection points on the route. Given M, it is possible to systematically infer turning ratios for any particular intersection in our study. For instance, consider a sample intersection shown in Fig. 13. In this case, we can estimate the turning ratio (from A to D via B) by looking up in M the values for a ¼ ðA; CÞ, where the walk is ðA; B; CÞ (number of vehicles continuing forward), and b ¼ ðA; DÞ (number of vehicles turning right), with the associated walk ðA; B; DÞ. Given these values, the turning ratio can be calculated as T r ¼ b=ða þ bÞ, which can also be expressed as a percentage. Hence, we have used one modality to estimate the turning ratio at the intersection, and using the second modality (in our case, the inductive loops), we can convert the ratio into an absolute volume. This is done simply by multiplying the ratio with the number of vehicles detected at position A by an inductive loop. We tested the accuracy of this technique by considering an intersection in our study where we had extensive data from both modalities. Specifically, the intersection was similar in layout to the one shown in Fig. 13 and had Bluetooth data from all directions leading to the intersection. In addition, the intersection contained separate lanes and dedicated inductive loop detectors for turning traffic,

Fig. 12 An origin–destination matrix superimposed on a map of the downtown area. The blue triangles indicate the location of the Bluetooth scanners, while lines connecting them indicate vehicle transitions. Relative volume of traffic from low to high is depicted using a range from thin blue lines to thick pink lines, respectively (color figure online)

123

Fig. 13 Turning ratio is given in relation to the total traffic through an intersection and describes the proportion of cars making a right turn toward D

which we used to derive the ground truth turning ratio in this case. In the experiments, we attempted to calculate the ratio of cars moving from location A toward C that made a right turn toward D. Table 1 shows the results of using our technique to estimate the ratio of devices turning at the particular junction. The results suggest that for a single day of observations the error is approximately 10 %, but we observed a sharp decrease in the error rates when considering longer observation periods. For instance, for a 3-week observation period, we observed a threefold decrease in the error rate down to 3.5 %.

5 Discussion This paper presents techniques for establishing comprehensive urban traffic volume data from multi-modal sources. This is an important prerequisite for trajectory analysis in geospatial studies. A number of previous projects have considered using proximity-based technologies, such as Bluetooth or WiFi traces, to capture urban mobility [1, 4, 6, 13, 19, 20, 25]. We argue than any single of these technologies may become obsolete in the future, making them less attractive tools. For example, recently, manufacturers have reduced the Bluetooth functionality on mobile handsets by limiting their discoverability. Despite this, some characteristics of proximity-based technologies, such as the use of unique identifiers, still make them attractive. We therefore argue in favor of developing techniques that can take advantage of multiple modalities in detecting urban traffic. This approach has two main advantages. First, as old technologies become obsolete and new technologies appear, our techniques can adapt appropriately. Second, as our work shows, using multiple modalities has the benefit of using one modality’s advantage to improve another modality’s weakness. Here, we make three contributions toward multi-modal traffic sensing and modeling. First, we demonstrate a technique for filtering noise in one modality by considering

Pers Ubiquit Comput Table 1 Results on estimating turning ratio

Test

Duration

1

1 day (10 samples)

2 Error rates are relative to ground truth turning ratios derived using inductive loop detectors

3

1 week (3 samples) 3 weeks (1 sample)

Method

Turning ratio (std)

Error (%)

Bluetooth scanners

0.400 (0.142)

9.7

Inductive loop detectors

0.497 (0.026)

Bluetooth scanners

0.436 (0.039)

Inductive loop detectors

0.504 (0.014)

Bluetooth scanners

0.450 (–)

Inductive loop detectors

0.485 (–)

6.8 3.5

data from a second, more reliable, modality. We then show how data from multiple modalities can be combined by using either a constant conversion factor or a classifier model. Finally, we demonstrate how two modalities can work together to give us improved junction turning ratios.

calculate the adequacy of sample sizes, and their quality [10], albeit in the context of traditional traffic counting. These guidelines indicate that samples small in size are not of adequate quality and suggest increasing observation periods to make the samples more reliable [11].

5.1 Sources of error

5.2 Reconciling modalities

In our analysis, we found that there are multiple ways of fusing multi-modal data, particularly by applying different temporal binning techniques. For instance, in Figs. 8 and 9, we observed that applying an hourly or daily binning results in different apparent correlation between the two modalities. Our analysis of these two models has indicated the adequacy of the linear model, as the residuals follow a normal distribution and 95 % are contained within the bounds defined by the standard error. Effectively, these results indicate an increase of correlation strength as the binning windows become larger. While it has been previously shown that relying on averages per time period leads to spurious auto-correlation [26], this is not the case here. Here, the sampling window is adjusted to calculate the correlation between the two modalities, hence the data is not averaged but aggregated. Therefore, both binning strategies are valid and increasing the window size does not introduce bias since the data is aggregated, not averaged [26]. The apparent relationship between observation window size, data correlation and sampling error suggests that short observation periods, or indeed observation periods with little activity, are likely to produce unreliable results. We also observed this effect when conducting our tests for estimating turning rations (Table 1), where an increase in the observation time frame resulted in decreasing standard deviation and error. This is likely due to both modalities having some uncertainty of detection, causing the detected volumes in some short time periods to match, while when one modality senses too high volume and the other too low volume in a short period, would the errors accumulate. Hence, it is advisable that observation periods with adequate activity should be used in such analyses. Our findings are consistent with existing guidelines in terms of transport engineering, where explicit measures are described to

In practice, the situation is often that data from one type of a sensor is available from some areas, while other areas are covered by another sensor modality. Our analysis showed that there is more than one way to convert data and aggregated results between modalities, and in particular, we experimented with both using a constant factor and a more elaborate machine learning classifier model. More specifically, our analysis showed a linear fit is indeed a simplistic but also simplified way of establishing a relationship between the two data sets. In particular, the use of a constant does not seem to account for about 30 % of the variation in traffic, which we attribute to daily and weekly variations. Conversely, a rule-based classifier can achieve correlations of R2 ¼ 0:95, arguably because it can account for daily and weekly variations in the data set. While a classifier approach can increase the accuracy of estimations, using a constant in place of a classification model makes analysis more transparent, easier to communicate and easier to deduce. At the cost of loss in accuracy, one may opt to use a constant because it allows for direct comparison across urban contexts and perhaps over different cities. This way, we can identify a constant k that we can use to improve the reliability of a noisy modality by transforming its data into more accurate traffic volume estimations. According to the correlation shown in Fig. 10, using the optimum cutoff value of 30 s, we establish that k ¼ 75:95 for the two modalities in our study, with a fit of about R2 ¼ 0:64. We expect these to differ across contexts or with different modalities. 5.3 Beyond Bluetooth and beyond sensing We argue that as existing technologies become obsolete, new technologies will become a more attractive mechanism for sensing urban traffic. Just as in the case of

123

Pers Ubiquit Comput

Bluetooth, we expect that no single modality will be without its own flaws, constraints and skewness. For instance, while WiFi has been used to track urban mobility [1], it suffers from many similar problems of demographic skewing as Bluetooth. In general, technologies tied to mobile devices are likely to suffer from demographic skewing. Moving beyond communication technologies, researchers have considered using shared social data, such as geotagged photographs [7, 24], to reconstruct and model urban mobility. However, these suffer from low spatial density and are also likely to be skewed toward certain parts of the population such as tourists in the case of photographs. Furthermore, online services such as Google have relied on effectively crowdsourcing traffic monitoring by considering the real-time geotagged speed of Android users to build up a picture of traffic conditions across the street network. Yet again, this approach can lead to skewed results since it requires high-end handsets, which are typically expensive and not affordable by parts of the population. Looking beyond the intricacies that any future modality may have, our work provides the methodological grounds for noise filtering of proximity-based modalities. A particular characteristic of proximity-driven modalities and technologies (be they may Bluetooth, WiFi, ZigBee, RFID and NFC) is that for the purposes of traffic detection a polling approach must be established. In particular, data is collected by constantly searching the environment for nearby devices using the proximity-based technology. Therefore, in this case, the collected data will inherently have temporal characteristics, and subsequently, temporal thresholds can be used to remove some of the noise. The technique we have described here for noise removal, which involves iterative thresholding of the data until a good correlation is achieved with a more reliable modality, can be used with other proximity-based technologies that may be popular in the future. Moving beyond sensing altogether, researchers have considered the use of predictive tools in urban traffic analysis. These can be very useful for situations where there is simply no sensing modality available. For instance, recent work [14] has demonstrated a turn prediction algorithm based on the assumption that drivers tend to choose roads that offer more destination options. This reflects the intuition that turning onto a short, dead-end road is relatively rare compared with turning onto a highway ramp. Similarly, the work in [28] suggests that a substantial component of driving behavior and choices relates to the structure of the road network itself, and therefore, it is possible to make probabilistic predictions about which routes people choose to take.

123

6 Conclusion and ongoing work In this paper, we demonstrate techniques to analyze traffic in the urban areas from sensor data. Our techniques are designed to exploit multiple modalities, therefore having the advantage of not being limited to the shortcomings of any single modality. The research was validated in a realworld setting, where we tested our techniques with inductive loop traffic detectors and Bluetooth sensing. We show how noise in one modality can be reduced by considering another more reliable modality, how two modalities can be combined and how two modalities can work hand-in-hand to address more complex issues, such as in the case of calculating turning ratios. We are now interested in exploring data fusion with additional modalities, such as WiFi and GPS, in order to study to which extent each additional modality improves the ability to reliably reconstruct movement trajectories in urban areas.

References 1. Balazinska M, Castro P (2003) Characterizing mobility and network usage in a corporate wireless local-area network. ACM Press, New York, pp 303–316 2. Barcelo J, Montero L, Marques L, Carmona C (2010) Travel time forecasting and dynamic od estimation in freeways based on bluetooth traffic monitoring. Transp Res Rec 2175(1):19–27 3. Caceres N, Wideberg JP, Benitez FG (2007) Deriving origin– destination data from a mobile phone network. Eng Technol 1(1):15–26 4. Chaintreau A, Hui P, Crowcroft J, Diot C, Gass R, Scott J (2007) Impact of human mobility on opportunistic forwarding algorithms. IEEE Trans Mob Comput 6(6):606–620 5. Day CM, Haseman R, Premachandra H, Brennan TM, Wasson JS, Sturdevant JR, Bullock DM (2010) Visualization and assessment of arterial progression quality using high resolution signal event data and measured travel time. Transp Res Rec 10–0039(2192):1–30 6. Eagle N, Pentland A (2006) Reality mining: sensing complex social systems. Pers Ubiquitous Comput 10:255–268 7. Girardin F, Calabrese F, Fiore F, Ratti C, Blat J (2008) Digital footprinting: uncovering tourists with user-generated content. Pervasive Comput IEEE 7(4):36–43 8. Gonzalez MC, Hidalgo CA, Barabasi A-L (2008) Understanding individual human mobility patterns. Nature 453(7196):779–782 9. Haghani A, Hamedi M, Sadabadi KF, Young S, Tarnoff P (2009) Data collection of freeway travel time ground truth with bluetooth sensors. Transp Res Rec 2160(–1):60–68 10. H.D. R (1994) ITE manual of transportation engineering studies. Prentice Hall, Englewood Cliffs 11. O. J.C. (1976) Sample size determination for travel time and delay studies. Traffic Eng. 46(9) 12. Kostakos V, Camacho T, Mantero C (2013) Towards proximitybased passenger sensing on public transport buses. Pers Ubiquitous Comput, pp. 1–10

Pers Ubiquit Comput 13. Kostakos V, Nicolai T, Yoneki E, O’Neill E, Kenn H, Crowcroft J (2009) Understanding and measuring the urban pervasive infrastructure. Pers Ubiquitous Comput 13:355–364 14. Krumm J (2010) Where will they turn: predicting turn proportions at intersections. Pers Ubiquitous Comput 14:591–599 15. Lammer S, Gehlsen B, Helbing D (2006) Scaling laws in the spatial structure of urban road networks. Phys A Stat Mech Appl 363(1):89–95 16. Liu L, Andris C, Ratti C (2010) Uncovering cabdrivers’ behavior patterns from their digital traces. Comput Environ Urban Syst 34(6):541–548 17. Lu Y, Liu Y (2012) Pervasive location acquisition technologies: opportunities and challenges for geospatial studies. Comput Environ Urban Syst 36(2):105–108 18. Martchouk M, Street NC, Suite NE (2010) Analysis of freeway travel time variability using bluetooth detection. J Transp Eng 2051(July):1–30 19. McNett M, Voelker GM (2005) Access and mobility of wireless PDA users. ACM SIGMOBILE Mob Comput Commun Rev 9(2):40 20. Nicolai T, Kenn H (2007) About the relationship between people and discoverable bluetooth devices in urban environments. In: Mobility. ACM, New York, pp 72–78 21. Nicolai T, Yoneki E, Behrens N, Kenn H (2006) Exploring social context with the wireless rope, vol 4277. Springer, Berlin

22. O’Neill E, Kostakos V, Kindberg T, Schieck AF gen, Penn, Fraser ADS, Jones T (2006) Instrumenting the city: developing methods for observing and understanding the digital cityscape. In: Ubicomp, vol 4206. Springer, pp 315–332 23. Quayle S, Koonce P (2010) Arterial performance measures using MAC Readers-Portland’s experience. Kittelson & Associates, Inc., Portland 24. Rattenbury T, Good N, Naaman M (2007) Towards automatic extraction of event and place semantics from flickr tags. In: ACM SIGIR conference on research and development in information retrieval, SIGIR ’07, New York, NY, USA, ACM, pp 103–110 25. Versichele M, Neutens T, Delafontaine M, de Weghe NV (2012) The use of bluetooth for analysing spatiotemporal dynamics of human movement at mass events: a case study of the ghent festivities. Appl Geogr 32(2):208–220 26. Working H (1960) Notes on the correlation of first differences of averages in a random chain. Econometrica 28(4):916–918 27. Zhou X, Mahmassani H (2006) Dynamic origin-destination demand estimation using automatic vehicle identification data. Intell Transp Syst IEEE Trans 7(1):105–114 28. Ziebart BD, Maas AL, Dey AK, Bagnell JA (2008) Navigate like a cabbie: probabilistic reasoning from observed context-aware behavior. In: Ubicomp. ACM, New York, pp 322–331

123