Development and Comparison of Air Pollution Exposure Surfaces

Feb 23, 2018 - We developed four LUR models and exposure surfaces, for the two pollutants and measurement protocols. Coefficients of determination (R2...
0 downloads 9 Views 955KB Size
Subscriber access provided by - Access paid by the | UCSB Libraries

Article

DEVELOPMENT AND COMPARISON OF AIR POLLUTION EXPOSURE SURFACES DERIVED FROM ON-ROAD MOBILE MONITORING AND SHORT-TERM STATIONARY SIDEWALK MEASUREMENTS Laura Minet, Rick Liu, Marie-France Valois, Junshi Xu, Scott Weichenthal, and Marianne Hatzopoulou Environ. Sci. Technol., Just Accepted Manuscript • DOI: 10.1021/acs.est.7b05059 • Publication Date (Web): 23 Feb 2018 Downloaded from http://pubs.acs.org on February 26, 2018

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Environmental Science & Technology is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 30

Environmental Science & Technology

1 2 3 4 5 6

DEVELOPMENT AND COMPARISON OF AIR POLLUTION EXPOSURE SURFACES DERIVED FROM ON-ROAD MOBILE MONITORING AND SHORT-TERM STATIONARY SIDEWALK MEASUREMENTS

7

Marianne Hatzopoulou1*

Laura Minet1, Rick Liu1, Marie-France Valois2, Junshi Xu1, Scott Weichenthal3,

8 1

9 10

2

Department of Civil Engineering, University of Toronto, Toronto, Ontario, Canada

Division of Clinical Epidemiology, Faculty of Medicine, McGill University, Montreal,

11 12 13

Quebec, Canada 3

Epidemiology, Biostatistics & Occupational Health, Faculty of Medicine, McGill University, Montreal, Quebec, Canada

14

*Corresponding author, Department of Civil Engineering, University of Toronto

15

35 St George Street, Toronto, ON M5S 1A4

16

Telephone: 416-978-0864. Email: [email protected]

1 ACS Paragon Plus Environment

Environmental Science & Technology

Page 2 of 30

17

ABSTRACT

18

Land-use regression (LUR) models of air pollutants are frequently developed based on short-

19

term stationary or mobile monitoring approaches, which raises the question of whether these two

20

data collection protocols lead to similar exposure surfaces. In this study, we measured Ultrafine

21

Particles (UFP) and Black Carbon (BC) concentrations in Toronto during summer 2016, using

22

two short-term data collection approaches: mobile, involving 3,023 road segments sampled on

23

bicycles, and stationary, involving 92 sidewalk locations. We developed four LUR models and

24

exposure surfaces, for the two pollutants and measurement protocols. Coefficients of

25

determination (R2) varied from 0.434 to 0.525. Various small-scale traffic variables were

26

included in the mobile LUR. Pearson correlation coefficients between the mobile and stationary

27

surfaces were 0.23 for UFP and 0.49 for BC. We also compared the two surfaces using personal

28

exposures from a panel study in Toronto conducted during the same period. The personal

29

exposures differed from the outdoor exposures derived from the combination of GPS information

30

and exposure surfaces. For UFP, the median for personal outdoor exposure was 26,344 part/cm3,

31

while the cycling and stationary surfaces predicted medians of 31,201 and 19,057 part/cm3.

32

Similar trends were observed for BC, with median exposures of 1,764 (personal), 1,799 (cycling)

33

and 1,469 ng/m3 (stationary).

2 ACS Paragon Plus Environment

Page 3 of 30

34

Environmental Science & Technology

TOC ART

35 36 37

1

38

Understanding the small-scale variations of Ultrafine Particles (UFP) and Black Carbon (BC) in

39

an urban area is crucial for accurate exposure assessment. However, the cost of sampling

40

equipment often puts a curb to large monitoring campaigns with numerous simultaneous

41

measurements. Land-use regression models (LUR) have been widely used to develop air

42

pollution exposure surfaces for various traffic-related air pollutants, which were then used in the

43

context of epidemiologic studies1–4. Early LUR models were developed for nitrogen dioxide

44

(NO2), primarily because of the ease of measuring NO2 using passive samplers and its common

45

use as a proxy for traffic-related air pollution. The recent availability of portable monitors for

46

UFP and BC has motivated the development of LUR models. Yet, the nature of the instruments

47

available to measure these two pollutants (e.g. high cost, power requirements, need for daily

48

maintenance), have made it difficult to conduct long-term measurements at fixed locations thus

49

giving rise to short-term monitoring campaigns. These campaigns have either taken place via

INTRODUCTION

3 ACS Paragon Plus Environment

Environmental Science & Technology

Page 4 of 30

50

monitoring at fixed sites5–11, or via mobile monitoring, involving measurements conducted in

51

motion9,12–18. In addition to allowing the investigation of a spatially larger area, the costs of

52

sampling via mobile monitoring can be reduced5,19,20. However, the duration of sampling at each

53

location (usually represented by a road segment) is shorter than the time typically spent at fixed

54

locations, and it is difficult to quantify the number of repeated measures necessary in order to

55

obtain a representative average for each road segment.

56

LUR models and associated exposure surfaces are intrinsically dependent on the data

57

collected, e.g. the sampling locations, repeated observations, and the pollutant concentrations

58

associated with them. The influence of the data collection protocol on LUR models is therefore a

59

topical question. With the emergence of mobile data collection approaches, the question of

60

whether to promote the spatial extent at the expense of the duration of sampling becomes

61

pertinent since the two approaches may lead to different models. Besides spatial extent and

62

sampling duration, the two approaches differ in the location of the measurements: mobile

63

sampling inherently occurs on the road while stationary sampling often occurs on the sidewalk.

64

In the case of dissimilar results, knowing which approach leads to a more representative surface

65

is crucial to guide future studies.

66

In the context of UFP and BC, we explored the influence of the sampling approach on

67

LUR models and associated exposure surfaces. We designed short-term stationary and mobile

68

monitoring campaigns, occurring simultaneously during summer 2016 in Toronto, Canada. LUR

69

models and exposure surfaces were derived for both pollutants and based on the two data

70

collection approaches separately. The results of the exposure surfaces were compared with short-

71

term outdoor personal exposures collected in a panel study that occurred during the same time

72

period.

4 ACS Paragon Plus Environment

Page 5 of 30

Environmental Science & Technology

73

2

74

2.1

75

Two simultaneous campaigns based on mobile and stationary protocols were conducted between

76

May and August 2016 in Toronto with the aim of optimising spatial coverage and temporal

77

coverage of a 7am to 7pm day.

METHODOLOGY Data collection

78

The short-term stationary monitoring campaign was based on 92 near-road sampling

79

locations (on the sidewalk, at the edge of the road), equally distributed between intersections and

80

mid-blocks. Each visit to a fixed point consisted of a 20-minute sampling duration, whereby

81

research assistants counted vehicles (i.e. passenger car, passenger truck, light commercial truck,

82

short haul truck, long haul truck, coach bus, school bus, transit bus, and street cars) going

83

through the intersection, or driving along the road. In contrast, the mobile data collection was

84

performed using bikes. Ten routes of 24 to 31 km each were designed, passing through most of

85

the fixed points, and representing a total of 270 km and 3,895 unique road segments (FIGURE

86

S1).

87

Each day was divided into three time blocks of 4 hours each (7am-11am, 11am-3pm, and

88

3pm-7pm). Measurements along biking routes and fixed points were conducted at least once per

89

time block for a minimum of five repetitions. We randomized the direction of the routes and the

90

order of fixed points sampling, as well as the starting time. More detail on the campaign design

91

is available in the supplemental information (SI).

92

2.2

93

Miniature Diffusion Size Classifiers (DiSCMini) and Microaethalometers (Microaeth model

94

AE51) were used to collect UFP and BC concentrations with time resolutions of 1 and 30

Instrumentation

5 ACS Paragon Plus Environment

Environmental Science & Technology

Page 6 of 30

95

seconds respectively. Both instruments were carried in backpacks with a tube connected to them

96

and pointing outside of the bag.

97

Seven DiSCMinis and six Microaeths were used during the campaign. As such, all

98

instruments were collocated with a reference monitoring station (Particle Counter TSI 651 for

99

UFP and Aethalometer AE 633 for BC) by concomitantly sampling air from the same outdoor

100

source. We repeated this process before and after the campaign, over more than 48h each time.

101

The purpose of this collocation was to select a reference DiSCMini and a reference Microaeth

102

based on the Pearson correlation coefficients between the simultaneous measurements of all the

103

instruments. The DiSCMini and the Microaeth with the highest average correlation were chosen

104

as the reference, and the other instrument measurements were corrected based on simple linear

105

regressions between the reference measurements and the measurements to correct. Records from

106

the entire summer were then corrected based on the linear regressions determined.

107

Garmin 800 GPS and the GPS mobile application Strava were used for the mobile

108

monitoring campaign and they were set up to record GPS coordinates every second.

109

2.3

110

The BC measurements were corrected with the Optimized Noise-Reduction Algorithm (ONA)

111

developed by the U.S. EPA to improve BC measurements using Aethalometers21.

Data processing

112

We also attempted to adjust UFP and BC measurements for daily variations in air quality.

113

A fixed station located at street level next to a downtown university building recorded hourly

114

UFP and BC concentrations. We used these measurements to develop a correction factor similar

115

to other studies5,18. This correction factor was specific to each day, as shown in equation (1). All

116

measurements were multiplied by the correction factor (after correction by collocation, and ONA

117

for BC). 6 ACS Paragon Plus Environment

Page 7 of 30

Environmental Science & Technology

118 Median UFP ሺor BCሻ concentration over the campaign

119

Correction Factor = Median UFP ሺor BCሻ concentration over the day of measurement

(1)

120

The fixed points and cycling databases were then developed separately. For each visit to

121

a fixed point, the 20 minutes of sampling were averaged. Meteorological variables (temperature,

122

relative humidity, wind speed) corresponding to the measurement time and day were retrieved

123

from the Toronto International Airport station. Traffic data were also stored. Subsequently, all

124

visits to a same fixed point were averaged, providing average UFP and BC concentrations, and

125

average meteorological conditions and traffic counts encountered at that point.

126

For the cycling measurements, a few GPS records were sometimes missing due to signal

127

losses. We therefore linearly interpolated the missing coordinates. The second step was to

128

associate each GPS point with a road segment of the bikeways network provided by the City of

129

Toronto on its Open Data portal (June 2016), which includes the road network of the city to

130

which cycle tracks and trails not accessible by vehicles were added. For this step, we first drew

131

the prescribed biking routes on ArcMap 10.4.1 before snapping the GPS points to the closest

132

road segment of the route drawn to limit the potential mismatch of points with road segments.

133

Then, UFP and BC values were associated with each GPS point based on the time stamp. BC

134

concentrations were recorded every 30 seconds as an average of the 30 previous seconds. We

135

therefore associated a same BC concentration with the 30 corresponding seconds of sampling.

136

Hourly meteorological variables were also recorded. We first averaged UFP and BC

137

concentrations, and meteorological variables by visit to a segment. Subsequently, we averaged

138

all visits to a road segment in order to obtain unique UFP and BC values, and meteorological

139

variables. Our final database consisted of 3,023 rows, equivalent to the number of road segments

140

covered at least once during the campaign. 7 ACS Paragon Plus Environment

Environmental Science & Technology

Page 8 of 30

141

2.4

142

Built environment and land use variables were computed using ArcMap 10.4.1 for all the fixed

143

points and road segments (TABLE S5).

Land use and built environment characteristics

144

First, we calculated distance variables as the distance between the point or the segment

145

and a specific feature: distance to the shore, to the Central Business District (CBD) and to the

146

closest airport, rail line, highway, major road, as well as the nearest Nitrogen Oxides (NOx) and

147

Particulate Matter (PM) emitting chimney. Other variables computed were within buffers of

148

sizes 50m, 100m, 200m, 300m, 500m, 750m and 1000m created around each fixed point or road

149

segment, and normalized by the area of the buffer. Areas of building footprint and of

150

commercial, governmental, industrial, parks, residential, open area, and waterbody land uses

151

were computed, as well as lengths of highway, major road, road and bus line. The number of

152

intersections, bus stops, trees, NOx and PM emitting chimneys within the buffers were also

153

calculated. The number of inhabitants within the 500m, 750m and 1000m buffers was

154

determined. The average and the maximum building height were determined within buffers of

155

25m, 50m and 100m. Finally, we computed the average hourly traffic volume within the various

156

buffers.

157

Land use, building footprint, road network, rail lines, and aerodrome shapefiles were

158

extracted from the DMTI Spatial Inc. Database (2014). We retrieved the rail line, bus routes, bus

159

stops, trees and building height shapefiles from the City of Toronto Open Data portal (June

160

2016). We also downloaded on that portal the Neighbourhood shapefile and the Demographics

161

data of 2011 that we combined to determine the number of inhabitants within each

162

neighbourhood, a homogenous distribution of the inhabitants within each neighbourhood being

163

hypothesized. The shapefiles locating the NOx and PM emitting chimneys were downloaded

8 ACS Paragon Plus Environment

Page 9 of 30

Environmental Science & Technology

164

from the National Pollutant Release Inventory (NPRI) website of the Government of Canada.

165

Finally, average traffic was generated between the hours of 6am and 7pm using EMME, a traffic

166

assignment model for the Greater Toronto and Hamilton Area (GTHA). The Origin-Destination

167

(OD) matrix of trips from the Transportation Tomorrow Survey (year 2011) was used and a

168

static traffic assignment method was employed to generate a traffic volume on the roads of the

169

GTHA.

170

2.5

171

We developed separate UFP and BC LUR models based on the fixed points and cycling data

172

using the programming language R and its statistical package. The normal logarithms of the UFP

173

and BC concentrations were computed.

Land use regression models

174

Two procedures based on the literature were tested and applied to both the temporally

175

and non-temporally adjusted UFP and BC concentrations. The first procedure, later called the

176

forward model, was based on the methodology presented by Eeftens et al.22, which was also

177

followed by Kerckhoffs et al.9 and Sabaliauskas et al.16. The steps of the algorithm were as

178

follows: 1) The potential predictors were sorted based on the adjusted R2 of the simple linear

179

regression between the predictor and ln(BC) or ln(UFP). 2) When considering non-temporally

180

adjusted pollutant concentrations, the initial model consisted in the best combination of

181

meteorological variables identified, i.e. the combination providing the highest adjusted R2. When

182

considering temporally adjusted pollutant concentrations, no predictor was added to the initial

183

model. 3) The variables were added one at a time to the initial model based on the order

184

determined in step 1. Variables were kept only if including them increased the adjusted R2 by

185

more than 1%, if the sign of the new variable in the model was meaningful, and if adding the

186

new variable did not alter the signs of the variables already in the model. At this step, we 9 ACS Paragon Plus Environment

Environmental Science & Technology

Page 10 of 30

187

checked whether the new variable was from the same category as a predictor already included in

188

the model (i.e. same variable but different buffer). If it was the case, we tested which variable

189

was more appropriate by keeping one of the variables and checking the one that led to the

190

highest increase in the adjusted R2. Also, variables that were eliminated in a previous step were

191

always tested again. 4) Once an intermediate model was reached, i.e. when all variables had been

192

tested and adding any of them did not improve the adjusted R2, we removed one by one the

193

variables that had a p-value higher than 0.1 starting with the variable with the highest p-value. 5)

194

We checked the Variance Inflation Factor (VIF) of the variables and removed the variables with

195

a VIF higher than 3, starting with the variable with the highest VIF.

196

The second procedure, called the backward model, was based on the paper of

197

Weichenthal et al.16. The steps followed are as follows: 1) The potential predictors were sorted

198

based on the p-value of the simple linear regression between the predictor and ln(BC) or

199

ln(UFP). 2) All the variables that were significant (i.e. p-value ≤ 0.05) were added to the initial

200

model. If two variables were highly correlated (i.e. Spearman correlation coefficient ≥ 0.8), then

201

the variable presenting the highest R2 in the univariate regression with the pollutant was kept. 3)

202

Variables in the regression with a coefficient sign opposed to what was expected were removed.

203

4) Variables that were not statistically significant were removed one at a time and only if doing

204

so did not increase the Root Mean Square Error (RMSE) by more than 1%. 5) The non-

205

significant variables remaining (i.e. p-value higher than 0.1) were removed starting with the

206

variable with the highest p-value even if doing so increased the RMSE by more than 1%.

207

In addition, we performed a 100-fold cross-validation. We randomly selected 90% of the

208

points or road segments, and applied to them the forward and backward models developed

209

previously to determine new coefficients. We then applied the new model to the 10% hold-out

10 ACS Paragon Plus Environment

Page 11 of 30

Environmental Science & Technology

210

sample, and calculated the RMSE and Pearson correlation coefficient between the predicted and

211

measured concentrations of this sample. We repeated this 100 times. We based our model choice

212

on the adjusted R2 of the model, as well as on the median adjusted R2 of the validation models

213

(i.e. models developed with 90% of the points) and on the median RMSE and Pearson

214

correlation coefficients of the hold-out samples.

215

After choosing our final fixed point model, we added the proportion of trucks (calculated

216

as the number of short and long haul trucks divided by the total number of vehicles) as a

217

predictor to see its impact on the adjusted R2 of the model, although it could not be used for

218

predictive purposes since the traffic data we are using only includes a total traffic volume.

219

Finally, we developed models based only on the road segments within 100m around the

220

fixed points. The aim was to compare the predictors included in these models with the predictors

221

included in the fixed point models.

222

2.6

223

To develop exposure surfaces, we divided the city of Toronto into grid cells of 100m by 100m

224

and computed the predictors for the mid point of each cell. We calculated the predicted pollutant

225

concentrations for each mid-point based on the final LUR models and associated each value with

226

the corresponding cell to generate surfaces. We slightly modified the area of Toronto considered

227

for the development of the surfaces (FIGURE S2). Indeed, the International airport of Toronto is

228

located West of the city; however, since we did not conduct measurements close to the airport

229

(due to the nature of our monitoring campaign being based on pedestrian and cycling

230

measurements), we only predicted levels up to the extent of our sampling coverage.

Exposure surfaces and comparison with panel study

231

The surfaces were not intended to predict short-term exposures; however, to explore their

232

differences, they were compared with data from a panel study also conducted in Toronto during 11 ACS Paragon Plus Environment

Environmental Science & Technology

Page 12 of 30

233

the same time period as the cycling and stationary campaigns (from May to August 2016).

234

During this period, we collected data from a total of 62 visits by various participants, each visit

235

consisting in one day extending from 9am to 4pm. Several visits could take place during the

236

same day if more than one participant was recruited. The panel study was designed at the same

237

time as the stationary and cycling campaigns. The participants were equipped with DiSCMinis,

238

Microaeths and GPS during six hours each day and were asked to spend at least two hours

239

outdoors. For the current study, only the outdoor personal exposures were considered. We

240

determined the average outdoor UFP and BC exposure that each participant was exposed to

241

during the study (personal exposure). A participant's outdoor trajectory was then intersected with

242

the exposure surfaces to derive mobility-based exposures. The personal and mobility-based

243

exposures were compared to identify the extent of bias in short-term exposure estimates

244

generated by the various surfaces.

245

3

246

3.1

247

Hourly UFP and BC concentrations recorded by the DiSCMinis and Microaeths exhibited very

248

similar trends compared to the measurements of the reference station. However, the ranges could

249

vary a lot for UFP; the highest difference between two DiSCMini measurements was 21,000

250

particles/cm3 (noted part/cm3). The Pearson correlation coefficients between the DiSCMinis

251

ranged from 0.36 to 0.98 with an average of 0.82. The chosen reference DiSCMini had

252

correlation coefficients of 0.75 to 0.98 with the six other instruments. As for the Microaeths, the

253

differences between two instruments were low, especially during the second colocation when the

254

largest hourly averaged BC concentration difference between two instruments was less than 100

255

ng/m3. The correlation coefficients between the instruments ranged from 0.71 to 0.99 for an

RESULTS Instrument performance

12 ACS Paragon Plus Environment

Page 13 of 30

Environmental Science & Technology

256

average of 0.90. For these reasons, we did not correct the BC measurements. More detail is

257

provided in the SI.

258

3.2

259

The average number of visits per fixed point was 5.1, i.e. an average of 102 minutes of sampling.

260

Concerning the cycling campaign, the mean number of visits to each road segment was 6, the

261

minimum and maximum were 2 and 15 respectively, corresponding to an average of 121 seconds

262

per segment (0.8 sec/m).

Descriptive statistics

263

The UFP and BC measurements at the fixed points were similar to the measurements by

264

bike (FIGURE S5), and the distributions were all approximately lognormal. The median UFP

265

concentrations were 20,867 and 19,756 part/cm3, and the median BC concentrations were 1,155

266

and 1,223 ng/m3 for the fixed points and cycling campaigns, respectively. Interquartile ranges

267

were comparable as well, but the extreme values were higher for the cycling records. The

268

maximum UFP concentration for a fixed point was 61,255 part/cm3, while it was 241,547

269

part/cm3 for a road segment (averages across all visits). Similarly, the maximum BC

270

concentration for a fixed point was 7,268 ng/m3, and 39,152 ng/m3 for a road segment.

271

We compared average UFP and BC concentrations at fixed points and on the portions of

272

road segments within buffers of 100m around the fixed points (FIGURES S6 and S7). The

273

median levels on road segments were in general higher than at fixed points. The median UFP

274

concentrations at fixed points and on road segments were 23,556 and 28,304 part/cm3, while BC

275

levels were 1,148 and 1,628 ng/m3. Pearson correlation coefficients between measurements at

276

fixed points and on road segments within a 100m buffer around them (45 fixed points and an

277

average of 212 m of road segments sampled around each point) were 0.07 and -0.11 for UFP and

278

BC, indicating no correlation. 13 ACS Paragon Plus Environment

Environmental Science & Technology

Page 14 of 30

279

3.3

280

Using the meteorological variables to adjust for temporal variability proved to be more effective

281

than using the data from the fixed monitoring station, with higher adjusted R2 and lower RMSE.

282

We opted for the forward procedure for all models (more details in the SI).

LUR models

283

TABLE 1 presents the LUR models. For UFP, the two models (fixed vs. mobile

284

monitoring) present similar adjusted R2 (0.405 and 0.430 for the fixed point and mobile models).

285

Including the proportion of trucks in the fixed point model increased the adjusted R2 from 0.405

286

to 0.467. The adjusted R2 differed more distinctly between the BC fixed point and cycling

287

models (0.53 and 0.432 respectively). However, the proportion of trucks did not have an

288

important effect on the fixed point BC model as the adjusted R2 increased from 0.525 to 0.532.

289

There are few common variables between the fixed point and cycling models for the two

290

pollutants. Temperature and relative humidity are positively associated with UFP or BC, while

291

wind speed is negatively associated. Several times, the buffer size employed is different between

292

two models. For instance, traffic count is present in both BC models, but within a 750m buffer in

293

the fixed point model and within a 100m buffer in the cycling model. This is also the case for the

294

number of bus stops, the length of major roads, and the area of water. All coefficient signs are

295

consistent among the four models, except for the area of water. More small-scale traffic variables

296

(i.e. distances or predictors within buffers smaller than 500m) are included in the cycling models.

297

The results of the cross validations are available in the SI. The cycling models developed

298

with the road segments within 100m around fixed points were different from the fixed point

299

models (TABLE S16). This is not surprising as the measurements at fixed points and on the

300

surrounding road segments were not correlated.

14 ACS Paragon Plus Environment

Page 15 of 30

Environmental Science & Technology

301

The models developed based on the fixed points were used to predict concentrations at

302

the road segments sampled during the cycling campaign to investigate transferability of the

303

variables. Inversely, the cycling models were also used to predict concentrations at the fixed

304

points. The R2 of the linear regression between the predicted and the measured concentrations

305

ranged between 0.0182 and 0.0888, indicating poor transferability of the variables and

306

coefficients.

307

15 ACS Paragon Plus Environment

Environmental Science & Technology

308

Page 16 of 30

TABLE 1 UFP and BC LUR Models (Standardized Coefficients) for the Fixed Points and Cycling Data ln(UFP)

ln(BC)

Fixed points

Cycling

Fixed points

Cycling

Adjusted R²

0.405

0.430

0.525

0.434

Temperature

2.96E-01***

1.76E-01*

7.64E-02***

2.26E-01**

Relative Humidity Wind Speed

-1.94E-01*

-2.60E-01***

7.11E-02**

Number of bus stops within 50m 2.21E-01*

Number of bus stops within 100m Number of bus stops within 300m

1.89E-01* 2.09E-01*

Length of bus line within 100m Distance to the CBD

1.00E-01*** -2.37E-01***

3.88E-01** -4.17E-01 ***

Distance to the closest airport

-1.29E-01***

Distance to the closest major road -1.44E-01***

Distance to the shore Length of highway within 1000m

2.77E-01*

Number of intersections within 750m

3.33E-01** 8.84E-02***

Length of major roads within 200m Length of major roads within 500m

2.51E-01*

Open area within 1000m

-2.14E-01*

Population within 500m

1.77E-01.

Area of water within 50m

1.75E-01.

2.79E-01*** -1.80E-01***

Area of water within 750m

-2.79E-01*** 1.96E-01*

Commercial area within 500m 1.49E-01***

Traffic within 100m

1.57E-01*** 4.47E-01***

Traffic within 750m 1.05E-01***

Number of trees within 750m

6.93E-02***

Building footprint area within 500m 8.72E-02***

Maximum building height within 50m

-1.19E-01***

Park area within 500m

1.32E-01***

309

Length of roads within 500m Number of NPRI NOx chimneys within 200m Significance codes: 0 ‘***’, 0.001 ‘**’,0.01 ‘*’, 0.05 ‘.’

310

3.4

311

FIGURE 1 presents the four surfaces derived from the LUR models. They all look dissimilar,

312

which is confirmed by the weak correlations between them. The two BC surfaces have the

313

highest correlation, which is 0.49, but the two UFP surfaces only have a correlation coefficient of

7.29E-02***

Exposure surfaces

16 ACS Paragon Plus Environment

Page 17 of 30

Environmental Science & Technology

314

0.232. The two cycling surfaces (for BC and UFP) and the two fixed point surfaces are

315

moderately correlated, with Pearson correlation coefficients of 0.63 and 0.49 respectively. Major

316

roads are more visible in the two cycling surfaces, in light of the variables included in the cycling

317

LUR models.

318

FIGURE 2 compares the predictions of the surfaces to the panel study data. The

319

participants spent on average 4 hours outdoors during each visit (TABLE S17), between 9am and

320

4pm. The median UFP outdoor exposures derived from the combination of GPS information and

321

exposure surfaces were different from the personal exposures: 26,344 (personal), 31,201 (cycling

322

model) and 19,057 part/cm3 (fixed point model), and similar results were obtained for BC, with

323

median exposures of 1,764 (personal), 1,799 (cycling) and 1,469 ng/m3 (fixed point). For BC, the

324

cycling surface provides closer results while the mobility-based exposure derived from the fixed

325

point surface is lower. For UFP, the cycling surface tends to overestimate the exposure while the

326

fixed point surface tends to underestimate them. In general, both mobility-based methods seem

327

to correctly capture central tendencies of exposure. However, the maximum UFP and BC levels

328

recorded via personal outdoor monitoring are higher than those derived from the surfaces,

329

because personal outdoor monitoring captures very punctual events such as a bus or a truck

330

passing by, which can drastically affect the range of observed concentrations. Additional

331

comparisons suggesting low correlations are available in the SI (FIGURES S20 and S21).

332

17 ACS Paragon Plus Environment

Environmental Science & Technology

Page 18 of 30

(a) UFP surface derived from the LUR model based on the fixed points campaign

(b) UFP surface derived from the LUR model based on the cycling campaign

18 ACS Paragon Plus Environment

Page 19 of 30

Environmental Science & Technology

(c) BC surface derived from the LUR model based on the fixed points campaign

333

(d) BC surface derived from the LUR model based on the cycling campaign FIGURE 1 UFP and BC exposure surfaces obtained from the LUR models based on fixed points and cycling.

19 ACS Paragon Plus Environment

Environmental Science & Technology

Page 20 of 30

80,000 70,000

UFP (part/cm³)

60,000 50,000 40,000 30,000 20,000 10,000 0 Personal measurements

Cycling surface

Fixed points surface

(a) UFP (45 personal measurements) 6,000

5,000

BC (ng/m³)

4,000

3,000

2,000

1,000

0 Personal measurements

334 335 336 337

Cycling surface

Fixed points surface

(b) BC (62 personal measurements) FIGURE 2 Comparison of the UFP and BC personal exposures with mobility-based exposures predicted by the fixed point and cycling surfaces (the lower and upper whiskers correspond to the minimum and maximum, the box is drawn between the first and the third quartile with the middle line corresponding to the median; axes are truncated).

338

20 ACS Paragon Plus Environment

Page 21 of 30

Environmental Science & Technology

339

4

340

In this study, we developed LUR models based on short-term mobile and stationary monitoring

341

campaigns. The mobile campaign was conducted on bicycle while the stationary campaign was

342

conducted based on short-term sidewalk measurements. The predictors captured in the LUR

343

models based on each sampling campaign are different and exposure surfaces resulting from the

344

LUR predictions are dissimilar. The adjusted R2 of the UFP LUR were 0.405 and 0.430 for the

345

fixed point and cycling models, and 0.525 and 0.434 for the BC fixed point and cycling models,

346

respectively. These values fall within the range of R2 presented in the literature. TABLES S20

347

and S21 summarise the characteristics of various UFP and BC short-term monitoring campaigns

348

and LUR models published in the recent literature. For UFP, values for the R2 range between

349

0.23 and 0.85 for LUR models based on fixed-site monitoring, while varying between 0.13 and

350

0.72 for LUR models based on mobile monitoring. For BC, they range between 0.28 and 0.86 for

351

fixed-site monitoring and 0.12 and 0.68 for mobile monitoring. Based on the literature, it seems

352

that fixed monitoring generally leads to higher LUR R2 than mobile monitoring for BC (TABLE

353

S21).

DISCUSSION

354

A review of recent LUR models developed for various cities (TABLES S20 and S21)

355

illustrates that the number of fixed sites (between less than 206 and 16011 for a single city) and

356

the duration of sampling (between 15 minutes5 and 90 minutes8) vary significantly across

357

studies. Moreover, the aggregation of measurements in the case of mobile campaigns varies

358

significantly, as the unit of analysis ranges from a road segment, extending between two

359

intersections, to segments of equal length. The number of aggregation locations varies between

360

about 10012 and more than 4,00023, depending on the extent of the area covered. The average

361

time spent at each location has been reported to be as low as 18 seconds9 and reaching more than

21 ACS Paragon Plus Environment

Environmental Science & Technology

Page 22 of 30

362

10 minutes18. During the cycling campaign, we spent on average 121 seconds on each road

363

segment, which is within the range of time reported in the literature. During our stationary

364

campaign, each fixed point was visited on average 102 minutes, comparable with the range of 15

365

to 90 minutes typical of most fixed monitoring campaigns.

366

In our study, extreme UFP and BC values measured on bicycles were more enhanced

367

than those measured at fixed points, which is related to the time spent at each location. At fixed

368

points, the average concentrations were almost always calculated based on 100 minutes of

369

sampling, whereas the averages of road segments sampled while cycling were computed based

370

on a very short duration, and were therefore highly influenced by the surrounding environment

371

(e.g. bus, construction work) when sampling. Comparing more specifically the concentrations on

372

the road segments surrounding the fixed points provides the same conclusion (FIGURES S6 and

373

S7). The mean concentrations are higher on road segments, and the maximum values are also

374

high.

375

Our findings are consistent with the results of Kerckhoffs et al.9 who reported cycling

376

measurements on average 1.7 and 2.1 times higher than stationary measurements for UFP and

377

BC respectively. However, the correlation between stationary and mobile measurements reported

378

by Kerckhoffs et al.9 (0.48) was higher than the one observed in this study. In the study of

379

Kerckhoffs et al.9, short-term stationary monitoring was performed in between mobile

380

measurements. Therefore, the comparison was done between measurements conducted very

381

close in time and in space. In light of the low correlation between stationary and mobile

382

measurements observed in our study, we can conclude that the data collection protocol in itself

383

(i.e. cycling on road vs standing on sidewalk) has a large influence on the data recorded, beyond

22 ACS Paragon Plus Environment

Page 23 of 30

Environmental Science & Technology

384

the averaging time. Few stationary measurements are typically performed in cars on the road side

385

while standing on the curbside is more commonly observed6,11.

386

Few variables are common to the fixed point and cycling models for both pollutants,

387

suggesting that measurements on roads and at fixed points do not capture the same

388

characteristics. The signs associated with some variables are worth noting. Although temperature

389

is generally negatively associated with UFP and BC concentrations14,24, it presents a positive

390

coefficient in all our models. We can explain the positive association between temperature and

391

UFP concentrations by nucleation events occurring when temperature is high, which is supported

392

by the peaks in UFP concentrations generally recorded at the beginning of the afternoon.

393

Distances to the shore and to the CBD appear in the cycling and fixed point UFP models with

394

negative and positive signs respectively. Several highways run along the shore, and two cycling

395

routes were designed in particular to capture this effect. On the other hand, trucks, which were

396

positively associated with UFP levels in the fixed point model, do not usually drive downtown

397

but mainly on highways located north of the city or along the shore and far from downtown,

398

explaining the positive association between UFP levels and distance to the CBD in the fixed

399

point model. Another interesting variable is the water area, which has a positive coefficient in the

400

UFP fixed point model but a negative coefficient in the UFP and BC cycling models, a

401

difference probably due to the distinct buffer sizes included in these models. Finally, in the

402

cycling model, the number of trees is positively associated with UFP concentrations. Several

403

studies have showed that vegetation could be positively or negatively associated with air

404

pollutant concentrations25–27. Indeed, the characteristics of the street (i.e. building height, street

405

width) combined with the type of vegetation (e.g. canopy density, crown diameter, trunk height)

23 ACS Paragon Plus Environment

Environmental Science & Technology

Page 24 of 30

406

can have a significant influence on the pollutants levels, by either increasing the turbulence or

407

decreasing the wind speed.

408

Weichenthal et al.15 and Sabaliauskas et al.12 developed LUR models for UFP in Toronto

409

based on measurements conducted in 2015. Many predictors included in the model from

410

Weichenthal et al.15 were related to small-scale traffic characteristics (i.e. distance to the

411

highway, distance to the major road, distance to the bus line, number of intersections, length of

412

bus line), while distance to the airport was also an important predictor. Both our UFP models,

413

and more specifically the cycling model, also include small-scale traffic variables, and distance

414

to the airport is a predictor in the cycling model. The predictors included in Sabaliauskas et al.12

415

model were related to the land use (e.g. population, industrial and residential areas), also

416

captured in our UFP LUR models via predictors such as open area, population, or water area.

417

All four surfaces are dissimilar, which raises concerns regarding their use for

418

epidemiologic studies. Major roads are highlighted in both cycling surfaces, because air pollutant

419

levels were collected directly on roads and were therefore affected by small-scale traffic

420

variables. In contrast, the fixed point surfaces are capturing hotspots at a larger spatial scale,

421

which is probably related to the nature of the monitoring approach, i.e. stationary locations being

422

more spaced. The models based on short-term stationary and mobile approaches developed by

423

Kerckhoffs et al.9 were similar in terms of predictors, and the concentrations they estimated

424

using their two models at 12,682 residential addresses were also similar. These observations are

425

different from the outcomes of our study, which highlights sensitivity of exposure surfaces to the

426

sampling protocol. In the study of Kerckhoffs et al.9, stationary measurements were carried out

427

in between mobile measurements at the same distance from traffic since a car was used for both

428

protocols. This is not the case for our study, since our study is not only capturing the difference

24 ACS Paragon Plus Environment

Page 25 of 30

Environmental Science & Technology

429

between mobile and stationary measurements but also the location of where mobile and

430

stationary sampling would normally occur: on the road vs. on the sidewalk. Together, these two

431

studies begin to unveil the complexity in the development of LUR surfaces for traffic-related air

432

pollution, highlighting the importance of testing their sensitivity prior to use in epidemiological

433

studies.

434

Our comparison with data from outdoor personal exposures demonstrated that using two

435

different surfaces can lead to different values for daily exposure. Our results suggest that the

436

mobile monitoring campaign tends to overestimate short-term exposures to UFP, while using the

437

fixed point surface tends to underestimate the short-term exposures to both UFP and BC. For

438

BC, the cycling surface was able to capture the distribution of outdoor personal exposures. For

439

UFP, despite over-predicting the mean exposure, the cycling surface was able to better capture

440

the distribution. Mobile sampling enables the coverage of wide areas, and we took advantage of

441

this when designing our campaign as we sampled 270 km of unique roads. However, as the

442

measurements are done on a small time scale, they are more susceptible to be influenced by

443

short-term events. When comparing short-term exposures, capturing these short-term peaks is

444

important. It is worth noting that in our study, personal exposures were collected between 9am

445

and 4pm, which could have an impact on the comparison with exposures estimated based on the

446

surfaces as the monitoring campaigns took place between 7am and 7pm.

447

The development of exposure surfaces using data collected during short-term sampling

448

campaigns have gained momentum in recent years, primarily in light of advances in air pollution

449

monitoring technology. Small portable instruments have enabled the development of data

450

collection protocols that involve mobile sampling by foot, by bike, or by cars fitted with various

451

sensing devices. The development of inexpensive air pollution sensors will further enable these

25 ACS Paragon Plus Environment

Environmental Science & Technology

Page 26 of 30

452

protocols and might even lead to ad-hoc data collection means through participatory air pollution

453

sensing. The question of robustness of LUR surfaces will become of utmost importance since the

454

streams of data collected are expected to become more diverse, including various locations in an

455

urban area: on-road, on the sidewalks, or at residential locations. There is a need to further

456

investigate the effect of the data collection protocol on exposure surfaces and evaluate the

457

sensitivity of LUR models.

458 459

ACKNOWLEDGMENTS

460

The authors thank Ryan Kulka, Xinli Tu, Marie-Sophie Wint, Kuruparam Satkunanathan,

461

Tszchun Chow, Tian Kang, James Mario and Teresa Li for their great help in collecting data.

462

SUPPORTING INFORMATION

463

The SI contains information about 1) the design of the campaigns, 2) the LUR predictors, 3) the

464

correction of the monitors, 4) the data collected, 5) the choice of the models, 6) the comparison

465

of the exposures, 7) the characteristics of various UFP and BC monitoring campaigns and LUR

466

models.

467

REFERENCES

468

(1)

Aguilera, I.; Dratva, J.; Caviezel, S.; Burdet, L.; De Groot, E.; Ducret-Stich, R. E.;

469

Eeftens, M.; Keidel, D.; Meier, R.; Perez, L.; et al. Particulate matter and subclinical

470

atherosclerosis: Associations between different particle sizes and sources with carotid

471

intima-media thickness in the SAPALDIA study. Environ. Health Perspect. 2016, 124

472

(11), 1700–1706.

473

(2)

Clark, N. A.; Demers, P. A.; Karr, C. J.; Koehoorn, M.; Lencar, C.; Tamburic, L.; Brauer, 26 ACS Paragon Plus Environment

Page 27 of 30

Environmental Science & Technology

474

M. Effect of early life exposure to air pollution on development of childhood asthma.

475

Environ. Health Perspect. 2010, 118 (2), 284.

476

(3)

Sahsuvaroglu, T.; Jerrett, M.; Sears, M. R.; McConnell, R.; Finkelstein, N.; Arain, A.;

477

Newbold, B.; Burnett, R. Spatial analysis of air pollution and childhood asthma in

478

Hamilton, Canada: comparing exposure methods in sensitive subgroups. Environ. Heal.

479

2009, 8 (1), 14.

480

(4)

Crouse, D. L.; Peters, P. A.; Hystad, P.; Brook, J. R.; van Donkelaar, A.; Martin, R. V.;

481

Villeneuve, P. J.; Jerrett, M.; Goldberg, M. S.; Arden Pope, C.; et al. Ambient PM2.5, O3,

482

and NO2 exposures and associations with mortality over 16 years of follow-up in the

483

canadian census health and environment cohort (CanCHEC). Environ. Health Perspect.

484

2015, 123 (11), 1180–1186.

485

(5)

Rivera, M.; Basagaña, X.; Aguilera, I.; Agis, D.; Bouso, L.; Foraster, M.; Medina-Ramón,

486

M.; Pey, J.; Künzli, N.; Hoek, G. Spatial distribution of ultrafine particles in urban

487

settings: A land use regression model. Atmos. Environ. 2012, 54, 657–666.

488

(6)

Saraswat, A.; Apte, J. S.; Kandlikar, M.; Brauer, M.; Henderson, S. B.; Marshall, J. D.

489

Spatiotemporal Land Use Regression Models of Fine, Ultrafine, and Black Carbon

490

Particulate Matter in New Delhi, India. Environ. Sci. Technol. 2013, 47 (22), 12903–

491

12911.

492

(7)

Ghassoun, Y.; Ruths, M.; Löwner, M. O.; Weber, S. Intra-urban variation of ultrafine

493

particles as evaluated by process related land use and pollutant driven regression

494

modelling. Sci. Total Environ. 2015, 536, 150–160.

495 496

(8)

Montagne, D. R.; Hoek, G.; Klompmaker, J. O.; Wang, M.; Meliefste, K.; Brunekreef, B. Land Use Regression Models for Ultrafine Particles and Black Carbon Based on Short-

27 ACS Paragon Plus Environment

Environmental Science & Technology

Page 28 of 30

497

Term Monitoring Predict Past Spatial Variation. Environ. Sci. Technol. 2015, 49 (14),

498

8712–8720.

499

(9)

Kerckhoffs, J.; Hoek, G.; Messier, K. P.; Brunekreef, B.; Meliefste, K.; Klompmaker, J.

500

O.; Vermeulen, R. Comparison of Ultrafine Particle and Black Carbon Concentration

501

Predictions from a Mobile and Short-Term Stationary Land-Use Regression Model.

502

Environ. Sci. Technol. 2016, 50 (23), 12894–12902.

503

(10)

Wolf, K.; Cyrys, J.; Harciníková, T.; Gu, J.; Kusch, T.; Hampel, R.; Schneider, A.; Peters,

504

A. Land use regression modeling of ultrafine particles, ozone, nitrogen oxides and

505

markers of particulate matter pollution in Augsburg, Germany. Sci. Total Environ. 2017,

506

579, 1531–1540.

507

(11)

van Nunen, E.; Vermeulen, R.; Tsai, M.-Y.; Probst-Hensch, N.; Ineichen, A.; Davey, M.;

508

Imboden, M.; Ducret-Stich, R.; Naccarati, A.; Raffaele, D.; et al. Land Use Regression

509

Models for Ultrafine Particles in Six European Areas. Environ. Sci. Technol. 2017, 51 (6),

510

3336–3345.

511

(12)

Sabaliauskas, K.; Jeong, C. H.; Yao, X.; Reali, C.; Sun, T.; Evans, G. J. Development of a

512

land-use regression model for ultrafine particles in Toronto, Canada. Atmos. Environ.

513

2015, 110, 84–92.

514

(13)

Patton, A. P.; Zamore, W.; Naumova, E. N.; Levy, J. I.; Brugge, D.; Durant, J. L.

515

Transferability and generalizability of regression models of ultrafine particles in urban

516

neighborhoods in the boston area. Environ. Sci. Technol. 2015, 49 (10), 6051–6060.

517

(14)

Farrell, W.; Weichenthal, S.; Goldberg, M.; Valois, M. F.; Shekarrizfard, M.;

518

Hatzopoulou, M. Near roadway air pollution across a spatially extensive road and cycling

519

network. Environ. Pollut. 2016, 212, 498–507.

28 ACS Paragon Plus Environment

Page 29 of 30

520

Environmental Science & Technology

(15)

Weichenthal, S.; Van Ryswyk, K.; Goldstein, A.; Shekarrizfard, M.; Hatzopoulou, M.

521

Characterizing the spatial distribution of ambient ultrafine particles in Toronto, Canada: A

522

land use regression model. Environ. Pollut. 2016, 208, 241–248.

523

(16)

Weichenthal, S.; Ryswyk, K. Van; Goldstein, A.; Bagg, S.; Shekkarizfard, M.;

524

Hatzopoulou, M. A land use regression model for ambient ultrafine particles in Montreal,

525

Canada: A comparison of linear regression and a machine learning approach. Environ.

526

Res. 2016, 146, 65–72.

527

(17)

Hankey, S.; Marshall, J. D. Land Use Regression Models of On-Road Particulate Air

528

Pollution (Particle Number, Black Carbon, PM2.5, Particle Size) Using Mobile

529

Monitoring. Environ. Sci. Technol. 2015, 49 (15), 9194–9202.

530

(18)

Larson, T.; Henderson, S. B.; Brauer, M. Mobile monitoring of particle light absorption

531

coefficient in an urban area as a basis for land use regression. Environ. Sci. Technol. 2009,

532

43 (13), 4672–4678.

533

(19)

Li, H. Z.; Dallmann, T. R.; Gu, P.; Presto, A. A. Application of mobile sampling to

534

investigate spatial variation in fine particle composition. Atmos. Environ. 2016, 142, 71–

535

82.

536

(20)

Klompmaker, J. O.; Montagne, D. R.; Meliefste, K.; Hoek, G.; Brunekreef, B. Spatial

537

variation of ultrafine particles and black carbon in two cities: Results from a short-term

538

measurement campaign. Sci. Total Environ. 2015, 508, 266–275.

539

(21)

Hagler, G. S. W.; Yelverton, T. L. B.; Vedantham, R.; Hansen, A. D. A.; Turner, J. R.

540

Post-processing method to reduce noise while preserving high time resolution in

541

aethalometer real-time black carbon data. Aerosol Air Qual. Res. 2011, 11 (5), 539–546.

542

(22)

Eeftens, M.; Beelen, R.; De Hoogh, K.; Bellander, T.; Cesaroni, G.; Cirach, M.; Declercq,

29 ACS Paragon Plus Environment

Environmental Science & Technology

Page 30 of 30

543

C.; Dedele, A.; Dons, E.; De Nazelle, A.; et al. Development of land use regression

544

models for PM2.5, PM 2.5 absorbance, PM10 and PMcoarse in 20 European study areas;

545

Results of the ESCAPE project. Environ. Sci. Technol. 2012, 46 (20), 11195–11205.

546

(23)

Farrell, W. J.; Deville Cavellin, L.; Weichenthal, S.; Goldberg, M.; Hatzopoulou, M.

547

Capturing the urban canyon effect on particle number concentrations across a large road

548

network using spatial analysis tools. Build. Environ. 2015, 92, 328–334.

549

(24)

Hatzopoulou, M.; Weichenthal, S.; Dugum, H.; Pickett, G.; Miranda-Moreno, L.; Kulka,

550

R.; Andersen, R.; Goldberg, M. The impact of traffic volume, composition, and road

551

geometry on personal air pollution exposures among cyclists in Montreal, Canada. J.

552

Expo. Sci. Environ. Epidemiol. 2013, 23 (1), 46–51.

553

(25)

554 555

Buccolieri, R.; Gromke, C.; Di Sabatino, S.; Ruck, B. Aerodynamic effects of trees on pollutant concentration in street canyons. Sci. Total Environ. 2009, 407 (19), 5247–5256.

(26)

Gromke, C.; Ruck, B. Influence of trees on the dispersion of pollutants in an urban street

556

canyon-Experimental investigation of the flow and concentration field. Atmos. Environ.

557

2007, 41 (16), 3287–3302.

558 559

(27)

Jin, S.; Guo, J.; Wheeler, S.; Kan, L.; Che, S. Evaluation of impacts of trees on PM 2 . 5 dispersion in urban streets. Atmos. Environ. 2014, 99, 277–287.

560

30 ACS Paragon Plus Environment