Case Studies
Research studies used as a basis for the AMOS project.
Last updated
Research studies used as a basis for the AMOS project.
Last updated
Sociodemographic information of people and households (HH) - scaled to 10%
Cleaning a. Original data
b. Spatial editing
c. cleaned
Entry HHi is multiplicated by weight wi (which indicates how many households a specific entry represents, use stochastic rounding for floats) ~ “copy” each household wi times. (Mind the sampling rate (direct or not.)
Income information is in the census - divided into bins
OSM, zones, schools (shapefiles, csv)
From zones in shapefiles (use geopandas, set current coordinate system crs, transform to different coord system), set zone id
Extract roads from OSM: x, y, purpose
Create “opportunities” - offer work, offer houses - transform geometries to a new coordinate reference system (geopandas)
Add schools to opportunities - transform geometries to a new coordinate reference system (geopandas)
a.
b.
From shapefile of zones - transform geometries to a new coordinate reference system (geopandas): geometry, zone_id
Cleaning (removing NaN, duplicates), remapping categories
Divide into two dataframes - person, trips
Remapping categories (work - employed, not employed, student; trip purpose - home, leisure, shop, work; mode - pt, car, car-passenger...)
Create point from home_coord for each person + remapping to correct coordinate system
a.
b.
Map home coords (points) of each person to zones (poly) -> home_zones
Generating areas a.
b.
Generating trips
origin and destination purpose (shop, work, ...), mode, zones, …
Remove trips from a place to the same place
Remove trips not starting at home, remove trips not ending at home
Calculate activity duration
Spatial join origin & destination coords with zones
Output
persons.csv
trips.csv
OD matrices An origin–destination matrix is a matrix in which each cell represents the number of trips from an origin zone (given by the corresponding row of the matrix) to a destination zone (column), or the percentage of trips starting in the origin zone that reach the destination zone. Those matrices can be created from the household travel survey. In this study, one weighted origin–destination matrix was generated for work trips.
Spatial zoning system
Census data
Large microsample of the population - 30% households in France
Commuting relations (flow matrix, moving for work and education purpose) between municipalities
Aggregated zonal information
Household income
Household travel survey
Detailed activity chain for one reference person (what activities, when, how the person moved between activities)
Locations at which activities can take place - address, coordinates,
Work - number of employees
Location of educational facilities
socio demographic information and residence locations of households and persons are generated - municipality or [area] is defined for each synthetic person
income information is added to each household
activity chains are attached to the synthetic persons - statistical matching based on the correlation of daily activity patterns and sociodem. attributes
places of work and education are assigned - primary location assignment, using commuting matrix
Correct number of people should commute from one municipality to another in the population
The commute distance should fit the assigned activity chain
the locations of all other activities in the persons’ activity chains are chosen
full anonymous census 2011 with household information
CSU natality and mortality data between 2011 and 2016
National HTS 2016
City HTS 2016
Zoning data
Clean raw travel survey data
Trips going in/out of the catchment area will start/end at the "city gates"
merge census and HTS - they use two different HTS data (2 sets of mandatory/preferred columns while matching with hot-deck)
Get facilities/build data - building purpose, activity sector (households, industry, agriculture, forestry, transportation, utilities, hospitality, administrative, public services, ...)
assign facilities to zones and classify them according to trip purposes
home and primary locations (work, education) - based on OD pairs from merged census and HTS and facility data
impute secondary locations
household travel survey ~ contains 84 889 samples which are weighted, so that the total weight sum amounts to 20 508 979, more or less the number of inhabitants in the area in 2017.
Additional facility data -
stochastic simulation of several demographic transition processes that update census 2011 data (based on and rates, residential mobility)