Introduction to GIS for Public Health
Introduction
Introduction
- Public health - demands a broader view!
- The need for participatory decision making in public health
- The transparency of open data science approach
The third upcoming area of research methods
- Observational
- Experimental
- Computational/data science/ML/AI, etc.
What information does GIS use?
- Data that defines geographical features like roads, rivers
- Soil types, land use, elevation
- Demographics, socioeconomic attributes
- Environmental, climate, air-quality
- Annotations that label features and places
What is (Geo)-Spatial Data Science?
- Analyse and extract insights from geospatial data
- Work with real-world data on a number of domains and problems
- Acquire key data science skills and important tools to answer spatial questions
A valuable tool for public health policy advocacy
The Process of Geospatial Data Science

Mininum skills for public health data science
Hard Skills - Programming Language - Transparency and Reproducibility
Soft Skills - Communication - Storytelling - Geospatial analytics - Ethical skills
Tools for Spatial Data Science (SDS)
Graphical User Interfaces (GUIs)
- QGIS and GRASS has revolutionized Open source Geographic Information Systems (GIS).
- However, the reproducibility aspect has many challenges
Command Line Interfaces (CLIs)
- R and
- Python
are a good way to bring in reproducible algorithms for GIS/SDS
The Geodata ‘revolution’
Advanced Hardware
- High-performance computer hardware
- Efficient algorithms to process vast data sets
Scalable Software
- Scalable solutions with the R
- extract valuable insights from the noise
Spatial Databases
- The advent of spatial databases
Healthcare Data
Traditionally data in healthcare are:
- Collected for the purpose (carefully designed)
- Detailed and informative (“rich profile and portraits”)
- High quality
However, they are:
- Massive enterprises (very costly)
- Coarse resolution (need to be aggregated to protect privacy)
- Slow - the more detailed, the less frequent they are available
New Forms of Spatial Data
Tied into the Geodata revolution
Accidental: created for different purposes but available for analysis as a side product
Diverse: resolution and quality but, potentially much more detailed in both space and time
Challenges (Arribas-Bel, 2014)
- Bias
- Technical barriers
- Methodological “mismatch”
(Geo)-Spatial Visualization
Spatial Visualization


By encoding information visually, they allow to present large amounts of numbers in a meaningful way.
A Map for Everyone
A Real Public Health Tool
- Maps can fulfill several needs, looking very different depending on the end-goal.
Three Main Dimensions
- Knowledge of what is being plotted
- Target audience
- Degree of interactivity
MacEachren & Kraak (1997)
MacEachren & Kraak (1997)

DiBiase’s (1990) “Swoopy”
Translating numbers into a (visual) language that the human brain “speaks better”

Exploratory Visualization
> “forces us to notice what we never expected to see”
(Tukey 1977: vi)
Mostly for researchers in the course of the research process.
Many, quick and dirty, and rather unattractive graphs.
Explanatory Visualization
> “forces readers to see the information the designer wanted to convey”
(Kosslyn 1994: 271)
Mostly for others after the research is completed.
Few, carefully crafted, and attractive graphs.
Choropleths
Thematic map in which values of a variable are encoded using a colour gradient of some sort
- Counterpart of the histogram
Both allows us to guage the distribution of a variable.
Spatial Weights
Spatial Weights
For a statistical method to be explicitly spatial, it needs to contain some representation of the geography, or spatial context.
(Geo)-Spatial Visualization:
- translating numbers into a (visual) language (colors) that the human brain can interpret.
Spatial Weights Matrices:
- translating geography into a (numerical) language that a computer can interpret.
Spatial Weight Matrices
Spatial Weights Matrices are building block for spatial analysis and statistics.
They are used to assign a weighted average or sum of neighbouring data values to an observation, or other point in space.
Relates to concepts of spatial ‘smoothing’ and interpolating data
Applications of Spatial weights
Spatial weights form the core element in several spatial analysis techniques
- Spatial autocorrelation
- Spatial clustering/geo-demographics
- Spatial regression
Spatial Heterogeneity
Most influential local determinants of household energy expenditure

Mashhoodi et al., 2019 (doi:10.1080/19475683.2018.1557253)
REDCAP and MLR
Late-stage breast cancer rates in Chicago region in 2000: (a) ZIP code areas, (b) REDCAP-constructed areas
::: aside Wang et al., 2015 (doi:10.1080/19475683.2019.1702099) :::
2SFCA
- Two-step floating catchment area (2SFCA) method
- Hospital potential crowdedness

Wang et al., 2018 (10.1080/19475683.2019.1702099)