Spatial Data Science with R
GIS and Public Health
Extremely useful in providing a fresh outlook to public health.
Provides opportunity to enable overlaying data with its spatial representation
Supports better planning and decision-making.
The convergence of many new sub-disciplines:
- medical geography
- public health informatics
- data science

Map of the plague in the province of Bari, Naples, 1690-1692
The map shows areas most affected and the boundaries of a military quarantine imposed to prevent its spread to neighboring towns and to other provinces.
Applications of GIS in Public Health
- disease surveillance
- environmental health
- infectious diseases
- mathematical modelling
- agent based modelling
- population genetics
- medical imagining
- cancer biology


While traditional uses of GIS in healthcare still are relevant, newer methods and advancing technology would be monumental for public health research.
What is Spatial Data Science?
Spatial data science (SDS) is a subset of Data Science that focuses on the unique characteristics of spatial data, moving beyond simply looking at where things happen to understand why they happen there.

Like data science, spatial data science seems to be a field that arises bottom-up in and from many existing scientific disciplines and industrial activities concerned with application of spatial data, rather than being a sub-discipline of an existing scientific discipline.
Edzer Pebesma, Roger Bivand - Spatial Data Science With Applications in R
How is it different from Data Science?

Why Spatial Data Science for Public Health?



Potential of Spatial Data Science for Public Health
Wealth of Spatial Data
70% of all data that is generated data has spatial attributes
Routine health data can be geo-referenced
Provide a gateway for researchers and practitioners to examine the role and harness the power of SDS in public health
Coupled with the emerging field of spatial statistics, the analysis of this location-based data is developing new and novel directions for public health.
Data Science as a methodological approach
The key word in data science is not data, it is science.
– Jeff Leek, JHU Data Science Lab
Reproducible Research

Reproducible Research
There are four key elements of reproducible research:
- data documentation
- data publication
- code publication,
- output publication.

Tools for Spatial Data Science
- GIS related
- Data Science related
- Spatial Data Science related
R is the best spatial data science tool available for public health !!!
R provides a range of powerful packages for geospatial analysis, enabling advanced computations and analytics.

R Spatial Analysis Ecosystem
R Spatial Learning Resources
Wealth of Resource material
Powerful tools/packages
seamlessly handle vector and raster data
inractive visualization
end-to-end solution

Newest addition: Spatial Data Science: With Applications in R














