In this topic we will explore spatial data in R in order to visualize the spatial aspect of porpoise distribution.Besides the generic R functions, there are many specialized packages to visualize and analyze spatial data.
Combining ggplot with the R packages that facilitate the handling of geospatial data, as maptools, rgdal, rgeos and mapproj, enables the construction of simple and complex maps.
library(maptools)
library(rgdal)
library(rgeos)
library(mapproj)
Using these packages we will construct a simple map of the Belgian Part of the North Sea (BPNS).
A common data format for geospatial data is the shapefile. In a shapefile, geospatial data are described as points, lines and polygons and can be analysed as such in GIS software. Marine Regions provides access to many shapefiles through the gazetteer. The folder Mapping contains some useful shapefiles.
list.files("Data day 1/Mapping")
[1] "banks" "Coastal banks" "eez" "Hinder banks"
[5] "netherlands_coast" "world_bay_gulf" "Zeeland banks"
The world_bay_gulf folder and file contains the shapefile of the Southern Bight. Below you can find three ways to import it into the R Environment. The readShapePoly() function is one of many to read spatial data. This particular function of the maptools package already specifies this particular shapefile is a polygon. The recommended readOGR() recognizes the type of spatial data automatically.
bight <- readOGR("Data day 1/Mapping/world_bay_gulf", layer = "world_bay_gulf")
OGR data source with driver: ESRI Shapefile
Source: "Data day 1/Mapping/world_bay_gulf", layer: "world_bay_gulf"
with 1 features
It has 5 fields
Looking at the class, structure and plot of the shapefile allows to better understand the data type.
class(bight)
[1] "SpatialPolygonsDataFrame"
attr(,"package")
[1] "sp"
str(bight)
plot(bight)
The fortify() function allows to easily convert an object of a spatial class to a regular dataframe. Though this transformation is not necessary for making a map in R, it is vital to map data in ggplot.
bightfort <- fortify(bight)
Regions defined for each Polygons
str(bightfort)
'data.frame': 25290 obs. of 7 variables:
$ long : num 4.99 4.99 4.97 4.97 4.96 ...
$ lat : num 52.4 52.4 52.4 52.4 52.4 ...
$ order: int 1 2 3 4 5 6 7 8 9 10 ...
$ hole : logi FALSE FALSE FALSE FALSE FALSE FALSE ...
$ piece: Factor w/ 64 levels "1","2","3","4",..: 1 1 1 1 1 1 1 1 1 1 ...
$ id : chr "0" "0" "0" "0" ...
$ group: Factor w/ 64 levels "0.1","0.2","0.3",..: 1 1 1 1 1 1 1 1 1 1 ...
Now, let’s start exploring these spatial data with ggplot! In order to plot the Southern Bight, we want a plot that connects the different points in our fortified data frame.
ggplot() + geom_path(data = bightfort, aes(x = long, y = lat))
This first try seems slightly demotivating! However, the issue is easily fixed! When fortifying your spatial data, the new data frame will group certain points. It is therefore important to maintain this grouping of the data, simply by telling ggplot group = group.
ggplot() + geom_path(data = bightfort, aes(x = long, y = lat, group = group))
Now, we can add more information on this plot.The Belgian EEZ for example.
eez <- readOGR("Data day 1/Mapping/eez/eez.shp")
eez <- readOGR("Data day 1/Mapping/eez/eez.shp")
eezfort <- fortify(eez)
eezfort <- fortify(eez)
ggplot() +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
So now we read the EEZ shapefile, transformed it to a dataframe and simply added it to our plot with geom_path(). Be careful always to define x as longitude and y as latitude!
With coord_map(), R recognizes the plot is a map and will scale the dimensions of the axex correctly.
ggplot() +
coord_map() +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
You can also define the limits of the axes with coord_map().
ggplot() +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
Themes in R allow for prettier maps! The black and white theme can be applied with theme_bw().
ggplot() +
theme_bw() +
theme(panel.background = element_rect(fill = "#0093b4"),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"),
axis.title = element_blank(),
axis.text = element_text(size = 16)) +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
Using theme(), a number of lay-out features of a plot can be determined. In the code above, we determined the background colour, removed the grid lines, removed the axis titles and changed the font size of the axis text. The numerous options for the use of the theme() function are documented here.
The background colour was defined here with a colour code. However, fill = “blue” would also work. More info on colours in R can be found in this useful cheatsheet and in this catalogue.
ggplot() +
theme_bw() +
theme(panel.background = element_rect(fill = "#0093b4"),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"),
axis.title = element_blank(),
axis.text = element_text(size = 16)) +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_polygon(aes(x=long, y=lat, group=group), data = bightfort, fill = "white") +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
When we define the colour (fill, possible confusion!) of the sea as “white”, we see there is an issue with Zeeland on our map. Apparently the shapefile does recognize the points of the peninsula to be part of the Southern Bight (and can draw a path between them), but it does not group them correctly. A possible solution involves the import of a file describing the coast of the Netherlands.
ggplot() +
theme_bw() +
theme(panel.background = element_rect(fill = "#0093b4"),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"),
axis.title = element_blank(),
axis.text = element_text(size = 16)) +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_polygon(aes(x=long, y=lat, group=group), data = bightfort, fill = "white") +
geom_polygon(aes(x=long, y=lat, group=group), data = netherlands_coastfort, fill = "#0093b4") +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group))
Land and sea are now in a different colour! However, the map is still quite sober. Next up: let’s add sand banks!
ggplot() +
theme_bw() +
theme(panel.background = element_rect(fill = "#0093b4"),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"),
axis.title = element_blank(),
axis.text = element_text(size = 16)) +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_polygon(aes(x=long, y=lat, group=group), data = bightfort, fill = "white") +
geom_polygon(aes(x=long, y=lat, group=group), data = netherlands_coastfort, fill = "#0093b4") +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group)) +
geom_polygon(aes(x=long, y=lat, group=group), data = banksfort, fill = "#016483")
Including sand banks in our map facilitates orientation and is also prettier. Note that the order of geom_ arguments matters: the banks are now on top of the EEZ. Let’s add some more sand banks!
ggplot() +
theme_bw() +
theme(panel.background = element_rect(fill = "#0093b4"),
panel.grid.major = element_line(linetype = "blank"),
panel.grid.minor = element_line(linetype = "blank"),
axis.title = element_blank(),
axis.text = element_text(size = 16)) +
coord_map(xlim = c(2,3.5), ylim = c(51,52)) +
geom_polygon(aes(x=long, y=lat, group=group), data = bightfort, fill = "white") +
geom_polygon(aes(x=long, y=lat, group=group), data = netherlands_coastfort, fill = "#0093b4") +
geom_path(data = bightfort, aes(x = long, y = lat, group = group)) +
geom_path(data = eezfort, aes(x = long, y = lat, group = group)) +
geom_polygon(aes(x=long, y=lat, group=group), data = banksfort, fill = "#016483") +
geom_polygon(aes(x=long, y=lat, group=group), data = bankszeelandfort, fill = "#016483") +
geom_polygon(aes(x=long, y=lat, group=group), data = bankscoastalfort, fill = "#016483") +
geom_polygon(aes(x=long, y=lat, group=group), data = bankshinderfort, fill = "#016483")
We now have a simple map of the BPNS (with some nice VLIZ colours)! This code for making the map is neither the only, neither the best way, but it provides a simple outline for future reference!
We can join all this code in a function to load all shape files and plot the map.
load_shapes_plot_map() now makes our map with only one line of code!
load_shapes_plot_map()
You can also load all shapefiles in the environment and construct the map seperately. This might be the favoured option when you plot several maps.
bankscoastal <- readOGR("Data day 1/Mapping/Coastal banks/banks.shp")
bankscoastal <- readOGR("Data day 1/Mapping/Coastal banks/banks.shp")
bankshinder <- readOGR("Data day 1/Mapping/Hinder banks/banks.shp")
bankshinder <- readOGR("Data day 1/Mapping/Hinder banks/banks.shp")
netherlands_coast <- readOGR("Data day 1/Mapping/netherlands_coast/world_countries_coasts.shp")
bankscoastal <- readOGR("Data day 1/Mapping/Coastal banks/banks.shp")
bankshinder <- readOGR("Data day 1/Mapping/Hinder banks/banks.shp")
netherlands_coast <- readOGR("Data day 1/Mapping/netherlands_coast/world_countries_coasts.shp")
eezfort <- fortify(eez)
eezfort <- fortify(eez)
banksfort <- fortify(banks)
banksfort <- fortify(banks)
bightfort <- fortify(bight)
bightfort <- fortify(bight)
bankszeelandfort <- fortify(bankszeeland)
netherlands_coast <- readOGR("Data day 1/Mapping/netherlands_coast/world_countries_coasts.shp")
eezfort <- fortify(eez)
banksfort <- fortify(banks)
bightfort <- fortify(bight)
bankszeelandfort <- fortify(bankszeeland)
bankscoastalfort <- fortify(bankscoastal)
bightfort <- fortify(bight)
bankszeelandfort <- fortify(bankszeeland)
bankscoastalfort <- fortify(bankscoastal)
bankshinderfort <- fortify(bankshinder)
banksfort <- fortify(banks)
bightfort <- fortify(bight)
bankszeelandfort <- fortify(bankszeeland)
bankscoastalfort <- fortify(bankscoastal)
bankshinderfort <- fortify(bankshinder)
netherlands_coastfort <- fortify(netherlands_coast)
eezfort <- fortify(eez)
banksfort <- fortify(banks)
bightfort <- fortify(bight)
bankszeelandfort <- fortify(bankszeeland)
bankscoastalfort <- fortify(bankscoastal)
bankshinderfort <- fortify(bankshinder)
netherlands_coastfort <- fortify(netherlands_coast)
netherlands_coastfort <- filter(netherlands_coastfort, lat >51.36)
plot_map()
First off, we group our station data in a new data frame stationdf. %>% creates a “pipeline” of code. This is an other way of formulating the combination of code to summarize a data frame.
Using plot_map, we can explore the location of the stations in our data.
plot_map() + geom_point(data = stationdf, aes(Longitude,Latitude), size = 4, colour = "red")
Now, we can add the station names.
plot_map() + geom_point(data = stationdf, aes(Longitude,Latitude), size = 4, colour = "red") +
geom_label(data = stationdf, aes(Longitude +0.01,Latitude,label = Station), hjust = 0)
This is very messy… By using the package ggrepel, you can avoid overlapping labels or text.
plot_map() + geom_point(data = stationdf, aes(Longitude,Latitude), size = 4, colour = "red") +
geom_label_repel(data = stationdf, aes(Longitude +0.01,Latitude,label = Station))
geom_label_repel() does the job, but our plot is still a bit messy. We can try to plot zones instead.
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude), size = 4, colour = "red") +
geom_label_repel(data = zonedf, aes(Longitude +0.01,Latitude,label = Zone))
In order to plot detection related variables spatially, we first summarise them for each zone.
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Milliseconds), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Number_clicks_filtered), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Dpm), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Dp10m), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Dph), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Click_frequency), colour = "red")
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Click_intensity), colour = "red")
An alternative to first grouping and summarizing the data, is to use the group argument in ggplot.
plot_map() + geom_point(data = poddata_day, aes(Longitude,Latitude, size = Click_frequency, group=Zone), colour = "red")
In order to get an idea of the porpoise distribution each month, we first summarize the data per zone and month.
We can now visualize the distribution pattern per month.
plot_map() + geom_point(data = zonedf[zonedf$Month == 1,], aes(Longitude,Latitude, size = Click_frequency), colour = "red") + ggtitle(1)
Instead of making 12 plots in 12 lines of code, we can make a list of plots.
lapply(unique(zonedf$Month), function(x){
plot_map() + geom_point(data = zonedf[zonedf$Month == x,], aes(Longitude,Latitude, size = Click_frequency), colour = "red") + ggtitle(x)
})
An easy alternative to plot in chronological order:
In order to have the results of the 12 months displayed in one plot, we can apply facet_wrap().
plot_map() + geom_point(data = zonedf, aes(Longitude,Latitude, size = Click_frequency), colour = "red") + facet_wrap(~Month)