Multivariable geostatistics in S: the gstat package

Published on Aug 1, 2004 in Computers & Geosciences (2.721)
DOI: 10.1016/j.cageo.2004.03.012
Edzer Pebesma
Estimated H-index: 30
(UU: Utrecht University)
This paper discusses advantages and shortcomings of the S environment for multivariable geostatistics, in particular when extended with the gstat package, an extension package for the S environments (R, S-Plus). The gstat S package provides multivariable geostatistical modelling, prediction and simulation, as well as several visualisation functions. In particular, it makes the calculation, simultaneous fitting, and visualisation of a large number of direct and cross (residual) variograms very easy. Gstat was started 10 years ago and was released under the GPL in 1996; was started in 1998. Gstat was not initially written for teaching purposes, but for research purposes, emphasising flexibility, scalability and portability. It can deal with a large number of practical issues in geostatistics, including change of support (block kriging), simple/ordinary/universal (co)kriging, fast local neighbourhood selection, flexible trend modelling, variables with different sampling configurations, efficient simulation of large spatially correlated random fields, indicator kriging and simulation, and (directional) variogram and cross variogram modelling. The formula/models interface of the S language is used to define multivariable geostatistical models. This paper introduces the gstat S package, and discusses a number of design and implementation issues. It also draws attention to a number of papers on integration of spatial statistics software, GIS and the S environment that were presented at the spatial statistics workshop and sessions during the conference Distributed Statistical Computing 2003.
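As a rough illustration of what calculating direct and cross variograms involves, the sketch below implements the classical method-of-moments estimator in plain Python. It is not gstat's code or its interface: the function name `empirical_variogram`, the parameter names `width` and `cutoff`, the binning rule and the toy transect data are all assumptions made for this example, and both variables are assumed to be observed at the same locations.

```python
import math
from itertools import combinations


def empirical_variogram(coords, z1, z2=None, width=1.0, cutoff=None):
    """Method-of-moments estimate of a direct or cross variogram.

    coords : list of (x, y) tuples, one per observation
    z1, z2 : observed values at those locations; z2=None gives the
             direct variogram of z1, otherwise the cross variogram
    width  : lag bin width (hypothetical parameter name)
    cutoff : ignore pairs separated by more than this distance

    Returns a list of (mean distance, gamma, number of pairs), one
    entry per non-empty lag bin, ordered by increasing distance.
    """
    if z2 is None:
        z2 = z1
    # bin index -> [sum of distances, sum of increment products, pair count]
    bins = {}
    for a, b in combinations(range(len(coords)), 2):
        h = math.dist(coords[a], coords[b])
        if cutoff is not None and h > cutoff:
            continue
        acc = bins.setdefault(int(h // width), [0.0, 0.0, 0])
        acc[0] += h
        acc[1] += (z1[a] - z1[b]) * (z2[a] - z2[b])
        acc[2] += 1
    # gamma(h) = sum of increment products / (2 * number of pairs)
    return [(d / n, g / (2 * n), n) for _, (d, g, n) in sorted(bins.items())]


# Toy data (made up): four points on a transect, two colocated variables.
coords = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
zinc = [1.0, 2.0, 4.0, 7.0]
lead = [2.0, 4.0, 8.0, 14.0]

for label, second in (("direct", None), ("cross", lead)):
    for dist, gamma, n in empirical_variogram(coords, zinc, second):
        print(f"{label}: h={dist:.1f} gamma={gamma:.3f} pairs={n}")
```

Fitting a joint model to these sample values, which the abstract calls simultaneous fitting, is a separate step that this sketch leaves out.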
References (26)
#1 Peter J. Diggle (Lancaster University), H-Index: 75
#2 Jonathan A. Tawn (Lancaster University), H-Index: 45
Last. Rana Moyeed (PSU: Plymouth State University), H-Index: 22
Geostatistics is concerned with estimation and prediction problems for spatially continuous phenomena, using data obtained at a limited number of spatial locations. Model-based geostatistics refers to the application of general statistical principles of modelling and inference to geostatistical problems. This volume is the first book-length treatment of model-based geostatistics.
1,496 Citations
#1 Roger Bivand, H-Index: 1
Access to well-structured and sometimes self-describing spatial position data with associated data attributes in geographical scales domains is increasing, and is expected to increase further. Until recently, it has often been sufficient to treat data sets as autonomous, dropping positional metadata attributes for analysis and visualization. It may be argued that this is short-sighted, because positional data from different sources may not then be readily co-registered. This contribution will su...
4 Citations
Kriging with external drift allows one to estimate a target variable, accounting for a densely sampled auxiliary variable. Contrary to cokriging, kriging with external drift does not make explicit the structural link between target variable and auxiliary variable, for the latter is considered to be deterministic. In this paper, we show that kriging with external drift assumes implicitly an absence of spatial dependence between the auxiliary variable and the residual of the linear regression of t...
32 Citations
#1 Rana Moyeed (Plymouth University), H-Index: 22
#2 Andreas Papritz (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 17
Spatial prediction is a problem common to many disciplines. A simple application is the mapping of an attribute recorded at a set of points. Frequently a nonlinear functional of the observed variable is of interest, and this calls for nonlinear approaches to prediction. Nonlinear kriging methods, developed in recent years, endeavour to do so and additionally provide estimates of the distribution of the target quantity conditional on the observations. There are few empirical studies that validate...
66 Citations
#1 Petter Abrahamsen (Norwegian Computing Center), H-Index: 8
#2 Fred Espen Benth (Norwegian Computing Center), H-Index: 31
A Gaussian random field with an unknown linear trend for the mean is considered. Methods for obtaining the distribution of the trend coefficients given exact data and inequality constraints are established. Moreover, the conditional distribution for the random field at any location is calculated so that predictions using e.g. the expectation, the mode, or the median can be evaluated and prediction error estimates using quantiles or variance can be obtained. Conditional simulation techniques are ...
33 Citations
#1 Roger Bivand (NHH: Norwegian School of Economics), H-Index: 25
Abstract Many researchers wish to explore and analyse spatial data, but typical software does not readily permit such integration. This paper presents a simple interface between two open-source software systems, the GRASS geographical information system, and the R statistical data analysis language. The platform used here is GNU/Linux, because both systems compile and install cleanly; R runs cleanly in Windows environments as well. The interface allows floating point and category data to be pass...
46 Citations
#1 Roger Woodard (WashU: Washington University in St. Louis), H-Index: 1
1 Linear Prediction.- 1.1 Introduction.- 1.2 Best linear prediction.- Exercises.- 1.3 Hilbert spaces and prediction.- Exercises.- 1.4 An example of a poor BLP.- Exercises.- 1.5 Best linear unbiased prediction.- Exercises.- 1.6 Some recurring themes.- The Matern model.- BLPs and BLUPs.- Inference for differentiable random fields.- Nested models are not tenable.- 1.7 Summary of practical suggestions.- 2 Properties of Random Fields.- 2.1 Preliminaries.- Stationarity.- Isotropy.- Exercise.- 2.2 The ...
1,585 Citations
#1 Edzer Pebesma (UvA: University of Amsterdam), H-Index: 30
#2 Gerard B. M. Heuvelink (UvA: University of Amsterdam), H-Index: 42
Following the method of Stein, this article shows how a Latin hypercube sample can be drawn from a Gaussian random field. In a case study the efficiency of Latin hypercube sampling is compared experimentally to that of simple random sampling. The model outputs studied are the mean and the 5- and 95-percentile of the areal fraction where point concentration of zinc in the topsoil exceeds a given threshold. The Latin hypercube sampling procedure slightly distorts the short-distance correlation, an...
115 Citations
#1 Edzer Pebesma (UvA: University of Amsterdam), H-Index: 30
#2 C. G. Wesseling (UU: Utrecht University), H-Index: 6
Abstract Gstat is a computer program for variogram modelling, and geostatistical prediction and simulation. It provides a generic implementation of the multivariable linear model with trends modelled as a linear function of coordinate polynomials or of user–defined base functions, and independent or dependent, geostatistically modelled, residuals. Simulation in gstat comprises conditional or unconditional (multi-) Gaussian sequential simulation of point values or block averages, or (multi-) indi...
420 Citations
177 Citations
Cited By (1436)
#1 Shin Araki, H-Index: 4
#2 Masayuki Shima, H-Index: 18
Last. Kouhei Yamamoto, H-Index: 5
Abstract Accurate estimation of historical PM 2.5 exposures for epidemiological studies is challenging when extensive monitoring data are limited in duration. Here, we develop a national-scale PM 2.5 exposure model for Japan using measurements recorded between 2014 and 2016 to estimate monthly means for 1987 through 2016. Our objective is to obtain accurate PM 2.5 estimates for years prior to implementation of extensive PM 2.5 monitoring, using observations from a limited period. We utilize a ne...
#1 Hussnain Mukhtar (NTU: National Taiwan University), H-Index: 2
#2 Chieh-Yu Chan (NTU: National Taiwan University)
Last. Chiao-Ming Lin (NTU: National Taiwan University), H-Index: 1
(4 authors in total)
Abstract Birds are bioindicators for research on the relationship between environmental heavy metal concentration levels and accumulation levels in bird tissues. We use roadkill samples, collected by citizen science participants, to investigate the accumulation levels and associations of seven heavy metals in internal organs (heart, liver, and kidney), feathers (primary and breast), and bone (sternum and femur) of two focal species, Amaurornis phoenicurus and Gallinula chloropus. We found that h...
#1 Alois Simon (BOKU: University of Natural Resources and Life Sciences, Vienna), H-Index: 2
#2 Clemens Geitner (University of Innsbruck), H-Index: 8
Last. Klaus Katzensteiner (BOKU: University of Natural Resources and Life Sciences, Vienna), H-Index: 14
Abstract The modelling of forest ecosystems is a broad scientific field, encompassing species distribution, dynamic forest succession, growth and disturbance, and biogeochemical cycles. Soil information is frequently required for a holistic and spatially explicit modelling approach. Information on soil properties at sufficient resolution to be incorporated in spatially distributed models is rare however, in particular for mountain forest areas that are poorly accessible and where the required sa...
The trees in agroforestry plots create spatial heterogeneity of high interest for adaptation, mitigation, and the provision of ecosystem services. But to what distance, exactly, from the tree? We tested a novel approach, based upon geostatistics and Unmanned Aerial Vehicle (UAV) sensing, to infer the distance at which a single agroforestry tree affects the surrounding under-crop, to map yield, litter (i.e. stover) and compute crop-partial Land Equivalent Ratio (LERcp) at the whole-plot level. In...
#1 Jingye Li (CAS: Chinese Academy of Sciences)
#2 Jianguo Huang (CAS: Chinese Academy of Sciences), H-Index: 22
Last. Peng Zhou (CAS: Chinese Academy of Sciences)
(7 authors in total)
Abstract A greater amplitude and higher mean intensity of the El Nino phenomenon have both been observed in the last few decades. In order to determine how tree growth and recent El Nino variabilities are associated within the East Asia subtropical forest (EASF), we conducted a dendroecological study using a network of 25 zonal tree-ring width chronologies from Pinus massoniana Lamb. trees distributed across extensive latitudinal (23 to 33°N) and elevational (77 to 1285 m) gradients in EASF. Usi...
#1 Scott M. Devine, H-Index: 2
#2 Anthony T. O’Geen, H-Index: 20
Last. Randy A. Dahlgren, H-Index: 55
(7 authors in total)
Abstract Accurate assessments of soil organic carbon (SOC) stocks are needed at multiple scales given their importance to both local soil health and global C cycles. Rangelands cover 54% of California, representing a large stock of SOC, but existing SOC estimates are uncertain. To improve understanding of fine-resolution SOC stocks in complex terrain and provide guidance to rangeland SOC inventories, we grid-sampled 105 locations (21-m grid cells) at two depths (0–10 and 10–30 cm) in a 10-ha ann...
#1 Christopher Young (WSL: Swiss Federal Institute for Forest, Snow and Landscape Research), H-Index: 2
#2 Mathias Hofmann (TUD: Dresden University of Technology), H-Index: 5
Last. Nicole Bauer (WSL: Swiss Federal Institute for Forest, Snow and Landscape Research), H-Index: 9
(5 authors in total)
Abstract In the context of increasing urbanization, gardens as a form of urban greenspace are an important resource for the psychological restoration of urban dwellers, while underpinning urban biodiversity and delivering ecosystem services. However, the links between restoration, garden type and biodiversity are not fully understood. In this interdisciplinary study we aimed to identify how the self-reported restoration of gardeners was related to three factors: garden type (domestic vs. allotme...
#1 Lauren T. Bennett (University of Melbourne), H-Index: 21
#2 Nina Hinko-Najera (University of Melbourne), H-Index: 5
Last. Sabine Kasel (University of Melbourne), H-Index: 16
(7 authors in total)
Abstract Soil organic carbon (SOC) stocks in Australia’s temperate forests have been overlooked in national soil databases and in global SOC analyses of natural ecosystems despite the importance of temperate forests to the global terrestrial carbon balance. This limits the potential to both predict change in SOC stocks in temperate Australia and to identify where and how SOC stocks can be managed to mitigate climate change. Based on data from 707 sites, we examine variations in SOC concentration...
#1 Maogui Hu (CAS: Chinese Academy of Sciences)
#2 Yanwei Huang (CAS: Chinese Academy of Sciences)
Abstract Geostatistical interpolation methods are used in diverse disciplines, such as environmental science, ecology, and hydrology. With the increasing availability of areal spatial data, area-to-area and area-to-point interpolations have great application potential. In this study, based on the variogram deconvolution algorithm proposed by Goovaerts (2008), an open-source area-to-area kriging package atakrig is developed in the R environment. In atakrig, point-scale variogram and cross-variogr...
#1 Gerald Blasch, H-Index: 1
#2 Zhenhai Li, H-Index: 11
Last. James A. Taylor, H-Index: 15