GIS spatial analysis is a rapidly changing field, and GIS packages are increasingly including analytical tools as standard built-in facilities, as optional toolsets, as add-ins or 'analysts'. In many instances these are provided by the original software suppliers (commercial vendors or collaborative non-commercial development teams), while in other cases facilities have been developed and are provided by third parties. Furthermore, many products offer software development kits (SDKs), programming languages and language support, scripting facilities and/or special interfaces for developing one's own analytical tools or variants. The increased availability has created a new dimension to
business intelligence termed "
spatial intelligence" which, when openly delivered via intranet, democratizes access to geographic and social network data.
Geospatial intelligence, based on GIS spatial analysis, has also become a key element for security. GIS as a whole can be described as conversion to a vectorial representation or to any other digitisation process.
Geoprocessing is a GIS operation used to manipulate spatial data. A typical geoprocessing operation takes an input
dataset, performs an operation on that dataset, and returns the result of the operation as an output dataset. Common geoprocessing operations include geographic feature overlay, feature selection and analysis,
topology processing,
raster processing, and data conversion. Geoprocessing allows for definition, management, and analysis of information used to form decisions.
Terrain analysis of the Valestra area in the northern Apennines (Italy) Many geographic tasks involve the
terrain, the shape of the surface of the earth, such as
hydrology,
earthworks, and
biogeography. Thus, terrain data is often a core dataset in a GIS, usually in the form of a raster
Digital elevation model (DEM) or a
Triangulated irregular network (TIN). A variety of tools are available in most GIS software for analyzing terrain, often by creating derivative datasets that represent a specific aspect of the surface. Some of the most common include: •
Slope or grade is the steepness or gradient of a unit of terrain, usually measured as an angle in degrees or as a percentage. •
Aspect can be defined as the direction in which a unit of terrain faces. Aspect is usually expressed in degrees from north. • Cut and fill is a computation of the difference between the surface before and after an
excavation project to estimate costs. •
Hydrological modeling can provide a spatial element that other hydrological models lack, with the analysis of variables such as slope, aspect and watershed or
catchment area. Terrain analysis is fundamental to hydrology, since water always flows down a slope. Each of these is strongly affected by the level of detail in the terrain data, such as the resolution of a DEM, which should be chosen carefully.
Proximity analysis Distance is a key part of solving many geographic tasks, usually due to the
friction of distance. Thus, a wide variety of analysis tools have analyze distance in some form, such as
buffers,
Voronoi or Thiessen polygons,
Cost distance analysis, and
network analysis.
Data analysis It is difficult to relate
wetlands maps to rainfall amounts recorded at different points such as airports, television stations, and schools. A GIS, however, can be used to depict two- and three-dimensional characteristics of the Earth's surface, subsurface, and atmosphere from information points. For example, a GIS can quickly generate a map with
isopleth or
contour lines that indicate differing amounts of rainfall. Such a map can be thought of as a rainfall contour map. Many sophisticated methods can estimate the characteristics of surfaces from a limited number of point measurements. A two-dimensional contour map created from the surface modeling of rainfall point measurements may be overlaid and analyzed with any other map in a GIS covering the same area. This GIS derived map can then provide additional information – such as the viability of
water power potential as a
renewable energy source. Similarly, GIS can be used to compare other
renewable energy resources to find the best geographic potential for a region. Additionally, from a series of three-dimensional points, or
digital elevation model, isopleth lines representing elevation contours can be generated, along with slope analysis,
shaded relief, and other elevation products. Watersheds can be easily defined for any given reach, by computing all of the areas contiguous and uphill from any given point of interest. Similarly, an expected
thalweg of where surface water would want to travel in intermittent and permanent streams can be computed from elevation data in the GIS.
Topological modeling A GIS can recognize and analyze the spatial relationships that exist within digitally stored spatial data. These
topological relationships allow complex spatial modelling and analysis to be performed. Topological relationships between geometric entities traditionally include adjacency (what adjoins what), containment (what encloses what), and proximity (how close something is to something else).
Geometric networks Geometric networks are linear networks of objects that can be used to represent interconnected features, and to perform special spatial analysis on them. A geometric network is composed of edges, which are connected at junction points, similar to
graphs in mathematics and computer science. Just like graphs, networks can have weight and flow assigned to its edges, which can be used to represent various interconnected features more accurately. Geometric networks are often used to model road networks and
public utility networks, such as electric, gas, and water networks. Network modeling is also commonly employed in
transportation planning,
hydrology modeling, and infrastructure modeling.
Cartographic modeling layer (contour lines) over it. Next up is a standing water layer (pond, lake) and then a flowing water layer (stream, river), followed by the boundary layer and finally the road layer on top. The order is very important to properly display the final result. Note that the ponds are layered under the streams, so that a stream line can be seen overlying one of the ponds.
Dana Tomlin coined the term
cartographic modeling in his PhD dissertation (1983); he later used it in the title of his book,
Geographic Information Systems and Cartographic Modeling (1990).
Cartographic modeling refers to a process where several thematic
layers of the same area are produced, processed, and analyzed. Tomlin used raster layers, but the overlay method (see below) can be used more generally. Operations on map layers can be combined into algorithms, and eventually into simulation or optimization models.
Map overlay The combination of several spatial datasets (points, lines, or
polygons) creates a new output vector dataset, visually similar to stacking several maps of the same region. These overlays are similar to mathematical
Venn diagram overlays. A
union overlay combines the geographic features and attribute tables of both inputs into a single new output. An
intersect overlay defines the area where both inputs overlap and retains a set of attribute fields for each. A
symmetric difference overlay defines an output area that includes the total area of both inputs except for the overlapping area. Data extraction is a GIS process similar to vector overlay, though it can be used in either vector or raster data analysis. Rather than combining the properties and features of both datasets, data extraction involves using a "clip" or "mask" to extract the features of one data set that fall within the spatial extent of another dataset. In raster data analysis, the overlay of datasets is accomplished through a process known as "local operation on multiple rasters" or "
map algebra", through a function that combines the values of each raster's
matrix. This function may weigh some inputs more than others through use of an "index model" that reflects the influence of various factors upon a geographic phenomenon.
Geostatistics Geostatistics is a branch of statistics that deals with field data, spatial data with a continuous index. It provides methods to model spatial correlation, and predict values at arbitrary locations (interpolation). When phenomena are measured, the observation methods dictate the accuracy of any subsequent analysis. Due to the nature of the data (e.g. traffic patterns in an urban environment; weather patterns over the Pacific Ocean), a constant or dynamic degree of precision is always lost in the measurement. This loss of precision is determined from the scale and distribution of the data collection. To determine the statistical relevance of the analysis, an average is determined so that points (gradients) outside of any immediate measurement can be included to determine their predicted behavior. This is due to the limitations of the applied statistic and data collection methods, and interpolation is required to predict the behavior of particles, points, and locations that are not directly measurable.
Interpolation is the process by which a surface is created, usually a raster dataset, through the input of data collected at a number of sample points. There are several forms of interpolation, each which treats the data differently, depending on the properties of the data set. In comparing interpolation methods, the first consideration should be whether or not the source data will change (exact or approximate). Next is whether the method is subjective, a human interpretation, or objective. Then there is the nature of transitions between points: are they abrupt or gradual. Finally, there is whether a method is global (it uses the entire data set to form the model), or local where an algorithm is repeated for a small section of terrain. Interpolation is a justified measurement because of a spatial autocorrelation principle that recognizes that data collected at any position will have a great similarity to, or influence of those locations within its immediate vicinity.
Digital elevation models,
triangulated irregular networks, edge-finding algorithms,
Thiessen polygons,
Fourier analysis,
(weighted) moving averages,
inverse distance weighting,
kriging,
spline, and
trend surface analysis are all mathematical methods to produce interpolative data.
Address geocoding Geocoding is interpolating spatial locations (X,Y coordinates) from street addresses or any other spatially referenced data such as
ZIP Codes,
parcel lots and address locations. A reference theme is required to
geocode individual addresses, such as a road centerline file with address ranges. The individual address locations have historically been interpolated, or estimated, by examining address ranges along a road segment. These are usually provided in the form of a table or database. The software will then place a dot approximately where that address belongs along the segment of centerline. For example, an address point of 500 will be at the midpoint of a line segment that starts with address 1 and ends with address 1,000. Geocoding can also be applied against actual parcel data, typically from municipal tax maps. In this case, the result of the geocoding will be an actually positioned space as opposed to an interpolated point. This approach is being increasingly used to provide more precise location information.
Reverse geocoding Reverse geocoding is the process of returning an estimated
street address number as it relates to a given coordinate. For example, a user can click on a road centerline theme (thus providing a coordinate) and have information returned that reflects the estimated house number. This house number is interpolated from a range assigned to that road segment. If the user clicks at the
midpoint of a segment that starts with address 1 and ends with 100, the returned value will be somewhere near 50. Note that reverse geocoding does not return actual addresses, only estimates of what should be there based on the predetermined range.
Multi-criteria decision analysis Coupled with GIS,
multi-criteria decision analysis methods support decision-makers in analysing a set of alternative spatial solutions, such as the most likely ecological habitat for restoration, against multiple criteria, such as vegetation cover or roads. MCDA uses decision rules to aggregate the criteria, which allows the alternative solutions to be ranked or prioritised. GIS MCDA may reduce costs and time involved in identifying potential restoration sites.
GIS data mining GIS or spatial
data mining is the application of data mining methods to spatial data. Data mining, which is the partially automated search for hidden patterns in large databases, offers great potential benefits for applied GIS-based decision making. Typical applications include
environmental monitoring. A characteristic of such applications is that spatial correlation between data measurements require the use of specialized algorithms for more efficient data analysis. GIS-based spatial modeling has increasingly been combined with machine learning techniques to identify and forecast regional inequities in health and social outcomes, including healthcare access and insurance enrollment. ==Data output and cartography==