Andy Mitchell

The ESRI Guide to GIS Analysis: Vol. 1: Geographic Patterns and Relationships

Redlands, CA: ESRI Press, 1999. ISBN 1-879102-06-4

 

CHAPTER 1: WHAT IS GIS ANALYSIS?

 

1.What are the three types of features in a GIS?  P/ 12

Point data: individual locations, such as businesses

Line data, such as rivers or roads

Polygon data, such as land parcels or political jurisdictions

 

2. Explain vector vs. raster representation of geographic information. P. 14

Vector: the most common type for mapping. Data are stored as information on x.y coordinates, line lengths, and angles, so the map may be recreated on any scale by redrawing according to these directions.

Raster: used in orthophotography and where there is continuous spatial data.. Data are stored as a matrix of cells (often very fine ones). Increasing map size causes graininess as cells are drawn further apart.

 

3. Explain “projections” and “coordinate systems”. P. 15.

Projections: There are several ways to draw the map on a spherical ball onto a flat surface.There is no perfect way to do this and all projections involve distortions. There is a tradeoff among accurate representation of distance, area, and shape.

Coordinate system: For any given projection, a coordinate system provides the definition of the origin and also the units from the origin to locate any point.

 

4. What are attributes, and what are the types of attribute values? Pp. 16-17

Attributes are variables, often represented as different layers on a map.

Categories are nominal values such as types of crime.

Ranks are ordinal data such as poor to excellent ratings for “scenic value”

Counts and amounts are interval data, such a population sizes.

Ratios are percentages, such as average no. of people per household in each census tract.

 

5. What are data tables and what are some common operations in working with them? Pp. 18-19.

Data tables are literally the tables containing attribute data. GIS software will have a table view as well as a map view.

Selecting is an operation which finds all features which have certain attributes.

Calculating is an operation used to create a new variable (ex., creating a ratio variable from two count variables).

Summarizing is the operation which gives descriptive statistics like means, medians, miximumx, and minimums.

 


CHAPTER 2: MAPPING WHERE THINGS ARE

 

1. What is Mitchell’s rule of thumb about the number of categories that can be represented on a map and still be readily understood? Pp. 30-31.

About 6 - 7 values per variable. He suggests grouping if you have more categories.

 

2. Which is easier to distinguish – colors or symbols to represent points? P. 32.

Colors.

 

3. Differentiate clustered, uniform, and random distribution of points. P. 35.

Clustered: features appear in groups, near each other.

Random: features are equally likely to appear anywhere on the map.

Uniform: features tend not to be near each other, but are also not likely to be found in some areas of the map.

 

 

 

CHAPTER 3. MAPPING THE MOST AND LEAST

 

1.  What methods are used to show most and least on a map, and what are the problems of each? What tips for good usage? Pp. 56-63

Graduated symbols: ex., small to large dots showing population size. Problem if points overlap. Tips: too small gradations are hard to discern; too big overlap too much. Note not just dots, but also various sizes of lines for line data.

Graduated colors: light to dark blue, to show ocean depth. Colors may not intuitively relate to magnitude. Tips: pink to dark red is more contrasty than light green to dark green; if you have positive and negative values, use two color schemes.

Contours: the closer contour lines are together, the steeper the grade at that elevation. Can be hard to read and requires interpolating values between observed points.

3-D Perspective: like contour but more visual; front areas may hide rear. Also, the z-value (perspective height) cannot be too small or too great or contrasts will be obscured. You may also have a light source with origin and angle, and this can also affect contrasts.

Charts. Superimposing bar or pie charts on areas can show patterns too. Can be too cluttery. Bar charts are more for amounts, pie charts for percentages. You can also use pies of graduated sizes to represent a variable.

 

2, What is classification for and explain the standard classification schemes. Pp. 48-51.

Classes are groupings of values, such as income ranges. Classification is assigning cases to these classes by a classification scheme below:

Natural breaks: nominal variables (ex., race) have natural breaks, and some “natural” breaks are human-defined, such as above and below the official poverty level. Use natural breaks if they exist, as a rule.

Quantiles: rank the cases from low to high and set cutting points which divide the cases into equal groupings (ex., 20% in each group, for quintiles)

Equal intervals: divide the cases by equal ranges of the unit of measurement, such as $0 - $10,000; $10,000 - $20,000; $20,000 - $30,000; etc. Easier to interpret than quantiles but some ranges may have few or even no cases.

Standard deviations: Assume a normal distribution and compute the standard deviation of the variable. About 95% of cases will be within 2 sd’s of the mean, 99% within three sd’s. Used to identify outliers.

 

3. How many classes are best to display data? P. 54.

This has to be approached by trial and error. Too few classes will average apples and oranges. Too many classes will lead to hard-to-discern complexity. The optimal number of classes will make the pattern stand out on the map. See example on p. 54.

 

           


 

CHAPTER 4. MAPPING DENSITY

 

1. What are some examples of mapping density? Pp. 70 ff.

Mapping density is mapping concentrations of something: concentrations of people (population), fish, businesses/square mile, logging road density

 

2. What are some examples of policy decisions using density maps? P. 70-71

- Assigning police to patrol areas based in crime incident density

- Locating government service agency offices based on population density

- Locating fish monitoring devices in lake areas of high fish density

 

3. What is a dot density map, and what is a major drawback of this type of map? Likewise, what is a density surface map and what is a major drawback of density surface maps? P. 70

            A dot density map shows location and can show density, but it has the problem of overlaying dots at the same location, obscuring the true density.

            A density surface map is a type of contour map , with shaded contours linking points of approximately equal density. Using a color gradient to display increasing density (ex., darker reds), it is easier to see the true density of any area. See map p. 70, upper left. However, like all contour maps, it may falsely suggest equal density throughout any given contour plane.

 

4. Is dot density mapping the same as mapping locations? P. 73

No, Location mapping is a special case of dot density mapping, where there is one dot per location (ex., per business). But it may be that one dot represents FIVE businesses in a zip code, for instance. In such cases the dot may be located at random within the zip code and may obscure actual locations (ex., locations along a highway). See maps in lower left of p. 73.

 

5. Can a choropleth (shaded area) map be a density map? Compare such maps with density surface maps in terms of pros and cons for measuring density. P. 72

            A choropleth map is a density map if the color gradient used to shade the areas (ex., census tracts) based on some density value (ex., population per square mile). See map in lower left corner of p. 72.

            Both chorpleth and density surface maps measure density, and both have the drawback of suggesting equal density thoughout a given shaded area. When the choropleth areas (ex., census tracts) are related to the variable whose density is being measured, they may be preferable. When there is no particular relation, the unconstrained definition of density areas by a density surface map may be preferable.  Of course, one can have a density surface map AND overlay boundaries like census tracts or school districts.

 

6. What is Mitchell’s point about mapping features vs. mapping feature values? P. 72

Make sure you are mapping the variables you really want. In the example, is it number of businesses or is it number of employees?

 

7. What does “calculating density on the fly” mean? P. 75

It means ArcView, ArcInfo and some other GIS’s do not require an actual density field to be a column in the database. Instead one specifies the count field (ex., # of businesses) and the area field (ex., zip code feature).

 

8. What three mistakes in dot mapping can obscure the pattern one is seeking?pp. 76-77.

            1. Too few features per dot (ex., 2 businesses per dot rather than 5 businesses per dot) leads to too many dots, overlapping, and obscuring the pattern.

            2. Dot size too large, leading to overlapping and obscuring.

            3. Overlaid boundaries too fine (census blocs instead of census tracts), causing boundary lines to overlay dots and obscure patterns.

 

9. What causes the difference between the two maps at the top of p. 77?

The one on the left maps 1 dot = 500 households by county and has county boundaries.

The one on the right maps the same thing BY ZIP CODE and still has county boundaries.

This locates the dots more accurately WITHIN counties (but still at random within any given zip code).

 

10. How exactly are density contour lines drawn by ArcView?

The map is treated as a raster surface, divided by a grid. The number of features (ex. businesses) within a given radius of each cell center. Each grid cell is colored in according to its summed count divided by area size.

            The smaller the grid cell size, the smoother the contour lines but the longer to process and the larger the file size to store.  (Cell size is defined by the length of one of the sides of its square).  Generally, one picks a cell size which is 10 to 100 cells per density unit (ex., per square mile)

            The larger the search radius, the more generalized the patterns. Too large or too small a radius will make patterns hard to see.  In counting features within the radius, one may have a simple count or may weight to give more emphasis if as the feature is closer to the center of the grid cell.

 

11. When using graduated colors to show different density values, what are the four methods of graduation?

            1. Natural breaks

            2. Quantiles: equal number of cases in each of the categories: can force more cases into the highest category, obscuring the true centers of density.

            3. Equal interval: The ranges represented by each category are equal (ex., 0-500, 501-1000, etc) even if this means some categories are unequal or even empty in count.

            4. Standard deviation: plus or minus 1, 2, 3 standard deviations: highlights the extremes.                                                            

 

12. When using graduation, how many categories should you ask for? P. 83

Several is good. Over 15 is really hard to interpret. Under 4 may well not show patterns well.  By the way, usually darker=more dense.

 

13. Is a contour map the same as a density surface map? P. 84.

No. A contour map shows actual contour lines. A density surface maps shows the same information as shaded contours. The contour lines could be narrower or more widely spaced than the shaded contours of the density surface map. See the map at the bottom of p. 84.

 

14. How can it be that there are no features where the highest density is shown on a density map? P. 85

Density is calculated based on counts within a radius of each grid cell. The grid cell itself might have zero count. This would happen, for instance, for grid cells located between other grid cells with high counts and within the search radius of grid cells with zero count.

 

 


Chapter 5. FINDING WHAT’S INSIDE

 

1. What are some examples of mapping to “find what’s inside”? P. 89 ff.

- Mapping interior features: business points, road lines, school district polygons

- New features you draw, like new sales areas dividing an existing polygon

- Merging existing polygons to make a large one

- Service areas: features w/in a radius, driving distance, or driving time

- Buffer zones, like around a school for purposes of speed limits

Note: You can include only polygons wholly inside an area (ex., inside a flood plain); areas which are wholly or partly inside; or polygons which are wholly inside plus the inside portions of the partly inside polygons).

 

2. Differentiate list, count, and summary output. Pp. 92-93.

- List: table of all parcels, with ones inside a flood plain highlighted in yellow (as in ArcView)

- Return the count of the number of parcels inside a flood plain

- Sum the tax assessment value of all parcels inside a flood plain; sum the area of parcels by each of several types of land use within a flood plain.

 

3. What are the three ways of “finding what’s inside”? What question does each method address? Pp. 96-98.

- 1. Overlay a polygon (ex., flood plain) on a feature map (ex., tax parcels). En Question: Is a given feature inside the polygon?

- 2. As (1) but also let the GIS use the polygon overly to select automatically the features included within the polygon. Question: Return a list of features inside the polygon, or return a summary of features inside the polygon.

- 3. As (2) but then also create a new third layer combining the features of the first two layers. Question: Return a list or summary for multiple polygons in the overlay.

 

Note: When overlaying, you can put the polygon boundary layer on top of or underneath the feature layer, depending on which you want to emphasize. You can also control the transluscency of the shading of polygons, and the thickness of border lines to get different effects.

 

4. What are three ways of displaying an overlaid polygon? P. 100.

- Use a dark line border

- Fill it with hatching or a transluscent shade.

- Place a screen over all areas outside the polygon, with the polygon itself transparent (thus showing the feature layer underneath). See p. 100, lower right.

 

5. What is the difference between counts , frequencies, and summaries? Pp. 102-103

- counts are the number of features regardless of value (ex., businesses, regardless of type) within an area

- frequencies are the number of features of a given value or of each value (ex., retail businesses) within an area

- summaries are sums, averages, medians, or standard deviations of numeric attributes (ex., number of employeess at businesses in an area).

 

6. What happens when you overlay a polygon laye on top of a feature layer, select features within a polygon (within a flood plain), and create a new third layer? Pp. 106-107

The GIS uses the x,y coordinates of the features to match them to the polygon(s) (ex., census trracts) and create a  new merged table containing both the feature information and the polygon information. Rows will be the features, but there will now be columns for census tract id and census tract population. You can now sum features by tract and join the summary table to the tract table, in which rows are tracts. You can then divide the sum of features by population to create a new features per population variable.

 

7. What happens when you overlay a polygon layer on another polygon area? Pp. 108-109.

The boundaries of the first layer are used to subdivide the boundaries of the second area, giving you a merged layer with a lot of polygons.

 

Note: when you do this, you may get slivers, which are small areas where borders are offset. It is best to merge these slivers with adjacent larger polygons. This happens only with vector overlays, not raster overlays.

 

8. How can bar or pie charts be integrated in mapping? Pp. 11-12

Pie or bar charts can show the frequency distribution of values of a feature or percentages (ex., percent of area by land use type). Charts can be placed on top of their respective polygons.

 


CHAPTER 6: FINDING WHAT’S NEARBY

 

1. What are some examples of proximity maps (mapping what’s nearby)? Pp. 116 ff.

- Service or catchment areas (p. 116): concentric zones (by radius, road distance, or time distance) around some feature

- Color-coded points coded by which feature they are nearest (p. 117, bottom left)

- Spider diagram: same as above but with lines to the feature (p. 117 right)

 

2.  What is “cost” in the context of proximity mapping?

Cost is a value associated with distance. Cost can be used as the definition of nearness. Examples would be operating cost per mile, travel time per mile, or simply miles. Distance by cost will be different from straight-line (radial) distance.

            Cost is usually in the context of cost over a network, where each line segment has an associate time or distance cost.

            However, you can also have cost over a surface (without a network). An example would be travel cost in an open hiking area, based on slope (steeper slope = more cost).

 

3. What are some uses of straight-line distance? P.122.

- Create a buffer around something.

- Create a distance surface around something (a concentric series of buffers, with a color gradient).

- Calculate feature to feature distance

- Select features within a given distance of another feature

 

4. What is Euclidean distance and how is it calculated?

It is the straight-line distance from one point to another. It is calculated as the square root of the sum of the squares of the sides (over and up) of the triangle connecting the points along the hypotenuse. (Pythagorean theorem).

 

5. How would you find features within 1,000 feet of two other features, X and Y? p. 127.

Create a 1,000 foot buffer around X and select features within the buffer

            Create a variable and set all selected features to 1

            Repeat for Y

            Select features with both variables set to 1.

 

6. If you want to have a map’s table have a column for distances to a feature, do you have to calculate the distance for each point of interest? p. 129

No. The GIS’s distance calculation for all features in relation to a given feature(s) will insert two new columns in the table: one for the id# of the nearest given feature(s) and one for the distance from the current point to that feature.

 

7. What is a distance surface? p. 132.

It is a raster layer of concentric distance intervals from a point, line, or polygon. You can set the distance ranges and also the maximum distance for which intervals will be calculated.

8. What is a network? pp. 135 ff.

 

CHAPTER 7: MAPPING CHANGE

 

1. Explain some different types of change-focused maps Pp. 151-2.

* Change of location: one map with a series of connected points, with each point reflecting a change in location. Ex: bird migratory path from north to south US

* Change in character: a before and after map with one color per character value.. Ex. Land cover in 1914 and 1988

* Change in magnitude: choropleth map keyed to a change variable. Ex. Percent change in population by county.

* Change in location and character: one map, connected points, point symbol changes to reflect changing character value. Ex. Path of precipitation with different symbols for rain, sleet, snow.

* Change in location and magnituede: one map, connected points, but point size changes by magnitude. Ex: hurricane track with dot size based on wind speed.

 

2. If there are many change points (ex., many points in time), what determines how many intervals you have? Pp. 155-7.

* Wide enough intervals that anomalies and outliers get averaged in

* Use the fewest divisions necessary to show change (fewer is easier to visualize)

* Include start and end points, of course (total time period tracked should be consistent with the change process being studied - ex, cannot study climate change over a short period).

 

3. How does mapping change relate to use of color gradients in maps?

* Normally use a gradient with darker=more change

* But if you have a change-nochange situation, let no change be a contrasting color like white

* If there is both positive and negative change, use a two-color contrasting gradient with no change at the color midpoint.

 

4. What is a “tracking map”? P. 161

A tracking map shows the position of something at each of several points in time. For instance, it might be a choropleth map where the gradient colors show the progressive spread of some area (ex. A forest fire area on days 1, 2, 3, etc.).  Could be done with lines or points, not just polygons as shown on p. 161, bottom.

 

5. How is movement related to position in a tracking map? P. 165

Since you locate the successive polygons, line segments, or points at equal time intervals, the further apart they are, the more rapid the movement. For instance, in a hurricane tracking map, long line segments indicate rapid movement in those time periods, short segments mean little movement in those other time segments.

 

6. How can orthophotography be integrated with tracking maps, and why?

This is simply superimposing successive polygon boundaries, line segments, or points on an aerial photograph of the are area, to show visually what the original area looked like (ex., as before a fire swept through successive polygons).

 

7. How can you map just the area that has been changed using the raster method? P. 170

The raster method divides an area into grid cells. Overlay one may, say 2000, on another map of the same area, say 1990. A GIS can match cells and automatically select the ones which differ...this selection is just the changed area.

 

8. How does the vector method differ? P. 170-171.

The two layers (2000 and 1990) are overlaid and a new layer is created with polygons for each intersection of the overlaying boundaries with the base layer boundaries. For each new polygon in the new layer, select it if, say landusetype2000 differs from landusetype1990.

 

9. What is the maximum number of category values Mitchell reommends in a single map? P. 172.

Six

 

10. What is a density surface change map? P. 173.

Recall a density map is a contour map where the contours are areas of like density, such as crime incident density. If you have density maps for two time periods, you can subtract the incidents in the second table from those in the first, and in the resulting difference table, use these positive and negative values to create a new density surface map. Use contrasting gradients (ex., reds and blues) to represent positive and negative density contours, reflecting increase or decreas in, say, crime incidents. This is illustrated on p. 173.