Theorizing globally, but analyzing locally: the importance of geographically weighted regression in crime analysis

Andresen, Martin A.

doi:10.1186/s40163-022-00173-0

Research
Open access
Published: 17 October 2022

Theorizing globally, but analyzing locally: the importance of geographically weighted regression in crime analysis

Martin A. Andresen ORCID: orcid.org/0000-0002-4767-7276¹

Crime Science volume 11, Article number: 10 (2022) Cite this article

3174 Accesses
2 Citations
2 Altmetric
Metrics details

Abstract

Theoretical relationships with crime across cities are explicitly or implicitly assumed to be the same in all places: a one-unit change in X leads to a β change in Y. But why would we assume the impact of unemployment, for example, is the same in wealthy and impoverished neighborhoods? We use a local statistical technique, geographically weighted regression, to identify local relationships with property crime. We find that theoretical relationships vary across the city, most often only being statistically significant in less than half of the city. This is important for the development of criminal justice policy and crime prevention, because these initiatives most often work in particular places potentially leading to a misallocation of scarce public resources.

Introduction

Global regression analyses of neighbourhoods in a city estimate one parameter for each explanatory variable to represent an entire study region. Though rarely questioned, having one estimate to represent an entire study area is an assumption. But does one estimate properly represent an entire city? Would one expect a one percent change in unemployment to have the same impact in a wealthy neighbourhood as an impoverished neighbourhood? Certainly not. This has important implications for understanding the nuances of theoretical relationships, but also for criminal justice policy. If a public policy variable only impacts crime in particular areas, this should be known to avoid wasting scarce public resources.

In the early 1990s, research on local spatial statistics began to emerge (Anselin, 1995; Ord & Getis, 1995). In the context of a local regression, geographically weighted regression (GWR) emerged in the mid-1990s to investigate how relationships between variables may vary across space (Brunsdon et al., 1996; Fotheringham et al., 2001, 2002). In GWR, each spatial unit of analysis has its own coefficient to represent its relationship between the explanatory and outcome variable because a regression is estimated for each unit of analysis to estimate local effects. Approximately a decade later, early research in spatial criminology used GWR and found that theoretical relationships do not hold in all places and that theoretical relationships switch signs across the study area (Arnio & Baumer, 2012; Cahill & Mulligan, 2007; Malczewski & Poetz, 2005).

The benefits of this research using local spatial statistics are, primarily, twofold. First, we can identify whether or not theoretical relationships have the same strength across the study region and if the predicted relationship is always in the same direction. In the first case, some places may have a stronger relationship than others and, in the second case, the direction of the theoretical relationship may change across places. As such, the global relationship represents an average effect that may be positive, but there may be places where that relationship is negative.

Second, we can identify which spatial units of analysis are driving the global results. It may be that the theoretical relationship under investigation is present in all places, but if that relationship is only present in a subset of the places within the study region, that does not mean that the theory is incorrect, rather that its predicted effects are not omnipresent. This is referred to as spatial nonstationarity, or heterogeneity, in the local results.

We analyze multiple crime types and their explanatory variables, based on social disorganization theory, across Vancouver census tracts (CTs) for 2016. With the working hypothesis being that local effects do not matter, we compare global regression output with GWR output in order to identify local patterns in the data. This shows the utility of using local rather than global spatial statistics to identify local variability in an international (non-US) context. Methodologically, we contribute to the spatial criminology comparing GWR results with the appropriate global spatial regression model, rather than with a non-spatial regression method. More generally, much of the current research uses a limited set of explanatory variables and crime types. We consider a broader set of explanatory variables and five crime types.

Related research

With the development of local regression models in geography during the late 1990s, it was almost 10 years before these methods worked their way into criminological contexts, undertaken by geographers. Overall, this research has proven to be instructive in the contexts of theory and policy, showing the limitations of global statistical methods when trying to understand spatial crime patterns. Specifically, theoretical relationships are not supported in all places.

Analyzing residential burglaries, Malczewski and Poetz (2005) found a different set of explanatory variables remained statistically significant for global and GWR results. Curiously, the signs on coefficients switched in different places or were only statistically significant in particular places: global results showed increases in average dwelling value led to decreases in residential burglaries, but only a relatively small percentage of places had this relationship in the GWR model. In fact, the coefficient switched signs to be positive in places with high levels of rentals and student populations; in other words, relatively more affluent places without a lot of guardianship because of population turnover and young populations not spending a lot of time at home had more residential burglaries. In the context of multi-family dwellings that had a positive global relationship, that relationship was always true at the local level but was not always statistically significant having its strongest effects in an around the city core. In a rural context for theft and residential burglary, Deller and Deller (2012) found that GWR results were almost always consistent with the global results, when statistically significant.

Cahill and Mulligan (2007) found similar types of results in the context of violent crime. With their GWR results, when a variable had some statistically significant effects, only between one- and two-thirds of the units of analysis had statistically significant results. Similar to Malczewski and Poetz (2005), Cahill and Mulligan (2007) had GWR results that were both consistent with global results (when statistically significant) and changing signs in different places across the city: racial heterogeneity, wealth distribution, population density, and single-member households. They also found that variables not statistically significant in the global model were statistically significant in the GWR models; in this case, those areas that were statistically significant had both positive and negative coefficients, likely “cancelling each other out” at the global level. As such, global regression analyses may be masking statistically significant local results because they represent averages of areas across a larger study region.

Considering assault and aggravated assault, Grubesic et al. (2012) found that GWR results were almost always consistent with the global results, when statistically significant, for the relationship between alcohol outlet density, social disorganization, and violence. Disaggregating violence rates by race in the context of structural disadvantage (poverty, unemployment, female head of household, and low education), Light and Harris (2012) found that GWR results all showed statistically significant variation with many of the explanatory variables, including racial diversity and racial groups, switching sign. Specifically, their measure of structural disadvantage is always positive when statistically significant in the GWR model, but most of the (race-specific) control variables switch sign depending on the area: residential instability, percent Hispanic, and young males.

Investigating the relationship between immigration and homicide, Graif and Sampson (2009) showed similar types of changes with their GWR models, but also that the GWR model always showed improvements in goodness-of-fit over global models. In the context of immigration, they found negative results at the global level but varied results when considering GWR results (percent foreign born switched signs from positive to negative depending on the area)—Andresen and Ha (2020) found similar results for a number of property crime types in the context of immigration and crime. Investigating homicide, Becker (2016) found similar results in global and GWR models with regard to statistically significant explanatory variables: concentrated disadvantage, immigrant concentration, and residential stability. Becker (2016) found that immigrant concentration had a negative global effect, but that effect varied in the GWR model, as did concentrated disadvantage. More recently, Becker (2019) found that, when statistically significant, immigrant concentration always has a negative, but spatially varying, effect on homicide; they also find that collective efficacy only partially mediates neighbourhood disadvantage, and that disadvantage becomes spatially stationary when controlling for neighbourhood change. And in an investigation of homicide across Brazilian municipalities, Ingram and da Costa (2017) found spatial variability in all but two of their explanatory variables, with many switching signs from municipality to municipality.

Considering the impact of housing foreclosures on neighbourhood crime rates, Arnio and Baumer (2012) and Zhang and McCord (2014) both found spatial nonstationarity. Moreover, GWR results were statistically significant in different places for different crime types. This has important implications for any crime prevention initiatives and is particularly interesting because the global results show insignificance.

Boivin (2018) used GWR and found positive and negative associations between the presence of people and crime. Specifically, Boivin (2018) found statistically significant positive and negative results for residential mobility, work trips, other trips, and mixed land use for an aggregate of crime. This result suggests that places with higher concentrations of people may have guardianship effects, but particularly in places used for shopping, school, and work (Boivin, 2018). Though only considering population density, Maldonado-Guzmán (2020) found that higher population density only predicted property offences; additionally, they found that the presence of temporary lodging (AirBnB) increased both property and violent crimes, varying across space.

Bunting et al. (2018), Louderback and Roy (2018), and Cowen et al. (2019) have investigated global and GWR results for various crime types and context in the Miami-Dade area. These authors consistently found a limited number of places drive the global results and that GWR results often, but not always, switch signs. Similarly, Smith and Sandova (2019) identified spatial heterogeneity of robbery rates across census tracts and block groups in Saint Louis, particularly for relationships involving race, stability, and robbery rates.

Data and methods

Data

Crime incident data for the City of Vancouver are from the Vancouver Open Data Catalogue,^{Footnote 1} that includes commercial burglary, residential burglary, theft from vehicle, theft of vehicle, and other theft^{Footnote 2} (see Table 1). In order to facilitate interpretations, we use the natural logarithm of the counts of all crime types.^{Footnote 3} These ease interpretations because β_i then represents the percentage change in the crime type based on a one-unit change in independent variable i rather than change in a crime rate with no baseline information. Locations of the criminal incidents are not specific to an address, but to the center of their respective street segment and on the appropriate side of the street segment. Because each incident is allocated to the correct side of the street, all incidents are placed in the correct spatial unit of analysis when the count of points in polygons is performed. These crime data are available from 2003 to 2020, but only 2016 crime incident data are used to match the most recent available census data. Figure 1 is provided for neighbourhood context/references in the results, below.

Table 1 Descriptive statistics, dependent (rate per 1000) and independent variables

Full size table

As noted above, GWR research in criminological contexts often considers a limited set of explanatory variables, often using data reduction techniques, such as factor analysis that strives to capture a latent variable/construct through the combination of multiple variables—see Louderback and Roy (2018), Becker (2019), and Maldonado-Guzmán (2020) for recent examples. Though pragmatic because of the volume of statistical output when using GWR, the nuances of theoretical relationships may be shrouded when using formal theoretical constructs rather than the variables used to derive them; this has shown to be of importance in the context of the impact of the economy on crime (Andresen, 2013, 2015). In order to account for these potential nuances, particularly in the context of a relatively new statistical technique, we use the theoretically-informed individual explanatory variables as predictors rather than indices measuring broader theoretical constructs. This can be particularly important from a policy standpoint to develop social programs because not taking a data reduction approach allows for the individual variable driving the policy relevant result to be better identified.

Theoretically informed variables used as predictors in the analyses below are derived from social disorganization theory (Shaw & McKay, 1942). A full review of this empirical literature is beyond the scope of the current paper, but the theoretical approach of social disorganization theory has strong support in the literature (Pratt & Cullen, 2005). According to social disorganization theory, social and economic deprivation, ethnic heterogeneity, and population turnover (residential mobility) lead to increases in crime and delinquency rates; these constructs are listed and italicized, below, to identify the associated variables. In order to account for these constructs a number of variables are derived from the Statistics Canada Census of Population: 13 variables that capture various neighbourhood (census tract) structural characteristics including socio-demographic, socio-economic, housing, income, and land use characteristics are included for analysis. See Table 1 for descriptive statistics.

Population turnover is measured considering the number of residents who have moved into the census tract within the past year (residential mobility), and the percentage of rental units to capture the transient nature of renters when compared to owners (residential mobility). Additional housing characteristics are measured with percentage of dwellings under major repair and percentage of old homes (40 years+), measuring economic deprivation. Social and economic deprivation include measures of the unemployment rate, the percentage of people with a post-secondary degree/diploma/certificate, the percentage of families that are low income, the percentage of people whose income comes from government assistance (welfare, family allowance, employment insurance, etc.), average dwelling value in thousands of 2006 dollars, average rent in hundreds of 2006 dollars, median income in thousands of 2006 dollars, and median family income in thousands of 2006 dollars—all observations are based on 2016 values but are converted to 2006 dollars in a panel data set, with this being the most recent year available for analysis. Lastly, in the context of ethnic heterogeneity, the percentages of immigrants, recent immigrants (within the past 5 years), and visible minorities are included with the degree of ethnic heterogeneity measured using the Blau (1977) Index. Though conceptually similar, many of these variables measure different phenomena, especially given immigration waves and enclave settlement; for example, some neighbourhoods may have low degrees of ethnic heterogeneity while having high degrees of immigration that are not necessarily visible minorities. Given the importance of immigration shown in the immigration and crime literature, including the use of multiple metrics immigrant, ethnic, and visible minority measures (Andresen & Ha, 2020), we consider these variables separately in the analyses below.

All crime and ecological data for Vancouver are aggregated to the census tract level. Census tracts are relatively small and stable geographic areas that tend to have a population ranging from 2500 to 8000, with an average of 4000 persons. There are 105 census tracts in the City of Vancouver. These census tracts typically have boundaries along major roads, but may be along neighbourhood level roads in places with higher population density. As noted above, because the crime data are geolocated on the correct side of the street segment, events are always allocated to the correct census tract. No edge effect corrections have been made to the calculations. Though the crime and place literature is increasingly using micro-places (street segments) as the unit of analysis (Andresen et al., 2017a, 2017b; Braga et al., 2017; Weisburd et al., 2004, 2012) to capture variability within larger units such as census tracts, there are benefits to geographically larger units such as census tracts. There is within census tract variability that cannot be captured here, but the use of census tracts allows for the incorporation of many more socio-demographic and socio-economic variables available through the census. This allows for a better assessment of our spatial theories of crime.

Geographically weighted and global regression analyses

Our global regression analyses begin with ordinary least squares (OLS) and testing for spatial autocorrelation in the residuals using Moran’s I. If Moran’s I indicates spatial autocorrelation, there is a choice between a spatial lag and spatial error model: spatial lag models filter out spatial autocorrelation only within the dependent variable whereas spatial error models filter out the spatial autocorrelation in the residuals. Conceptually, the difference between these two models is that the spatial error model is accounting for the unmeasured effect of independent variables, whereas a spatial lag model is accounting for a diffusion process; see Deane et al. (2008) for an excellent articulation of these concepts.

The choice of spatial lag or spatial error models is undertaken using Lagrange Multiplier tests with subsequent tests for remaining spatial autocorrelation in the residuals. In all cases of spatial regression models, first-order Queen’s contiguity is sufficient to filter out spatial autocorrelation in the residuals—Rook’s contiguity is not considered because we consider census tracts that only connect at a corner to still be contiguous. Because of this, higher order contiguity matrices are not necessary in these analyses. All global regression models use robust standard errors for statistical testing, though tests for heteroskedasticity is only identified in the Other Theft model using OLS. Global and local (geographically weighted) regression models are compared using AIC values.

Geographically weighted regression can be represented using the following equation:

$${y}_{i}={\beta }_{0}\left({u}_{i},{v}_{i}\right)+{\sum }_{k}{\beta }_{k}{\left({u}_{i},{v}_{i}\right)x}_{ik}+{\varepsilon }_{i}$$

(1)

where y_i represents the value for a crime type at location i, β₀(u_i, v_i) represents the constant for location i, β_k(u_i, v_i) represents the estimated parameter for independent variable x_k at location i, and ε_i is the independently and identically distributed residual at location i. The vector of parameters is estimated as follows:

$$\widehat{\beta }\left({u}_{i}, {v}_{i}\right)= {\left({X}^{T}W\left({u}_{i}, {v}_{i}\right)X\right)}^{-1}{X}^{T}W\left({u}_{i}, {v}_{i}\right)y$$

(2)

where $\widehat{\beta }\left({u}_{i}, {v}_{i}\right)$ is the vector of estimates for β at all locations i, $W\left({u}_{i}, {v}_{i}\right)$ is an n x n matrix that has diagonal elements denoting the weighting for all locations for point i (Brunsdon et al., 1996; Fotheringham et al., 2001, 2002). In all cases we use an adaptive kernel. Leung et al. (2000) is used to assess the value of using geographically weighted regression: statistical significance indicated no value in accounting for spatial nonstationarity.

The output from these regressions allows for the mapping of estimated parameters for each independent variable in the regression because a regression is estimated for each unit of analysis. The minimum, maximum, and quartiles are presented in the output table, below, but do not indicate the statistical significance of those estimated parameters. It is critically important to note that any spatial variation identified may not be statistically significant and this should be tested before presenting results. Statistical significance is indicated in the output, along with the percentage of census tracts (if any) that are statistically significant. In order to map both statistical significance and the various magnitudes of the estimated parameters, only estimated parameters statistically significant at the 5 percent level are represented on the maps presented in the discussion, rather than mapping both the spatially varying parameters and z-statistics separately.

With regard to multicollinearity, Table 2 shows that very few of the independent variables have (nonparametric) correlation coefficients greater than 0.80. Aside from immigrant percent (highly correlated with recent immigrants and visible minorities, expected results), only post-secondary education and government assistance are correlated at a level (marginally) greater than 0.80. Moreover, based on variance inflation factors (VIFs), multicollinearity is generally not shown to be an issue—VIFs are based on a variable’s collinearity with all other variables in the regression, with values greater than 10 being a common threshold for concern (O’Brien, 2007). It is important to note that rented, post-secondary, government assistance, average dwelling value, median family income, immigrants, and visible minorities have VIFs greater than 10. However, rented, government assistance, average dwelling value, and median family income are all statistically significant in at least one of the global models with statistically significant results in the local models for the others. In order to test the impact of highly collinear variables, these variables were removed from the analyses and data reduction techniques were investigated. Removing these variables had very little qualitative impact on the results, with no impact on the results reported below. Data reduction techniques did not generate clean components (poor factor loadings, low alpha values, and low variance explained, potentially leading to omitted variable bias) except for those variables that did not impact the results reported below when removed from the analyses. Also, from a practical perspective, theoretical constructs generated using data reduction techniques do not allow for identifying what is actually driving the empirical results. This is problematic for those who wish to use such output for policy formation. Given that testing a specific theoretical construct is not the goal of the present research and that avoiding omitted variable bias is a greater concern (through removing variables or using data reduction), at this point in the analysis there are no general concerns for multicollinearity in the results presented below. Moreover, it is known that geographically weighted regression is not sensitive to multicollinearity, despite this common misconception (Fotheringham & Oshan, 2016).

Table 2 Nonparametric correlations, independent variables

Full size table

All analyses are undertaken using R: A Language and Environment for Statistical Computing http://www.r-project.org/, using the spatialreg (global regression analyses) and spgwr (geographically weighted regression) libraries.

Results

The global and local regression results are presented in Table 3. Goodness-of-fit statistics are presented for both the global spatial and geographically weighted regression results as well as the type of global regression model (spatial error, spatial lag, or OLS) as is the type and order of contiguity matrix. Both R² and Adjusted-R² values for OLS versions of the global models show significant improvements in variance explained when compared to the local models—the GWR quasi-R² values represent the average R² for each set of local models; a similar result is present for the local versus global AIC value comparisons. All global models are statistically significant (Wald or F-statistic), and the Leung et al. (2000) test shows that we cannot reject of the null hypothesis of GWR being a better fit in all models. The asterisks on the minimum GWR values indicate which variables have some local effects that are statistically significant, not necessarily that the minimum values are statistically significant, allowing for comparisons with the global models. Lastly, the Moran’s I test for spatial autocorrelation on the final model is reported. As shown in these tables, there is no remaining spatial autocorrelation in these models, with the same result for the geographically weighted regressions.

Table 3 Geographically weighted regression and global regression results, all crime types

Full size table

One of the interesting results shown here, consistent with previous research, is that when a variable is statistically significant in the global model and in the local model, only a subset of the census tracts are statistically significant at the local level. As shown in Table 3, there are some local models in which all census tracts for a variable are statistically significant, but that is not the norm. As such, a subset of the census tracts is driving the results at the global level, having potential impacts for policy development and implementation, discussed above. Additionally, there are a number of results that show statistically insignificant results at the global level but (at least) some statistically significant results at the local level. This shows that global regression model may wash out the effect of a variable when there are only a few of the units of analysis that exhibit statistically significant effects.

The results for commercial burglary show population change, major repairs, and government assistance being statistically significant for the global model; population change has a positive relationship with commercial burglary with major repairs and government assistance having negative relationships. The GWR results have population change, major repairs, and government assistance being statistically significant—see Fig. 2 for mapped output of government assistance and low income estimated parameters with only statistically significant results (at the 5 percent level) being shaded. Additionally, low income and ethnic heterogeneity both have positive relationships with commercial burglary for some census tracts. This ties back to one of the benefits of local spatial statistics being able to identify statistically significant results in a subset of census tracts even when the same variables are not statistically significant at the global level, showing the utility of GWR.

For residential burglary, population change, old houses, low income, and Aboriginal have positive relationships with residential burglary, whereas variables representing rented homes, major repairs, government assistance, and average rent have negative relationships. In addition to all of these variables, the GWR model results showed statistical significance for the unemployment rate (negative) and immigrants (positive)—see Fig. 3 for mapped results of population change and unemployment.

The global results for theft from vehicle retain population change and low income (positive relationships), and major repairs, government assistance, and average dwelling value (negative relationships). Similar to the other crime types, the GWR model maintains the statistical significance and sign of these independent variables as well as rented homes and immigrants, both negative relationships with theft from vehicle—see Fig. 4a for mapped results of major repairs.

Theft of vehicle retains few variables in both the global and GWR models. This may partially be due to the significant drop of this crime type in Vancouver over the past 20 years (Hodgkinson et al., 2016). Regardless, theft of vehicle has differences in the global and GWR results. Variables representing population change and Aboriginal identity have positive relationships with theft of vehicle,^{Footnote 4} whereas major repairs, government assistance, and average dwelling value have negative relationships. In the GWR model, variables representing major repairs and Aboriginal identity are no longer statistically significant in any of the census tracts, but low income is positive and statistically significant in the GWR model. Despite the change in the pattern of retained independent variables, the AIC value for the GWR model still shows a clear improvement over the global model with fewer remaining statistically significant independent variables.

Lastly, there are the global and GWR results for other theft. Only population change has a statistically significant and positive relationship with other theft, whereas major repairs, government assistance, average dwelling value, and median family income have negative relationships. In the GWR model, population change and government assistance are no longer statistically significant for any census tract, with recent immigrants having a statistically significant and positive relationship—see Fig. 4b for mapped results of major repairs. Similar to theft of vehicle, despite the change in the pattern of retained independent variables, the AIC value for the GWR model still shows a clear improvement over the global model with fewer remaining statistically significant independent variables.

Discussion

The results presented above show the benefits of using GWR when considering spatially referenced crime and ecological data. Overall, the GWR models show an improvement over the global models for all property crime types, based on AIC and Leung et al. (2000) statistics. Though there are some changes in the independent variables that are statistically significant in global and GWR models (theft of vehicle and other theft), there are no qualitative changes in the GWR results—there may be some GWR parameters that are opposite in sign when compared to the global parameter, but those GWR parameters are not statistically significant. These overall results are shown in Table 4.

Table 4 Geographically weighted regression results, results summary

Full size table

The unemployment rate is only statistically significant for one crime type and only in the GWR model. This alone shows the importance of considering spatial heterogeneity and how the presence of a small number of local relationships can be shrouded when results are only considered in a global context. Population change, rented homes, major repairs, low income, government assistance, and average dwelling value are statistically significant in many of the global and GWR models. Old homes, average rent, median family income, Aboriginal, recent immigrants, and ethnic heterogeneity are statistically significant in at least two of the global or GWR models. Variables representing recent movers, post-secondary education, and visible minorities are not statistically significant in any of the global or GWR models. And only the percentage of immigrants in a census tract switches signs from one crime type to another: local residential burglary (positive) and local theft from vehicle (negative).

When considering the GWR results, it is important to note that the local parameters for a variable are, at times, statistically significant and the same sign in all spatial units of analysis. This is important to note from both a theoretical and a policy implementation/evaluation standpoint because whether a potential policy variable impacts an outcome everywhere or only a portion of all places can determine if theoretical relationships hold (even partially) or if a policy intervention was successful in the places it was supposed to be successful. Such a situation is found for population change (theft from vehicle), rented homes (residential burglary), major repairs (theft from vehicle), low income (theft from vehicle), and average dwelling value (theft of vehicle).

With regard to the spatial heterogeneity, there are a number of interesting results. As shown in Fig. 2, government assistance and low income have highly localized effects for commercial burglary. The central northern peninsula at the top of the map is the central business district in Vancouver, with the areas immediately to the east being the Downtown Eastside, the poorest urban neighbourhood in Canada (Barnes & Sutton, 2009). This shows that increases in the percentage of low-income families lead to increases in commercial burglary, but only in places that already have high levels of low income. This in no way implies that increases in low income in other areas does not have an impact on families and their neighbourhoods, but for increases in low income to have an impact on commercial burglary, that impact is only present in places where low income is already at higher levels. This may be some form of a multiplier effect with regard to the impact of poverty in a neighbourhood (Oreopoulos, 2008), or simply shoe the need to consider the interactive nature of constructs within social disorganization theory when understanding crime patterns (Kubrin et al., 2022). The corresponding result here is that the impact of increases in government assistance leads to decreases in commercial burglary in the same places. Moreover, the magnitude impact of increases in government assistance is greater than increases in low income. As such, spatially-targeted government assistance may not only be able to reduce the percentage of low income families, but counter the criminological effects from existing/remaining low income that leads to financial strain for those families.

Two interesting results emerge for residential burglary (see Fig. 3): population change and unemployment. Both of these variables represent aspects of social disorganization theory, with the impact of unemployment on crime also being dependent on the time frame considered, short- versus long-run (Cantor & Land, 1985). Population change over the previous 5 years should capture residential instability and the inability to develop social bonds (Sampson et al., 1997). The positive global parameter supports this, but the fact that the local effects, also positive, are only in the southern and south-eastern portion of the city is instructive. Specifically, only in the places that have lower levels of population change (specifically fewer rental homes) do increases in population turnover lead to increases in residential burglary. As such, areas that tend to systematically have higher levels of population turnover because of being close to a university, in a trendy neighbourhood (Kitsilano), or the central business district do not have impacts from increases in that population turnover, only places that consistently have lower levels of turnover. Moreover, these latter areas have also seen increases in building security in recent years leading to decreases in residential burglary in these areas (Hodgkinson & Andresen, 2019).

Regarding the unemployment rate, increases in unemployment are expected to be related to increases in criminal activity in a social disorganization perspective. However, as put forth by Cantor and Land (1985), and subsequent research (Andresen, 2012, 2013; Phillips & Land, 2012), the short-run effects of increases in unemployment are expected to decrease crime because of increased guardianship through people spending more time at home and spending less money. This is found for residential burglary in Vancouver. In fact, the unemployment rate is only statistically significant for local residential burglary in all of the analyses presented here. As such, because of the relatively low levels of unemployment at the census tract level across Vancouver, its impact on spatial property crime patterns are found to be minimal.

Lastly, though more GWR maps are available to the interested reader, there are the localized effects of major repairs on theft from vehicle and other theft. Houses under major repair are often thought to be an indicator of dilapidated housing and, consequently, representing lower levels of socio-economic status. However, in a city like Vancouver, the presence of major repairs is often related to the refurbishing of older homes in the process of gentrification—see Lees et al. (2007) for a discussion of the process of gentrification. Within Vancouver, gentrification began on the west side of the city and dominantly continued in that area until the turn of the twentieth century (Ley & Dobson, 2008). However, in more recent decades that gentrification has been moving east to relatively more affordable areas of the city.

As shown in Fig. 4a, for theft from vehicle, the local parameters for major repairs are statistically significant and negative for the whole city but the magnitude of the impacts are greater in the eastern areas of Vancouver that are experiencing more recent and, hence, currently greater magnitude levels of gentrification. In the context of other theft, Fig. 4b, the statistically significant and negative effects from increases in major repairs are only present in those areas that are undergoing a lot of gentrification. This shows the importance of understanding local context and a need for future research in this area. This finding of decreases in crime resulting from gentrification processes is known in the criminological literature (MacDonald & Stokes, 2020). However, it is also important to note that the gentrification process has been shown to have negative impacts of the health of marginalized populations living in those areas, specifically in Vancouver (Goldenberg et al., 2020).

Though much of the criminological literature that uses GWR investigates property crime, one of the limitations in the current analyses is that no violent crime types are considered. This limits the generalizability with US-based research. Similar to other research in this area, only one year of data, 2016, are analyzed. A number of our independent variables have high degrees of multicollinearity. However, as discussed, most of these variables prove to be statistically significant in the global and local results, showing the importance of variable inclusion rather than risking omitted variable bias. And, of course, only official police and census data are used in the current analyses. This may be problematic for police data, as it is for all research based on police data, because of the well-known dark figure of crime (Perreault, 2015), but property crime types do have higher reporting rates than violent crime types (Andresen, 2020); however, it may be the case that some of the relationships found here are mediated by under-reporting of crime. In the context of census data, similar to most research in this area, census data are proxies for theoretical constructs, particularly for social disorganization theory (Sampson & Groves, 1989).

Regardless of these limitations, we extend the literature through an international application considering 5 crime types and large number of theoretically informed explanatory variables that allows for a more nuanced investigation of spatial variations in crime patterns. Though there is international research using GWR, cited above, more research in this area is necessary. This is important for (social) science and generalizability, more generally. Though our spatial theories of crime fare well in international contexts, we have shown here that local knowledge is important for understanding the local results. As such, we must be careful when generalizing and need more research in different contexts.

In addition to addressing the limitations, stated above, future research should continue to be applied in other international contexts. Moreover, we need a better understanding of why theoretical expectations are only present in particular places (despite emerging as globally significant). We also need a better understanding of why theoretical relationships change directions in some places despite global relationships being consistent with theoretical predictions; this did not occur in the current research but does occur in this literature, more broadly. Only with a better understanding of these nuances can we move forward with our spatial theories of crime and, potentially, understand why and where they continue to operate as expected and why and where they need to change.

Conclusion

Overall, these analyses show that there is significant local variability in all cases, though that variability has different ranges for different crime types. Similar to previous research in this area, a subset of the areas (CTs) under analysis drive the global results. However, unlike much of the US research, local level results, when statistically significant, are always consistent (same sign) with the global regression output. This shows that the presence of spatial heterogeneity does not necessarily mean that relationships change direction across space, but only their statistical significance and magnitude changes. Moreover, this is shown in a more international (Canadian) context.

Availability of data and materials

Please contact the author.

Notes

https://data.vancouver.ca/datacatalogue/crime-data.htm.
Other theft refers to forms of theft not in the other four categories: theft of a mobile phone, computer, purse/wallet, shoplifting, and so on.
The use of crime counts, and their natural logarithm also avoids complications from using inappropriate denominators in crime rate calculations (Andresen, 2011).
This result must be taken in the Canadian context as a result of colonialism, institutional racism, and the use of residential schools that have found to be significant risk factors for criminalizing a marginalized population (Monchalin, 2010; Shen & Andresen, 2021).

References

Andresen, M. A. (2011). The ambient population and crime analysis. Professional Geographer, 63(2), 193–212.
Article Google Scholar
Andresen, M. A. (2012). Unemployment and crime: A neighbourhood level panel data approach. Social Science Research, 41(6), 1615–1628.
Article Google Scholar
Andresen, M. A. (2013). Unemployment, business cycles, crime, and the Canadian provinces. Journal of Criminal Justice, 41(4), 220–227.
Article Google Scholar
Andresen, M. A. (2015). Unemployment, GDP, and crime: The importance of multiple measurements of the economy. Canadian Journal of Criminology and Criminal Justice, 57(1), 35–58.
Article Google Scholar
Andresen, M. A., & Ha, O. K. (2020). Spatially-varying relationships between immigration measures and property crime types in Vancouver census tracts, 2016. British Journal of Criminology, 60(5), 1342–1367.
Article Google Scholar
Andresen, M. A., Curman, A. S. N., & Linning, S. J. (2017a). The trajectories of crime at places: Understanding the patterns of disaggregated crime types. Journal of Quantitative Criminology, 33(3), 427–449.
Article Google Scholar
Andresen, M. A., Linning, S. J., & Malleson, N. (2017b). Crime at places and spatial concentrations: Exploring the spatial stability of property crime in Vancouver BC, 2003–2013. Journal of Quantitative Criminology, 33(2), 255–275.
Article Google Scholar
Anselin, L. (1995). Local indicators of spatial association — LISA. Geographical Analysis, 27(2), 93–115.
Article Google Scholar
Arnio, A. N., & Baumer, E. P. (2012). Demography, foreclosure, and crime: Assessing spatial heterogeneity in contemporary models of neighbourhood crime rates. Demographic Research, 26, 449–488.
Article Google Scholar
Barnes, T., & Sutton, T. (2009). Situating the new economy: Contingencies of regeneration and dislocation in Vancouver’s inner city. Urban Studies, 46(5–6), 1247–1269.
Article Google Scholar
Becker, J. H. (2019). Within-neighbourhood dynamics: Disadvantage, collective efficacy, and homicide rates in Chicago. Social Problems, 66(3), 428–447.
Article Google Scholar
Becker, J. H. (2016). The dynamics of neighbourhood structural conditions: The effects of concentrated disadvantage on homicide over time and space. City and Community, 15(1), 64–82.
Article Google Scholar
Blau, P. M. (1977). Inequality and heterogeneity. Free Press.
Google Scholar
Boivin, R. (2018). Routine activity, population(s) and crime: Spatial heterogeneity and conflicting propositions about the neighbourhood crime-population link. Applied Geography, 95, 79–87.
Article Google Scholar
Braga, A. A., Andresen, M. A., & Lawton, B. (2017). The law of crime concentration at places: Editors’ introduction. Journal of Quantitative Criminology, 33(3), 421–426.
Article Google Scholar
Brunsdon, C., Fotheringham, A. S., & Charlton, M. E. (1996). Geographically weighted regression: A method for exploring spatial nonstationarity. Geographical Analysis, 28(4), 281–298.
Article Google Scholar
Bunting, R. J., Chang, O. Y., Cowen, C., Hankins, R., Langston, S., Warner, A., Yang, X., Louderback, E. R., & Roy, S. S. (2018). Spatial patterns of larceny and aggravated assault in Miami-Dade County, 2007–2015. Professional Geographer, 70(1), 34–46.
Article Google Scholar
Cahill, M., & Mulligan, G. (2007). Using geographically weighted regression to explore local crime patterns. Social Science Computer Review, 25(2), 174–193.
Article Google Scholar
Cantor, D., & Land, K. C. (1985). Unemployment and crime rates in the post World War II United States: A theoretical and empirical analysis. American Sociological Review, 50(3), 317–332.
Article Google Scholar
Cowen, C., Louderback, E. R., & Roy, S. S. (2019). The role of land use and walkability in predicting crime patterns: A spatiotemporal analysis of Miami-Dade County neighbourhoods, 2007–2015. Security Journal, 32(3), 264–286.
Article Google Scholar
Deane, G., Messner, S. F., Stucky, T. D., & McGeever, & Kubrin, C.E. (2008). Not ‘islands, entire of themselves’: Exploring the spatial context of city-level robbery rates. Journal of Quantitative Criminology, 24(4), 363–380.
Article Google Scholar
Deller, S., & Deller, M. (2012). Spatial heterogeneity, social capital, and rural larceny and burglary. Rural Sociology, 77(2), 225–253.
Article Google Scholar
Fotheringham, A. S., & Oshan, T. M. (2016). Geographically weighted regression and multicollinearity: Dispelling the myth. Journal of Geographical Systems, 18(4), 303–329.
Article Google Scholar
Fotheringham, A. S., Brunsdon, C., & Charlton, M. (2002). Geographically weighted regression: The analysis of spatially varying relationships. Wiley.
Google Scholar
Fotheringham, A. S., Charlton, M., & Brunsdon, C. (2001). Spatial variations in school performance: A local analysis using geographically weighted regression. Geographical and Environmental Modelling, 5(1), 43–66.
Article Google Scholar
Goldenberg, S. M., Amram, O., Braschel, M., Moreheart, S., & Shannon, K. (2020). Urban gentrification and declining access to HIV/STI, sexual health, and outreach services amongst women sex workers between 2010–2014: Results of a community-based longitudinal cohort. Health & Place, 62, 102288.
Article Google Scholar
Graif, C., & Sampson, R. J. (2009). Spatial heterogeneity in the effects of immigration and diversity on neighbourhood homicide rates. Homicide Studies, 13(3), 242–260.
Article Google Scholar
Grubesic, T. H., Mack, E. A., & Kaylen, M. T. (2012). Comparative modeling approaches for understanding urban violence. Social Science Research, 41(1), 92–109.
Article Google Scholar
Hodgkinson, T., & Andresen, M. A. (2019). Changing spatial patterns of residential burglary and the crime drop: The need for spatial data signatures. Journal of Criminal Justice, 61, 90–100.
Article Google Scholar
Hodgkinson, T., Andresen, M. A., & Farrell, G. (2016). The decline and locational shift of automotive theft: A local level analysis. Journal of Criminal Justice, 44(1), 49–57.
Article Google Scholar
Ingram, M. C., & da Costa, M. M. (2017). A spatial analysis of homicide across Brazilian municipalities. Homicide Studies, 21(2), 87–110.
Article Google Scholar
Kubrin, C. E., Branic, N., & Hipp, J. R. (2022). (Re)conceptualizing neighbourhood ecology in social disorganization theory: From a variable-centered approach to a neighbourhood-centered approach. Crime & Delinquency, 68(11), 2008–2032.
Article Google Scholar
Lees, L., Slater, T., & Wyly, E. K. (2007). Gentrification. Routledge.
Google Scholar
Leung, Y., Mei, C., & Zhang, W.-X. (2000). Statistical tests for spatial nonstationary based on the geographically weighted regression model. Environment and Planning A, 32(1), 9–32.
Article Google Scholar
Ley, D., & Dobson, C. (2008). Are there limits to gentrification? The contexts of impeded gentrification in Vancouver. Urban Studies, 45(12), 2471–2498.
Article Google Scholar
Light, M. T., & Harris, C. T. (2012). Race, space, and violence: Exploring spatial dependence in structural covariates of white and black violent crime in US counties. Journal of Quantitative Criminology, 28(4), 559–586.
Article Google Scholar
Louderback, E. R., & Roy, S. S. (2018). Integrating social disorganization and routine activity theories and testing the effectiveness of neighbourhood crime watch programs: Case study of Miami-Dade County, 2007–15. British Journal of Criminology, 58(4), 968–992.
Article Google Scholar
MacDonald, J. M., & Stokes, R. J. (2020). Gentrification, land use, and crime. Annual Review of Criminology, 3, 121–138.
Article Google Scholar
Malczewski, J., & Poetz, A. (2005). Residential burglaries and neighbourhood socioeconomic context in London, Ontario: Global and local regression analysis. Professional Geographer, 57(4), 516–529.
Article Google Scholar
Maldonado-Guzmán, D. J. (2022). Airbnb and crime in Barcelona (Spain): Testing the relationship using a geographically weighted regression. Annals of GIS, 28(2), 147–160.
Article Google Scholar
Monchalin, L. (2010). Canadian Aboriginal peoples victimization, offending and its prevention: Gathering the evidence. Crime Prevention and Community Safety, 12(2), 119–132.
Article Google Scholar
O’Brien, R. M. (2007). A caution regarding rules of thumb for variance inflation factors. Quality & Quantity, 41(5), 673–690.
Article Google Scholar
Ord, J. K., & Getis, A. (1995). Local spatial autocorrelation statistics: Distributional issues and an application. Geographical Analysis, 27(4), 286–306.
Article Google Scholar
Oreopoulos, P. (2008). Neighbourhood effects in Canada: A critique. Canadian Public Policy, 34(2), 237–258.
Article Google Scholar
Perreault, S. (2015). Criminal victimization in Canada, 2014. Ottawa, ON: Statistics Canada.
Google Scholar
Phillips, J. A., & Land, K. C. (2012). The link between unemployment and crime rate fluctuations: An analysis at the county, state, and national levels. Social Science Research, 41(3), 681–694.
Article Google Scholar
Pratt, T. C., & Cullen, F. T. (2005). Assessing macro-level predictors and theories of crime: A meta-analysis. Crime and Justice, 32, 373–450.
Article Google Scholar
Sampson, R. J., Raudenbush, S. W., & Earls, F. (1997). Neighbourhoods and violent crime: A multilevel study of collective efficacy. Science, 277(5328), 918–924.
Article Google Scholar
Shaw, C. R., & McKay, H. D. (1942). Juvenile delinquency and urban areas: A study of rates of delinquency in relation to differential characteristics of local communities in American cities. University of Chicago Press.
Google Scholar
Shen, J.-L., & Andresen, M. A. (2021). A tale of two theories: Whither social disorganization theory and the routine activities approach? Canadian Journal of Criminology and Criminal Justice, 63(2), 1–22.
Article Google Scholar
Smith, T. A., & Sandova, J. S. (2019). Examining the local spatial variability of robberies in Saint Louis using a multi-scale methodology. Social Sciences, 8(2), 50.
Article Google Scholar
Weisburd, D., Bushway, S., Lum, C., & Yang, S.-M. (2004). Trajectories of crime at places: A longitudinal study of street segments in the City of Seattle. Criminology, 42(2), 283–321.
Article Google Scholar
Weisburd, D., Groff, E. R., & Yang, S.-M. (2012). The criminology of place: Street segments and our understanding of the crime problem. Oxford University Press.
Book Google Scholar
Zhang, H., & McCord, E. S. (2014). A spatial analysis of the impact of housing foreclosures on residential burglary. Applied Geography, 54, 27–34.
Article Google Scholar

Download references

Acknowledgements

I thank the anonymous reviewers for their helpful comments.

Funding

No outside funding was used to support this work.

Author information

Authors and Affiliations

School of Criminology, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada
Martin A. Andresen

Authors

Martin A. Andresen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The author read and approved the final manuscript.

Author information

Martin A. Andresen is a Professor in the School of Criminology at Simon Fraser University. His research interests are in crime and place, geography of crime, spatial–temporal criminology, and spatial analysis.

Corresponding author

Correspondence to Martin A. Andresen.

Ethics declarations

Competing interests

The author declares that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Andresen, M.A. Theorizing globally, but analyzing locally: the importance of geographically weighted regression in crime analysis. Crime Sci 11, 10 (2022). https://doi.org/10.1186/s40163-022-00173-0

Download citation

Received: 07 May 2022
Accepted: 06 October 2022
Published: 17 October 2022
DOI: https://doi.org/10.1186/s40163-022-00173-0

Theorizing globally, but analyzing locally: the importance of geographically weighted regression in crime analysis

Abstract

Introduction

Related research

Data and methods

Data

Geographically weighted and global regression analyses

Results

Discussion

Conclusion

Availability of data and materials

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Author information

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Crime Science

Contact us

Theorizing globally, but analyzing locally: the importance of geographically weighted regression in crime analysis

Abstract

Introduction

Related research

Data and methods

Data

Geographically weighted and global regression analyses

Results

Discussion

Conclusion

Availability of data and materials

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Author information

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Crime Science

Contact us