8

Redrawing Hot Spots of Crime in Dallas, Texas

 3 years ago
source link: https://journals.sagepub.com/doi/full/10.1177/1098611120957948
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

Abstract

In this work we evaluate the predictive capability of identifying long term, micro place hot spots in Dallas, Texas. We create hot spots using a clustering algorithm, using law enforcement cost of responding to crime estimates as weights. Relative to the much larger current hot spot areas defined by the Dallas Police Department, our identified hot spots are much smaller (under 3 square miles), and capture crime cost at a higher density. We also show that the clustering algorithm captures a wide array of hot spot types; some one or two addresses, some street segments, and others an agglomeration of larger areas. This suggests identifying hot spots based on a specific unit of aggregation (e.g. addresses, street segments), may be less efficient than using a clustering technique in practice.

Introduction

While targeting police resources at micro place hot spots has been one of the most successful policing interventions to reduce crime (Braga et al., 2019), it is still an open question as to how to construct those hot spots. Using open crime data from Dallas, Texas, this article provides an example of constructing long term micro place hot spots. Using historical data to identify hot spots of crime, we show how our identified clusters capture a higher density of crime costs than current Dallas hot spot areas on future crimes.

While there are a plethora of prior articles examining the predictive ability of different hot spot methods (Chainey et al., 2008; Drawve, 2016; Levine, 2008; Van Patten et al., 2009), there are three novel contributions of this work. First, we illustrate the use of a hierarchical clustering technique, DBSCAN (Campello et al., 2013), to formulate different contiguous hot spot areas. A point of contention in prior work is the correct spatial unit of analysis to conduct analysis on. For example, several scholars often suggest street segments (Rosser et al., 2017; Weisburd et al., 2004), some suggest street segments and intersections (Braga et al., 2010; Wheeler et al., 2016), and others have suggested targeting specific addresses (Eck et al., 2007; Lee & Eck, 2019; Sherman et al., 1989). Using a hierarchical clustering technique allows one to avoid needing to specify such a spatial unit of analysis up front, and can be used on address based crime data to identify a reasonable resolution for a particular hot spot. In the micro place hot spots we identify, we subsequently show some are one or two addresses clustered together, others are a street segment, and others are an agglomeration of several nearby street segments.

The second novel contribution of this work is to construct hot spots using law enforcement cost of responding to crime estimates, which is related to prior research creating hot spots using crime harm scores (Macbeth & Ariel, 2019; Ignatans & Pease, 2016; Ratcliffe, 2015; Sherman et al., 2016). Crime harm has been characterized in prior research by either asking survey respondents to rank different crimes (Wolfgang, 1985), or by translating sentencing decisions to create harm weights (Ratcliffe, 2015), with the ultimate goal that policing resources are better allocated relative to the harm a particular crime impacts on the community, as opposed to counting all crime equal (Sherman & Cambridge University Associates, 2020).

In place of crime harm scores however, here we use average Uniform Crime Report (UCR) cost of crime estimates (per law enforcement) for Texas as weights in the DBSCAN algorithm (Hunt et al., 2019). This provides more interpretable hot spot summaries in terms of direct cost of crime estimates relevant to police departments. These cost of crime estimates can be used to provide more actionable information in terms of cost-benefit analysis for police departments when planning a hot spots policing strategy.

The third novel contribution of the study is to evaluate the predictive accuracy of our identified cost of crime weighted hot spots using historical data to predict future crimes. While prior work has mapped harm spots (Curtis-Ham & Walton, 2017; Fenimore, 2019; Norton et al., 2018; Weinborn et al., 2017), the majority of that work has simply focused on crime concentration, and has not assessed the predictive accuracy of a technique to create such harmspots (for an exception, see Macbeth & Ariel, 2019). Additionally, most of that work has mapped harm spots at a specific pre-chosen unit of analysis (Curtis-Ham & Walton, 2017; Mitchell, 2019; Norton et al., 2018; Weinborn et al., 2018, for an exception see Fenimore, 2019 who used weighted kernel density estimates). We show here in our constructed hot spot areas there is no single spatial unit of analysis that would consistently cover the same areas we identified.

The study area of Dallas was chosen for one main reason. They have historically identified long term hot spot areas, Target Area Action Grids (TAAG) (Ferguson, 2017; Worrall & Wheeler, 2019). These hot spot areas on their face do not conform with contemporary advice on targeting micro place hot spots of crime. TAAG areas cover 64 square miles of Dallas, around 20% of the city. We show that TAAG areas capture around 60% of the total crime cost in the city. Weisburd’s law of crime concentration states that approximately 5% of the places within a city typically capture around 50% of the total amount of crime (Weisburd, 2015). Thus it suggests one can identify a much smaller subset of the city and capture crime at a much higher density.

A common statistic used to evaluate crime forecasts is the Predictive Accuracy Index (PAI) (Chainey et al., 2008). The PAI is simply the ratio of the proportion of crime captured for the numerator and the percent of the area covered by the prediction in the denominator. A simple extension this statistic is to consider not just the proportion of total crimes captured, but the total cost of crime captured. We show that current TAAG areas in Dallas have cost-of-crime PAI's of around 3 (60% crime cost/20% of area of city), while Weisburd's law of crime concentration suggests a reasonable baseline for PAI statistics should be 10 (50% of crime cost/5% of the area). Our identified hot spot areas are much smaller, cover less than 1% of the area of the city (around 2.5 square miles), capture 12% of the total cost of crime in the city, and subsequently have a combined PAI of 16.

Literature Review

Creating Hot Spots of Crime

Work that identified that crime clusters at small locations, such as specific addresses (Sherman et al., 1989), or street segments (Weisburd et al., 2004) was likely the innovation that spurred subsequent work on hot spots policing (Braga et al., 2019; Lum et al., 2011). This innovation was particularly important given the lack of prior successes of police interventions to reduce crime (Bayley, 1994). In retrospect, those poor results are likely attributable to police targeting of much larger areas (Larson, 1975), which has been consistently shown to be less successful than targeting micro place hot spots (Lum et al., 2011).

However, despite the popularity of hot spots policing, there is not a single, unanimous definition of what constitutes a hot spot of crime (Taylor, 2015). This is because a hot spot is ultimately a data based definition, not a physical entity that one can go and show in the physical world. Thus while it is easy to show that a small number of micro locations have a relatively large number of crime counts (Weisburd, 2015), it is much harder to delineate the boundaries of what is inside or what is outside of a hot spot (Taylor, 2015). Subsequently there have been a myriad of different ways using data to define hot spots.

Sherman et al. (1989) conducted a study on hot spots in Minneapolis which confirmed that relatively few areas produce the most calls to police. They found that half of all calls to police were in only 3% of the places (addresses or intersections) within the city. Subsequent work by David Weisburd has stated a consistent law of crime concentration, that around 5% of the places in a city (where places are often defined as street segments) contain around 50% of the crime (Weisburd, 2015). These approaches to identifying hot spots rely on one to pre-specify a particular unit of analysis (e.g. address, street segment, census block, patrol area, etc.), as well as a particular ranked threshold (e.g. the top 5% of areas, or the top 10 areas) to set a cut-off for what constitutes a hot spot.

Chainey et al. (2008) identified a number of different mapping techniques that can be used to identity hot spots of crime that don’t per se rely on a pre-defined unit of analysis or a simple threshold (e.g. top 5% of the area). Examples include point mapping, clustering census areas, or kernel density estimation. The depiction of the hot spots on a map may vary depending of the level of concentration for the analysis. For example, if the focus is on streets, lines may be used, while if it was on areas/neighborhoods, shaded areas or gradients may be more appropriate (Eck et al., 2005). While these techniques potentially avoid relying on some arbitrary decisions (such as a spatial unit of analysis), they still rely on other arbitrary decisions, such as where to set the cut-off for a kernel density threshold to set the area for a hot spot. Frequently individuals use rules such as a z-score above 3 to make such decisions for continuous predictions (Eck et al., 2005; Chainey et al., 2008; Drawve, 2016).

According to Haberman (2017), there are several different ways to identify hot spots. Before beginning, the size of the hot spot areas needs to be established. This can range from a large police beat, to a smaller grid area, and even to an even smaller microplace consisting of an intersection or street segment. It is important to then determine the criteria that may qualify an area as a hot spot. A common method includes the establishment of a minimum number of crimes that must have occurred in that area. Separately, selecting the locations with the highest number of crimes after rank-ordering the areas can be used to establish hot spots. Finally, spatial statistics may be used to identify clusters of crime, therefore revealing hot spots. Using one of these methods, most researchers have identified hot spots of all crime within an urban area, while others have focused on a specific type of crime, such as robbery or motor vehicle thefts (Haberman, 2017).

Given that how to draw hotspots is arbitrary, there has been extensive work on analyzing the accuracy of different hot spot mapping techniques (Chainey et al., 2008; Drawve, 2016; Flaxman et al., 2019; Lee et al., 2020; Mohler & Porter, 2018). Here we focus on one of the more common metrics, the Predictive Accuracy Index (PAI) (Kounadi et al., 2020).

Measuring Hot Spot Accuracy Using the Predictive Accuracy Index

The Prediction Accuracy Index (PAI), is a technique used to calculate the overall accuracy of a hot spot, but take into account the size of the area predicted (Chainey et al., 2008; Drawve, 2016). For example, if one simply used the number of crimes captured in a predicted area, or similarly the hit rate of whether any crime occurred in a predicted area (for small temporal intervals), it would be unfair to compare a prediction that encompassed a much larger area than a smaller predicted hot spot. Using the hit rate, you would gain the best prediction by predicting the entire study area, which is ultimately not helpful in identifying small hot spots of crime.

The PAI metric however divides the total crimes captured in an identified hot spot with the size of the area being considered. The PAI formula can be written as:

𝑛/𝑁𝑎/𝐴=% Crime Captured% Area of City=PAIn/Na/A=%CrimeCaptured%AreaofCity=PAI
where n is the number of crimes captured in the identified hot spot, and N is the total number of crimes in the entire study area, and this is the numerator for the PAI metric. The denominator is defined by a, the area of the hot spot, divided by A, the area of the entire city. As such, the metric reduces to the proportion of crimes captured in a hot spot for the numerator, and the proportion of the area considered in the denominator. For a simple example, Weisburd’s law of crime concentration would suggest that 5% of the area of the city approximately captures 50% of the crime, which results in a PAI statistic of 0.5/0.05 = 10.

It is important when evaluating this metric to use prospectively identified hot spots, and then use future crimes captured in those previously defined areas, e.g. create hot spots using 2016 data, and see how many crimes they capture in 2017 data. If one uses the same dataset to evaluate both, the accuracy of the method used to generate the hot spots will be greatly inflated (Berk, 2008).

Given we are interested in capturing weighted crime hot spots, we slightly adapt the PAI metric to take those weights into account. If one defines the weighted crime cost (or harm) captured in a hot spot and the weighted crime cost (or harm) in an entire city as below, where w indicates the weight for a particular crime:

∑𝑤⋅𝑛=𝑣, Weighted crime captured in hot spot∑w⋅n=v,Weightedcrimecapturedinhotspot
∑𝑤⋅𝑁=𝑉, Weighted crime in entire study area∑w⋅N=V,Weightedcrimeinentirestudyarea

We can then simply replace the numerator in the prior PAI formula with v/V, and have a value weighted PAI𝑣PAIv statistic.

𝑣/𝑉𝑎/𝐴=% Weighted Crime Captured% Area of City= PAI𝑣v/Va/A=%WeightedCrimeCaptured%AreaofCity=PAIv

This statistic is agnostic as to what constitutes the weights, which segues into our next section, how prior research has quantified crime cost (or harm) weights to capture the notion that some crimes (e.g. violent interpersonal offenses) should take priority over lower level offenses (e.g. petty theft) when identifying areas for police departments to target.

Calculating Crime Harm Scores

Recognizing that all types of crime count the same amount towards the crime rate, although they differ in severity, Wolfgang (1985) conducted a survey to determine how the public perceived the severity of crimes. Each respondent was asked to compare and rate a number of crimes based on their description and details of the incident. These ratings were averaged to determine an overall score for each crime. Patterns could be seen in the scores from the survey, such as crimes involving multiple victims typically receiving a higher severity score than the same crimes with a fewer number of victims. Additionally, factors such as who the victim was, the amount of money involved in the crime, or the original intent of the individual, are all additional factors that were important to respondents when assigning severity scores.

Many researchers have begun developing additional harm indexes to more appropriately measure crime than by a count or rate within an area. The majority of these harm indexes are based on sentencing data, as opposed to using public opinion surveys. Although the methods to create such harm indexes have varied slightly over different locations.

Ratcliffe (2015) attempted to create a realistic measure of harm experienced by the community due to crime. Focusing on Philadelphia, each offense was assigned a score based on general guidelines to assist judges in assigning an appropriate penalty for guilty individuals. From here, the author believed the index should also include harms suffered by the community that were not included in the harm index since it focuses mainly on violent crime. This ‘social harm perspective’ extends to include poverty/cash-loss, fraud, emotional harm, psychological harm, or sexual harm that may not be captured by serious crime numbers. The addition of other crime types, such as traffic accidents and investigative activity/stops, is more reflective of police activity than it is of a measure of harm to a community.

Sherman et al. (2016) created the Cambridge Crime Harm Index, adopted for the UK from the previously created Crime Harm Index, where they rated the harm of each crime reported to the police. This measure excluded any proactive crime detections from the police department, as they could not reliably measure harm experienced by the population. Each crime was multiplied by the number of days in prison an offender would receive upon conviction of that crime.

The New Zealand Crime Harm Index is another example of measuring the harm of crime, in an effort to reallocate police resources more efficiently. Curtis-Ham and Walton (2018) developed this measure as an alternative to the traditional Crime Harm Index, to be more specific with crimes in New Zealand. Although it takes a similar approach, the New Zealand Crime Harm Index uses the actual time served by the offender, to maintain proportionality between short and long sentences due to early releases and parole, account for Home Detention allotment, as well as account for the proportion of the sentence served.

The Western Australia Crime Harm Index was developed by House and Neyroud (2018) to analyze harm levels among common offenses for first-time offenders. It was calculated by taking the median number of days sentenced for each first time offender in a sample of most common offense categories. Conditional community sentences and monetary penalties were additionally accounted for in termed of the equivalent number of prison days. The number of offenses for each category was multiplied by the median number of prison days sentenced per category. This created the Western Australia Crime Harm Index, which was found to have the potential to improve policing, as well as community safety in Western Australia.

Mapping crime harm spots can take many different forms. Weinborn et al. (2017) identified the top 5% of street segments (two standard deviations away from the mean) that accounted for the most crime in the city to be hotspots, and constructed weights based on a crime harm index centered around sentencing data. Curtis-Ham and Walton (2017) used census units to determine which areas of New Zealand had the highest crime harm, specifically comparing the Crime Harm Index with the New Zealand Priority Locations Index. Norton et al. (2018) measured harm within street segments using the Crime Harm Index based on sentencing guidelines, and included results outside of three standard deviations from the mean. They identified the four most harmful offense categories (violence against persons, sexual offense, robbery and theft) to make up 80% of the harm within the 99 harm spots. Mitchell (2019) created the California Crime Harm Index using a measure of potential maximum prison days to evaluate the Sacramento Hot Spot Experiment in comparison with crime counts for the area. Fenimore (2019) used weighted kernel density estimates to show that while crime harm tended to be as concentrated as crime counts, crime clusters were in different areas than clusters based on crime counts. This was due to violent crimes contributing more to the weights, which were based on sentencing guidelines or the prior severity scores based on the Wolfgang survey.

As opposed to using crime harm estimated via sentencing decisions, we take a different tact in this analysis; we use estimates of costs of crime relevant directly to a police department. Hunt et al. (2019) chose to calculate crime specific variable cost estimates (for police departments) to aid in cost-benefit analysis. Using the labor costs of crime to construct hot spots, instead of focusing on sentencing guidelines and punishment, can directly relate to the benefits of reducing crime in that area that are directly realized by police.

To briefly describe Hunt et al.'s (2019) methodology in more detail, it specifically focuses on the labor costs of responding to crime. Criminologists often distinguish between reactive and proactive policing (Nagin et al., 2015), and Hunt's estimates are relevant for the reactive category. The reactive police response to crime can be broken down in-between various categories; Hunt et al. (2019) list different categories of administrative, arrest, crime scene, court time, investigative, and en-route/waiting. They generate state level estimates for costs of responding to crime based on various data sources, including Bureau of Justice Statistics resources for justice expenditures and time spent responding to different crimes (broken down by differing roles, such as patrol officer vs. detective), and state crime incident totals from the Uniform Crime Reporting program. They further estimate breakdowns between urban and rural agencies using individual level time diary estimates from studies specific jurisdictions, and conduct monte-carlo simulations to generate distributions around point estimates, although in this study we only use the point estimates for Part 1 crimes.

As Hunt et al. (2019) pointed out in the article, crimes that have the biggest costs often require the largest amount of manpower and time. Therefore, by reducing crime in those hot spot locations, the police department and city may be able to directly save money by reducing the burden to respond to different crimes. This may be more helpful than analyzing hot spots based on sentencing guidelines, as a department may not have as much motivation to make a change based on this data, since estimates of the benefit are not directly realized by the police department, nor will it help them justify any upfront investment in targeting such hot spots.

For an example, if a hot spot had an estimate $1 million dollars of cost over a year, and say a police department was interested in implementing a problem-oriented approach similar to that in the Lowell hot spot study (Braga & Bond, 2008). In that study, the Lowell Police Department reduced various Part 1 crimes by around 20% to 40% relative to control areas. So if a police department can successfully replicate those same results, it would produce a savings of $200,000 to $400,000 dollars over a years time in resources not spent responding to crime in that area. One would not be able to make similar return on investment arguments to justify hot spots policing when using the prior crime harm indices based on sentencing decisions.

The crime reduction results for problem oriented policing in the Braga and Bond (2008) study are around the average effect estimates in a recent meta analysis on similar problem oriented policing interventions (Hinkle et al., 2020), and so while this does not consider the upfront cost estimate, it gives an estimate of potential return on investment. Clearly hot spot areas with crime costs lower than the potential intervention cost are not capable of generating a positive return on investment when only considering policing costs.

Data and Methods

Data used for the analysis all come from open data sources. Address level geocoded crime data is available from dallasopendata.com. For this analysis we use crime data from June 2014 through December 2016 as the training dataset to draw historical hot spots, and then compare to test data from January 2017 through June 2018. While additional crime data was available at the time of writing, Dallas PD underwent a transition to NIBRS reporting, which subsequently caused various anomalies in the open data (such as a dramatic drop in thefts). The Dallas open data does not contain reported rapes, so the analysis here only examines the other Part 1 UCR index crimes; murder, aggravated assault, robbery, burglary, theft, and motor-vehicle-theft. Crimes that were given the address of a Dallas PD station (or sub-station) were removed from the analysis.

For GIS files, we obtained 2017 TAAG hot spot areas from the same dallasopendata.com website. We obtained an outline of the city, as well as street centerlines, from https://gis.dallascityhall.com/shapefileDownload.aspx. There are a total of 54 TAAG areas. All geographic analysis was conducted using a local projection relevant for Dallas, a Lambert conformal conic projection centered in Texas.

Although the main analysis uses area as the denominator for PAI calculations, street centerlines are used as an alternative denominator in place of area for PAI calculations in supplementary analysis (Drawve & Wooditch, 2019). This supplementary analysis is available in the appendix, and largely mimics the same findings using area for PAI calculations that we focus on in the main analysis.

We eliminated several lochs from the city outline of Dallas proper, as including these areas would artificially increase PAI statistics, although they cannot reasonably have geocoded crime incidents occur within them (Wheeler & Steenbeek, 2020). Figure 1 displays a map of Dallas proper, along with the 2017 Dallas PD identified TAAG areas. While these are 2017 areas, many are historically consistent, and are the same as those reported in Worrall and Wheeler (2019).

figure

Figure 1. Historical Hot Spots Used by the Dallas Police Department, Target Area Action Grid (TAAG) Areas.

For cost of crime estimates, we use the Texas specific estimates presented in Hunt et al. (2019). Table 1 lists the descriptive statistics, as well as those cost of crime estimates per UCR offense. As the testing period is shorter (1.5 years vs. 2.5 years), the total numbers of crimes are smaller. But on a per unit time scale, the overall crime counts between periods are largely comparable, so estimates based on the pre-data should be reasonable to forecast in time. This is especially the case since others have found micro place hot spots tend to be very stagnant over long periods of time (Andresen et al., 2017; Curman et al., 2015; Weisburd et al., 2004; Wheeler et al., 2016).

Table 1. Cost of Crime and Crime Totals.

Table 1. Cost of Crime and Crime Totals.

10.1177_1098611120957948-table1.gif

There are two main methodological advances presented in the paper. One is the use of the DBSCAN (Density Based SCAN) clustering technique to create hotspots of crime (Campello et al., 2013). Being a hierarchical clustering technique, it is well suited to identify irregular shaped hotspots, such as runs along a street (Brantingham & Brantingham, 1993; Grubesic, 2006).

While the contribution of illustrating DBSCAN is a minor change compared to other common hierarchical clustering techniques (Haberman, 2017), the main reason it was chosen is because it can easily incorporate weights into the clustering algorithm.1 Here we use cost of crime estimates as those weights when constructing hot spots. This is similar to past work that has used crime harm estimates to construct and evaluate hot spots (Macbeth & Ariel, 2019; Ratcliffe, 2015).

For a brief description of how the clustering algorithm works, DBSCAN has two parameters that need to be chosen by the analysts; epsilon (the radius to which two points can ultimately be connected), and the minimum number of points (or here weighted points) to establish a cluster. These parameters are chosen here as 400 feet for epsilon, and $400,000 for the total costs of crime within a hotspot. Four hundred feet was chosen as this is slightly less than the average street segment size in Dallas (Wheeler & Steenbeek, 2020). For an average length street segment in Dallas, if there were only crimes reported at either end, this technique would not link them together. We believe approximating a street segment length in epsilon is a reasonable distance parameter, given the large literature specifying crime hot spots as particular high crime street segments (MacBeth & Ariel, 2019; Weisburd et al., 2004; Weisburd, 2015). If a street segment only has crimes reported at two polar ends, with no crimes in between, we believe it is more likely they should be connected to other potential hot spots, not with each other.

Four hundred thousand dollars (in 2010 dollars) (Hunt et al., 2019) was chosen as a baseline cost of crime hot spot value for the ad-hoc reason it tended to produce a total number of hot spots similar to the total number of TAAG areas. $400,000 is the minimum weight needed for the technique to return an identified cluster, it will ultimately identify areas with even more crime cost than that baseline.

For a simplified example of how DBSCAN works, imagine you have two locations, A and B, that are 300 feet apart, and each have a total of $200,000 in crime cost. Even though each location by itself does not meet the $400,000 threshold, combined they do. The epsilon parameter is what defines the threshold as to whether two points can be combined to consider their joint weight. Points A and B then form a core cluster. Points additionally within 400 feet from either A or B will additionally be considered inside the cluster, but unless all points within a particular radius of 400 feet exceed the $400,000 in crime cost, those points will not be considered a core point for the cluster.

Thus the epsilon distance parameter and the minimum weight parameter are two details that need to be chosen by the analyst, they are not automatically chosen in an algorithmic way. This is both a strength and a weakness – there are likely no universal solutions for what those values should be to produce the ‘best’ hot spots, but that choice also allows the analyst to adjust the parameters for their own particular circumstance. So cities that are more spread out an analyst may choose a larger distance parameter, or for hot spot interventions that are more costly the analyst may make the threshold for hot spot identification larger as well. Ultimately all hot spot creation procedures involve some arbitrary decisions, e.g. choosing a bandwidth for a kernel density map, and the necessary decisions to use DBSCAN is no more onerous than other common hot spot techniques.

We use the open source R statistical package to conduct the analysis, in particular the dbscan library, with cost estimates for crime as the weights (Hahsler et al., 2019). We create DBSCAN clusters based on the training dataset, and then buffer the core points within the clusters 400 feet to generate hot spot areas. We additionally merge any two clusters that this procedure results in overlaps, as well as remove any holes (areas in which it is entirely surrounded by a cluster). We then count the testing set number of crimes and their cost that fall within those hot spots, which provides an example of how such hot spots are used prospectively by police in practice (Berk, 2008). This procedure results in a total of 61 micro place clusters.

To evaluate the clusters, we then use the predictive accuracy index (PAI) to assess the accuracy of the hot spots. This common technique to measure accuracy (Chainey et al., 2008; Levine, 2008), and was used in the recent National Institute of Justice Forecasting challenge (Flaxman et al., 2019; Lee & Eck, 2019; Mohler & Porter, 2018).

The PAI statistic is typically defined as the proportion of crimes captured in a hot spot area, divided by the proportion of the city the hot spot covers. For an example, if a particular set of identified hot spots covers 2% of the city, and the hot spots capture 10% of the total crime, that would result in a PAI of 20. Here we make a simple modification to the PAI to accommodate cost of crime estimates, for the numerator we consider the total cost of crime captured, instead of simply the proportion of crime. So in our prior example, if there is a total of $1 million dollars in crime cost in a city, and the same hot spot captures $200,000 in crime cost (20% of the total cost), using the same 2% of the area denominator, the PAI statistic would then be 40. We evaluate both the total cumulative PAI identified by our DBSCAN clusters vs. Dallas PD defined TAAG areas, as well as provide graphs and maps of individual identified hot spots.

Results

Table 2 displays the resulting citywide aggregate statistics, accumulating the entire TAAG and our identified DBSCAN hot spot areas. The total crime cost captured in the DBSCAN areas is just under $20 million dollars, which is 12% of the cost of crime in the city (over $169 million dollars). Given that the DBSCAN areas cover less than 3 square miles (1% of the city), this results in a PAI statistic of 16. Although the clusters were only fit on the cumulative proportions of each crime, overall they result in very similar PAI statistics when doing the (non-cost weighted) breakdowns for individual crime types, thus the results do not appear to be driven by a particular crime type.

Table 2. Cumulative PAI Statistics for DBSCAN Areas vs. TAAG Areas.

Table 2. Cumulative PAI Statistics for DBSCAN Areas vs. TAAG Areas.

10.1177_1098611120957948-table2.gif

While the TAAG hot spots capture a much larger proportion of crime and crime cost, they also cover a much larger area of the city. TAAG areas cover 65 square miles, 19% of the city. Overall TAAG’s capture 54% of the costs of crime in Dallas (over $92 million dollars), resulting in a PAI of 2.9. Again the statistics only vary slightly when considering non-weighted crime counts, and are typically around a PAI of 3.

Figure 2 displays PAI statistics for individual TAAG and DBSCAN hot spot areas. The majority of the TAAG areas hover under a PAI of 5, whereas the DBSCAN areas have a much broader distribution. While the left tail of the DBSCAN hot spots falls short of the PAI over 10 ideal laid out in the introduction, 50 of the 61 DBSCAN clusters have a PAI of over 10. It happens that only three of the DBSCAN areas have a lower PAI (based on the cost of crime) than the highest PAI for TAAG areas.

figure

Figure 2. PAI Statistics for Individual DBSCAN Crime Harm Weighted Hotspots vs. Current TAAG Areas.

Figure 3 displays a screenshot of an interactive map we created to explore the areas, https://apwheele.github.io/MathPosts/HotSpotMap.html. The blue DBSCAN areas are all smaller than 0.2 square miles, whereas the red TAAG areas tend to be around 1 to 2 square miles. Like other work on hot spots at micro places (Weisburd et al., 2004), the DBSCAN clusters are spread throughout the city, although have several sub-clusters in different areas of the city. The majority (but not all) of DBSCAN areas are covered by TAAGs. This suggests many of the TAAG hot spots currently in place can simply be adjusted to focus on more specific places.

figure

Figure 3. Screenshot of Interactive Map Showing Superimposed TAAG Areas (Red) With the Constructed DBSCAN Hot Spot Areas (Blue). Map available at https://apwheele.github.io/MathPosts/HotSpotMap.html.

Another way to consider the size of a hot spot is to consider the length of the streets in the area, since that will more naturally define the area in which officers will be patrolling (Drawve & Wooditch, 2019; Ratcliffe & Sorg, 2017). For reference hot spot areas in terms of street length, in the Philadelphia policing tactics experiment, the hot spot areas tended to cover around 3 miles of streets (Groff et al., 2015). For the Philadelphia foot patrol experiment, the areas covered a total of 1.3 miles of street on average (Ratcliffe et al., 2011). The DBSCAN hot spots here on average cover just under 1 mile of street. Of the 61 DBSCAN hot spots, only 11 cover more than 1.3 miles of street length, and only one covers more than 3 miles of street length. For comparison the current TAAG areas cover on average 24 miles of streets. The smallest TAAG area in terms of street length coverage is 12 miles.

Figure 4 illustrates a more zoomed in focused screen shot of the same interactive map. The map provides tooltips when clicking on a blue DBSCAN area, and this provides a breakdown of the total crimes and the cost of crime captured in a particular DBSCAN area. The particular example shown, a cluster that runs along Scyene road and expands onto Saint Augustine road illustrates the utility of a hierarchical clustering technique. Clusters that rely on identifying elliptical shaped clusters would have a difficult time approximating that shape. This particular hot spot contains a Family Dollar store, a gas station, an elementary school, and several apartment complexes, each likely an individual high risk crime generator (Eck et al., 2007), that all in toto contribute various crimes to this hotspot. One can see from the tool tip that the hot spot contains a variety of violent (26 aggravated assaults, 13 robberies), as well as non-violent crimes (15 burglaries, 58 thefts, and 26 thefts of motor-vehicles).

figure

Figure 4. Zoomed in Screenshot of Interactive Map Showing Superimposed TAAG Areas (Red) With the Constructed DBSCAN Hot Spot Areas (Blue). Blue DBSCAN areas have a textual description showing crime totals and harms totals for the individual area. Map available at https://apwheele.github.io/MathPosts/HotSpotMap.html.

In terms of the question do hot spots tend to be a single address or a street segment, one can find examples to support either. Many of the clusters are 1 to 2 addresses, although this may in part be a function of the nature of the clustering algorithm. It can return a single address as a cluster if it happens to have more than $400,000 dollars in cost in the training dataset. But if there are any addresses within 400 feet, even with a single crime, they will also be within the core cluster. One can also find examples that more closely approximate street segments, or several street segments nearby one another, which are a function of multiple addresses accumulating to reach that $400,000 threshold.

Other exploratory data analysis did not reveal any obvious patterns to the identified hot spots. In particular we examined whether specific crime types tended to dominate the identified clusters. It could be since violent crimes have larger costs, the hot spots tend to be more violent on average. Or the opposite, since property crimes are much more prevalent, the hot spots are all dominated by thefts. Neither of these appeared to be the case – the majority of hot spots tended to have a wide distribution of crimes (although one could find some counter-examples dominated by violent or property crimes). To quantify this, we have estimated the Shannon entropy statistic for each of our DBSCAN clusters (Lee & Eck, 2019).2 For this statistic, if a hot spot only contained one crime type, the entropy would be 0. Higher values signify more entropy, so it is harder to predict what type of crime may fall within that hot spot. The average entropy for each area is 1.38, with a maximum entropy given six potential crime types under examination here is 1.79. Thus areas are much closer to max entropy than they are encompassing a predictable set of crime types.

This is contra to the work by Haberman (2017), who found hot spots of different crime types did not tend to overlap. This difference is perhaps a result of clustering on the combined crimes, instead of clustering crimes individually. Although the difference in results may also be the case that Dallas has substantively different crime patterns than Philadelphia.

Discussion

The results of the analysis demonstrated what we believed at the onset – that Dallas PD could identify much smaller hot spots than the current TAAG areas in place and capture a much larger density of cost of crime per area. Like prior analysis of concentration of crime harm (Curtis-Ham & Walton, 2017; Fenimore, 2019; Norton et al., 2018; Weinborn et al., 2018), we find micro places that have high values of crime cost in clusters. Additionally, we show that such micro places have long term stability in the cost of crime, by showing that hot spots generated with historical data also capture high values of crime cost in the future (Norton et al., 2018). As such, our work is complimentary to many past analyses, and builds on a long standing literature of constructing and evaluating the predictive capability of hot spots clustering.

An additional finding of the work is that we identify a heterogenous set of areas that qualify as hot spots in our clustering algorithm. While much past work has either conducted analysis at a pre-specified unit of analysis, such as census areas (Curtis-Ham & Walton, 2017), street segments (Norton et al., 2018), or specific addresses (O’Brien & Winship, 2017), our approach identifies hot spots that do not nicely conform to any of those pre-specified units of analysis. Thus there is likely a potential to be more efficient in hot spot identification when using a solution that does not presume a specific structure for hot spots at the start. Although we note that any of these ranking approaches is likely better to the status quo of constructing hot spot areas based on personal intelligence (MacBeth & Ariel, 2019).

That being said, ultimately the nature of hot spot identification will always come with some arbitrariness, given that hot spots themselves have no well agreed upon definition (Taylor, 2015). Subsequently a limitation of this work is that we do not consider here other methods used to identify hot spots and test their accuracy. Many exist, in addition to the traditional clustering techniques mentioned in the literature review, there are a variety of model based approaches (Drawve, 2016; Mohler et al., 2018; Wheeler & Steenbeek, 2020) we have not touched on here. Thus while we cannot say that our DBSCAN analysis is dispositive that it is better than these other approaches, we believe it is likely the case one can use many of these different approaches to improve upon the current Dallas TAAGs, even simply just counting up street segments with the highest crimes is likely much more accurate (MacBeth & Ariel, 2019; Wheeler & Steenbeek, 2020).

One strong limitation of the current analysis is that the definition of the cost of crime is not well agreed upon. While here we use the work of Hunt et al. (2019) that is mostly based on costs associated with police response to particular crimes, there is a wide array of ways to calculate the cost of crime (Domínguez & Raphael, 2015). For example, a murder in this analysis is valuated at $124 thousand dollars, whereas the value of a statistical life is often pegged at over one million dollars (Domínguez & Raphael, 2015). This difference has to do with who bears the cost of the murder – for police it is mostly in terms of investigative time, whereas the cost is much more severe for the victims family (and infinite for the victim themselves). It may also be that crime prevented for individuals accumulate over a lifetime (Cohen & Piquero, 2009), with failing to prevent crime resulting in long term negative externalities for various individuals. Finally, this does not consider additional considerations on allocating police resources, such as equity or preventing disproportionate minority contact with a hot spots policing approach (Wheeler, 2020). So while these cost of crimes estimates are relevant to police departments themselves, they may not be reasonable to the greater public when constructing a hot spots policing strategy.

Additionally, this work relying on public data does not include rapes, and we also do not include Part 2 crimes, which are often considered public disorder crimes and additionally contribute to quality of life (Boggess & Maskaly, 2014; Chappell et al., 2011; Ratcliffe, 2015). This is not a limitation for a police department applying such estimates internally, where they have access to all of the reported crime data, but even then non-reporting will cause such estimated cost of crime hot spots to be an underestimate of the cost of crime at particular places. Ultimately how to appropriately value crime harm is as much an ethical question as it is an empirical. This work shows one way to do so, using labor costs directly relevant to police departments, but this is not to say other ways, such as via sentencing severity or public surveys are inherently inferior. But it is likely the case that many of these different schemes will produce similar rankings in crime (e.g. all would likely identify homicide as a higher harm than theft), so we believe using any crime harm ranking scheme is likely better than the defacto standard of treating such crimes equally when creating hot spots.

So we believe that both of these biases likely underestimate the cost of crime that is occurring the hot spots we identify here. Given that the ultimate goal of identifying such hot spots is to prevent crime, these estimates may then be gross underestimates of the potential return on investment if Dallas PD targets crime in those identified areas. But that is also conditional on a hot spots policing strategy to be effective (Mitchell, 2019) – just placing hot spots on the map and not doing anything with that information will ultimately not reduce crime at those locations.

Related to this is that in the DBSCAN algorithm itself we need to define an agreed upon minimal threshold at which to identify a location as a hot spot. Here we arbitrarily choose $400,000 dollars. Ultimately this minimum threshold would be better determined by considering the nature of the intervention police departments wish to engage in that hot spot area. So for example if the particular intervention was more cost intensive (e.g. overtime patrols), this would likely only be justified in hot spot areas with higher cost thresholds. In that case, a $400,000 hot spot over a year may not be sufficient, and the department may wish to raise this threshold. Cheaper interventions, such as nudging officers to do more patrols in high crime areas (Mohler et al., 2015), may however justify lowering the threshold to lower dollar amounts.

It may also be that a police department wishes to focus on a particular set of crimes for a hot spots policing strategy, e.g. just focus on gun violence. Given our identified hot spots here tend to have a variety of different crimes, it may still make sense to generate general cost of crime hot spots, and then filter for those areas that contain certain levels of specific crime types, as it is likely the case an intervention focused on one crime type is likely to have positive spillovers to reducing other crime types.

While this work focuses on long term crime forecasting, the application of using DBSCAN can potentially be extended to short term crime forecasting (Flaxman et al., 2019; Garnier et al., 2018; Lee et al., 2020; Mohler et al. 2015; Ratcliffe et al., 2020). One approach that we believe may be fruitful is to use DBSCAN on the output of a predictive policing application applied at the address and intersection level to create homogenous areas to assign subsequent patrols (Deryol et al., 2016). Although raster based approaches to predictive policing are popular (Caplan et al., 2011), these raster grids do not conform to the actual micro places and the street network that heavily influence crime (Groff, 2014; Rosser et al., 2017). While prior work has used the creation of the raster grid orientation and size of the grid cells as a hyperparameter when tuning machine learning models (Flaxman et al., 2019; Mohler & Porter, 2018), one could similarly use the DBSCAN parameters of epsilon and minimum weight in the same capacity to avoid relying on grid cells entirely. Such results are likely to result in more natural, contiguous hot spot areas to target. Such contiguous areas would not only require fewer resources to target (relative to separate hot spots not in the same area), but also would likely result in better police adherence when targeting those hot spots boundaries (Sorg et al., 2017).

Given that errors associated with geocoding tend to be around 100 feet on average for crimes occurring outdoors (Wheeler et al., 2020), it likely does not make sense to cluster crimes within a smaller distance than that shown here, but it may be the case that one would identify more agglomerated areas if the epsilon clustering criteria was slightly larger, as one can see sub-clusters of the individual clustered hot spot areas in the resulting map in this analysis. It is likely the case that idiosyncratic characteristics of different jurisdictions will likely change what distance parameter captures the most weighted crime cost, so future research may be warranted about how to best tune this value given a similar test and validation data set.

Policy Implications

The policy implications of constructing hot spots based on cost-of-crime estimates, as opposed to either direct counts of crime or weights based on sentencing severity are rather straightforward. They provide a much more direct cost-benefit analysis of the potential return on investment directly relevant to police departments.

For a direct estimate, let us take an example from the hot spot that captured the largest cost of crime, a hot spot in the center of Dallas that captured around 1.5 million dollars of crime within 1.5 years. The average crime reduction in the Braga et al. (2019)meta-analysis is around 20%. As such, the expected return to partaking in a hot spots policing strategy at this one location is around $200,000 per year. Such a return on investment would justify assigning additional resources to target this hot spot. For locations with smaller overall costs of crime, while it likely does not justify allocating additional resources, it certainly may justify reallocating existing resources to target crime in those particular micro places.

While this analysis is illustrative of how one might create cost-of-crime weighted hot spots, ultimately the optimal strategy to construct hot spots needs to entail what actions the police department is going to take within those hot spots (as well as functional constraints in the resources the police department can allocate). Historically Dallas PD has only generically stated TAAG area hot spots were formulated to predict areas of high crime (Ferguson, 2017). They have not articulated specific strategies they would undertake at said hot spots.

Take for example if Dallas PD knew they wanted to create foot or bike patrols for identified hot spots, and budgeted enough to assign 4 different officers on some regular basis. It is likely the case that only four officers cannot cover all 61 micro place hot spots identified in this analysis, but it would be easily possible for those officers to cover at least one of the micro place hot spots identified here, which are all under 0.2 square miles (Haberman & Stiver, 2019; Ratcliffe & Sorg, 2017). Given that such areas have a specific cost attached to them, it also provides more straightforward cost-benefit analysis if the Dallas PD wishes to advocate for more resources to be able to cover such hot spots, or to justify reallocating existing resources to cover such hot spots.

While those same officers could cover the existing TAAG areas, expanding the size of such hot spots will result in a more diffused intervention by the police, no matter what the intervention is (Larson, 1975). As such it is likely in Dallas PD’s best interest to take a more vested approach to identify micro places, even if they do not have a specific plan for what to do in the current hot spots at the current time. It may be a police department wishes to generate such hot spots first, and then take a problem oriented approach to solving crimes in those areas (Braga et al., 1999). Given that targeted police interventions at large areas tend to be less successful than micro placed focused interventions (Lum et al., 2011), weighted cost-of-crime DBSCAN identified hot spots are a tool crime analysts for any police department should consider in formulating a hot spots policing strategy.

The feasibility of such an approach should be within the capabilities of Dallas PD to implement. This analysis comes with a set of replication materials using open source data and open source code, thus Dallas PD (or any other interested police department) can replicate the analysis using more current data for free if they so wish. Given the consistency of other papers identifying hot spots of crime, it is likely the case such work can generate areas of hot spots which show promise of generating significant returns on investment for many police departments. Hot spots weighted by cost of crime estimates can then be used as an upfront tool to justify either new investments in police departments to allocate resources to hot spots or to shift current resources to hot spots. This provides a direct cost-benefit calculation to justify to police departments to shift resources from reactive to proactive policies (Nagin et al., 2015).

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Notes

1
This is not exclusive to DBSCAN, for example one could weight STAC clusters similarly (Block & Block, 1995). However, we do believe hierarchical clustering techniques are the most useful for irregularly shaped crime clusters.

2
The entropy statistic can be calculated as 𝐸=−∑𝑐𝑝𝑐log(𝑝𝑐)E=−∑cpclog(pc), where 𝑝𝑐pc is the proportion of crime c out of all crimes in that hot spot area. So for an example, a hot spot location of 5 robberies, 3 murders, and 2 motor vehicle thefts would have an entropy statistic of 𝐸=−[510𝑙𝑜𝑔(510)+310𝑙𝑜𝑔(310)+210𝑙𝑜𝑔(210)] ≈1.02E=−[510log⁡510+310log⁡310+210log⁡210]≈1.02. Crimes with zero values do not enter into the calculation.

References

Andresen, M. A., Curman, A. S., Linning, S. J. (2017). The trajectories of crime at places: Understanding the patterns of disaggregated crime types. Journal of Quantitative Criminology, 33(3), 427–449.
Google Scholar | Crossref
Bayley, D. H. (1994). Police for the future. Studies in Crime and Public Policy.
Google Scholar
Berk, R. (2008). Forecasting methods in crime and justice. Annual Review of Law and Social Science, 4(1), 219–238.
Google Scholar | Crossref
Block, R. L., Block, C. R. (1995). Space, place and crime: Hot spot areas and hot places of liquor-related crime. Crime Prevention Studies, 4, 145–184.
Google Scholar
Boggess, L., Maskaly, J. (2014). The spatial context of the disorder-crime relationship in a study of Reno neighborhoods. Social Science Research, 43, 168–183.
Google Scholar | Crossref | Medline
Braga, A. A., Bond, B. J. (2008). Policing crime and disorder hot spots: A randomized controlled trial. Criminology, 46(3), 577–607.
Google Scholar | Crossref | ISI
Braga, A. A., Papachristos, A. V., Hureau, D. M. (2010). The concentration and stability of gun violence at micro places in Boston, 1980–2008. Journal of Quantitative Criminology, 26(1), 33–53.
Google Scholar | Crossref | ISI
Braga, A. A., Turchan, B. S., Papachristos, A. V., Hureau, D. M. (2019). Hot spots policing and crime reduction: An update of an ongoing systematic review and Meta-analysis. Journal of Experimental Criminology, 15(3), 289–311.
Google Scholar | Crossref
Braga, A. A., Weisburd, D. L., Waring, E. L., Mazerolle, L. G., Spelman, W., Gajewski, F. (1999). Problem-oriented policing in violent crime places: A randomized controlled experiment. Criminology, 37(3), 541–580.
Google Scholar | Crossref | ISI
Brantingham, P. L., Brantingham, P. J. (1993). Nodes, paths and edges: Considerations on the complexity of crime and the physical environment. Journal of Environmental Psychology, 13(1), 3–28.
Google Scholar | Crossref | ISI
Campello, R. J. G. B., Moulavi, D., Sander, J. (2013). Density-based clustering based on hierarchical density estimates. Proceedings of the 17th Pacific-Asia Conference on Knowledge Discovery in Databases, PAKDD 2013, Lecture Notes in Computer Science 7819: 160–172.
Google Scholar
Caplan, J. M., Kennedy, L. W., Miller, J. (2011). Risk terrain modelling: Brokering criminological theory and GIS methods for crime forecasting. Justice Quarterly, 28(2), 360–381.
Google Scholar | Crossref | ISI
Chainey, S., Tompson, L., Uhlig, S. (2008). The utility of hotspot mapping for predicting spatial patterns of crime. Security Journal, 21(1–2), 4–28.
Google Scholar | Crossref
Chappell, A., Monk-Turner, E., Payne, B. (2011). Broken windows or window breakers: The influence of physical and social disorder on quality of life. Justice Quarterly, 28(3), 522–540.
Google Scholar | Crossref
Cohen, M. A., Piquero, A. R. (2009). New evidence on the monetary value of saving a high risk youth. Journal of Quantitative Criminology, 25(1), 25–49.
Google Scholar | Crossref
Curman, A. S., Andresen, M. A., Brantingham, P. J. (2015). Crime and place: A longitudinal examination of street segment patterns in Vancouver, BC. Journal of Quantitative Criminology, 31(1), 127–147.
Google Scholar | Crossref
Curtis-Ham, S., Walton, D. (2017). Mapping crime harm and priority locations in New Zealand: A comparison of spatial analysis methods. Applied Geography, 86, 245–254.
Google Scholar | Crossref
Curtis-Ham, S., Walton, D. (2018). The New Zealand crime harm index: Quantifying harm using sentencing data. Policing: A Journal of Policy and Practice, 12(4), 455–467.
Google Scholar | Crossref
Deryol, R., Wilcox, P., Logan, M., Wooldredge, J. (2016). Crime places in context: An illustration of the multilevel nature of hotspot development. Journal of Quantitative Criminology, 32(2), 305–325.
Google Scholar | Crossref
Domínguez, P., Raphael, S. (2015). The role of the cost-of-crime literature in bridging the gap between social science research and policy making: Potentials and limitations. Criminology & Public Policy, 14(4), 589–632.
Google Scholar | Crossref
Drawve, G. (2016). A metric comparison of predictive hot spot techniques and RTM. Justice Quarterly, 33(3), 369–397.
Google Scholar | Crossref | ISI
Drawve, G., Wooditch, A. (2019). A research note on the methodological and theoretical considerations for assessing crime forecasting accuracy with the predictive accuracy index. Journal of Criminal Justice Online Justice, 64, 101625.
Google Scholar | Crossref
Eck, J. E., Chainey, S. P., Cameron, J. G., Leitner, M., Wilson, R. E. (2005). Mapping crime: Understanding hot spots. National Institute of Justice.
Google Scholar
Eck, J. E., Clarke, R. V., Guerette, R. T. (2007). Risky facilities: Crime concentration in homogenous sets of establishments and facilities. Crime Prevention Studies, 21, 225–264.
Google Scholar
Fenimore, D. M. (2019). Mapping harmspots: An exploration of the spatial distribution of crime harm. Applied Geography, 109, 102034.
Google Scholar | Crossref
Ferguson, A. G. (2017). The rise of big data policing: Surveillance, race, and the future of law enforcement. New York University Press.
Google Scholar | Crossref
Flaxman, S., Chirico, M., Pereira, P., Loeffler, C. (2019). Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: A winning solution to the NIJ “real-time crime forecasting challenge. The Annals of Applied Statistics, 13(4), 2564–2585.
Google Scholar | Crossref
Garnier, S., Caplan, J. M., Kennedy, L. W. (2018). Predicting dynamical crime distribution from environmental and social influences. Frontiers in Applied Mathematics and Statistics, 4, 13.
Google Scholar | Crossref
Groff, E. R. (2014). Quantifying the exposure of street segments to drinking places nearby. Journal of Quantitative Criminology, 30(3), 527–548.
Google Scholar | Crossref
Groff, E. R., Ratcliffe, J. H., Haberman, C. P., Sorg, E. T., Joyce, N. M., Taylor, R. B. (2015). Does what police do at hot spots matter? The Philadelphia policing tactics experiment. Criminology, 53(1), 23–53.
Google Scholar | Crossref | ISI
Grubesic, T. H. (2006). On the application of fuzzy clustering for crime hot spot detection. Journal of Quantitative Criminology, 22(1), 77–105.
Google Scholar | Crossref
Haberman, C. P. (2017). Overlapping hot spots? Examination of the spatial heterogeneity of hot spots of different crime types. Criminology & Public Policy, 16(2), 633–660.
Google Scholar | Crossref
Haberman, C. P., Stiver, W. H. (2019). The Dayton foot patrol program: An evaluation of hot spots foot patrols in a Central business district. Police Quarterly, 22(3), 247–277.
Google Scholar | SAGE Journals | ISI
Hahsler, M., Piekenbrock, M., Doran, D. (2019). DBSCAN: Fast density-based clustering with R. Journal of Statistical Software, 91(1), 1–30.
Google Scholar | Crossref
Hinkle, J. C., Weisburd, D., Telep, C. W., Petersen, K. (2020). Problem-oriented policing reducing crime and disorder: An updated systematic review and meta-analysis. Campbell Systematic Reviews , 16(2)
Google Scholar | Crossref
House, P. D., Neyroud, P. W. (2018). Developing a crime harm index for Western Australia: The WACHI. Cambridge Journal of Evidence-Based Policing, 2(1–2), 70–94.
Google Scholar | Crossref
Hunt, P. E., Anderson, J., Saunders, J. (2017). The price of justice: New national and state-level estimates of the judicial and legal costs of crime to taxpayers. American Journal of Criminal Justice, 42(2), 231–254.
Google Scholar | Crossref
Hunt, P. E., Saunders, J., Kilmer, B. (2019). Estimates of law enforcement costs by crime type for benefit-cost analyses. Journal of Benefit-Cost Analysis, 10(1), 95–123.
Google Scholar | Crossref
Ignatans, D., Pease, K. (2016). Taking crime seriously: Playing the weighting game. Policing, 10(3), 184–193.
Google Scholar | Crossref
Kounadi, O., Ristea, A., Araujo, A., Leitner, M. (2020). A systematic review on spatial crime forecasting. Crime Science, 9(1), 7.
Google Scholar | Crossref | Medline
Larson, R. C. (1975). What happened to patrol operations in Kansas city? A review of the Kansas city preventive patrol experiment. Journal of Criminal Justice, 3(4), 267–297.
Google Scholar | Crossref | ISI
Lee, Y., Eck, J. E. (2019). Comparing measures of the concentration of crime at places. Crime Prevention and Community Safety, 21(4), 269–294.
Google Scholar | Crossref
Lee, YJei., SooHyun, O., Eck, J. E. (2020). A theory driven algorithms for real-time crime hot spot forecasting. Police Quarterly Online Quarterly , 23(2), 174–201.
Google Scholar | SAGE Journals | ISI
Levine, N. (2008). The “hottest” part of a hotspot: Commentary on “the utility of hotspot mapping for predicting spatial patterns of crime. Security Journal, 21(4), 295–302.
Google Scholar | Crossref
Lum, C., Koper, C. S., Telep, C. W. (2011). The evidence-based policing matrix. Journal of Experimental Criminology, 7(1), 3–26.
Google Scholar | Crossref | ISI
Macbeth, E., Ariel, B. (2019). Place-based statistical versus clinical predictions of crime hot spots and harm locations in Northern Ireland. Justice Quarterly, 36(1), 93–126.
Google Scholar | Crossref
Mitchell, R. J. (2019). The usefulness of a crime harm index: Analyzing the Sacramento hot spot experiment using the California crime harm index (CA-CHI). Journal of Experimental Criminology, 15(1), 103–113.
Google Scholar | Crossref
Mohler, G. O., Porter, M. D. (2018). Rotational grid, PAI-maximizing crime forecasts. Statistical Analysis and Data Mining: The ASA Data Science Journal , 11(5), 227–236.
Google Scholar | Crossref
Mohler, G. O., Carter, J., Raje, R. (2018). Improving social harm indices with a modulated Hawkes process. International Journal of Forecasting, 34(3), 431–439.
Google Scholar | Crossref
Mohler, G. O., Short, M. B., Malinowski, S., Johnson, M., Tita, G. E., Bertozzi, A. L., Brantingham, P. J. (2015). Randomized controlled field trials of predictive policing. Journal of the American Statistical Association, 110(512), 1399–1411.
Google Scholar | Crossref
Nagin, D. S., Solow, R. M., Lum, C. (2015). Deterrence, criminal opportunities, and police. Criminology, 53(1), 74–100.
Google Scholar | Crossref | ISI
Norton, S., Ariel, B., Weinborn, C., O’Dwyer, E. (2018). Spatiotemporal patterns and distributions of harm within street segments. Policing: An International Journal, 41(3), 352–371.
Google Scholar | Crossref
O’Brien, D. T., Winship, C. (2017). The gains of greater granularity: The presence and persistence of problem properties in urban neighborhoods. Journal of Quantitative Criminology, 33(3), 649–674.
Google Scholar | Crossref
Ratcliffe, J. H. (2015). Towards an index for harm-focused policing. Policing, 9(2), 164–182.
Google Scholar | Crossref
Ratcliffe, J. H., Sorg, E. T. (2017). Foot patrol: Rethinking the cornerstone of policing. Springer.
Google Scholar | Crossref
Ratcliffe, J. H., Taniguchi, T., Groff, E. R., Wood, J. D. (2011). The Philadelphia foot patrol experiment: A randomized controlled trial of police patrol effectiveness in violent crime hotspots. Criminology, 49(3), 795–831.
Google Scholar | Crossref | ISI
Ratcliffe, J. H., Taylor, R. B., Perenzin-Askey, A., Thomas, K., Grasso, J., Bethel, K., Fisher, R., Koehnlein, J. (2020). The Philadelphia predictive policing experiment. Journal of Experimental Criminology. Advance online publication. https://doi.org/10.1007/s11292-019-09400-2
Google Scholar | Crossref
Rosser, G., Davies, T., Bowers, K. J., Johnson, S. D., Cheng, T. (2017). Predictive crime mapping: Arbitrary grids or street networks? Journal of Quantitative Criminology, 33(3), 569–594.
Google Scholar | Crossref | Medline
Sherman, L. W , and Cambridge University associates (2020). How to count crime: The Cambridge Harm Index consensus. Cambridge Journal of Evidence-Based Policing, 4(1-2), 1–14.,
Google Scholar | Crossref
Sherman, L. W., Gartin, P. R., Buerger, M. E. (1989). Hot spots of predatory crime: Routine activities and the criminology of place. Criminology, 27(1), 27–56.
Google Scholar | Crossref | ISI
Sherman, L. W., Neyroud, P. W., Neyroud, E. (2016). The Cambridge Crime Harm Index: Measuring total harm from crime based on sentencing guidelines. Policing, 10(3), 171–183.
Google Scholar | Crossref
Sorg, E. T., Wood, J. D., Groff, E. R., Ratcliffe, J. H. (2017). Explaining dosage diffusion during hot spot patrols: An application of optimal foraging theory to police officer behavior. Justice Quarterly, 34(6), 1044–1068.
Google Scholar | Crossref
Taylor, R. B. (2015). Community criminology: Fundamentals of spatial and temporal scaling, ecological indicators, and selectivity bias. NYU Press.
Google Scholar | Crossref
Van Patten, I., McKeldin-Conor, J., Cox, D. (2009). A microspatial analysis of robbery: Prospective hot spotting in a small city. Crime Mapping: A Journal of Research and Practice, 1, 7–32.
Google Scholar
Weinborn, C., Ariel, B., Sherman, L. W., O' Dwyer, E. (2017). Hotspots vs. harmspots: Shifting the focus from counts to harm in the criminology of place. Applied Geography, 86, 226–244.
Google Scholar | Crossref
Weisburd, D. (2015). The law of crime concentration and the criminology of place. Criminology, 53(2), 133–157.
Google Scholar | Crossref | ISI
Weisburd, D. L., Bushway, S. D., Lum, C., Yang, S. M. (2004). Trajectories of crime at places: A longitudinal study of street segments in the city of Seattle. Criminology, 42(2), 283–322.
Google Scholar | Crossref | ISI
Wheeler, A. P. (2020). Allocating police resources while limiting racial inequality. Justice Quarterly Online Quarterly, 37(5), 842–868.
Google Scholar | Crossref
Wheeler, A. P., Steenbeek, W. (2020). Mapping the risk terrain for crime using machine learning. Journal of Quantitative Criminology Online First.
Google Scholar | Crossref
Wheeler, A. P., Worden, R. E., McLean, S. J. (2016). Replicating group-based trajectory models of crime at micro places in Albany, NY. Journal of Quantitative Criminology, 32(4), 589–612.
Google Scholar | Crossref
Wheeler, A., Gerell, M., Yoo, Y. (2020). Testing the spatial accuracy of address based geocoding for gunshot locations. The Professional Geographer, 72, 398–410.
Google Scholar | Crossref
Wolfgang, M. (1985). The national survey of crime severity. Washington, DC: U.S. Department of Justice, Bureau of Justice Statistics.
Google Scholar
Worrall, J. L., Wheeler, A. P. (2019). Evaluating community prosecution code enforcement in Dallas, Texas. Justice Quarterly, 36(5), 870–899.
Google Scholar | Crossref

Author Biographies

Andrew Wheeler is currently a Data Scientist at HMS working on problems related to predictive modeling and optimization in relation to insurance claims. Before joining HMS, he received a PhD degree in Criminal Justice from SUNY Albany. While in academia his research focused on collaborating with police departments for various problems including; evaluating crime reduction initiatives, place based and person based predictive modeling, data analytics for crime analysis, and developing models for the efficient and fair delivery of police resources.

Sydney Reuter received her master's degree at the University of Texas at Dallas in criminology. Her research interests are in crime mapping and is interested in pursuing a career in law enforcement.


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK