- Research
- Open access
- Published:
Neighborhood predictors of suicide and firearm suicide in Detroit, Michigan
Injury Epidemiology volume 11, Article number: 53 (2024)
Abstract
Background
Suicide is a leading cause of death in the United States with rates increasing over the past two decades. The rate of suicide is higher in rural areas, but a greater number of people in urban areas die by suicide; understanding risk factors for suicide in this context is critically important to public health. Additionally, while many studies have focused on individual-level risk factors, few studies have identified social or structural features associated with suicide or firearm suicide, especially among young people.
Methods
Study outcomes included total firearm suicide, total youth (age 10–29) firearm suicide, total suicide, and total youth suicide in Detroit, Michigan from 2012 through 2019. The predictors in this study included 58 census-tract level variables characterizing the physical features, residential stability, socioeconomic status, and demographics of neighborhoods in Detroit over the study period. We used random forest, extreme gradient boosting (XGBoost), and generalized linear mixed models to predict the four outcomes.
Results
We found that the tract-level variables used in all three modeling approaches performed poorly at predicting the suicide outcomes, with area under the curve values at times exceeding 0.60 but with extremely low sensitivity (ranging from 0.05 to 0.45). However, the percentage of parcels sold in arms-length transfers in the previous 5 years, the count of vacant lots per square mile, and the percentage of children aged three and older who were enrolled in preschool each demonstrated associations with at least two of the outcomes studied.
Conclusions
Our findings suggest place-based factors at the tract level do not provide meaningful insight into the risk of suicide or firearm suicide among youth or the general population in Detroit, Michigan. Future practice and study should consider focusing on both larger and smaller areas, including city and individual-level factors. For example, studies might benefit from the use of both neighborhood and individual-level measures and their interactions to improve our understanding of place-based risk factors and suicide risk.
Background
Suicide is a leading cause of death in the United States (Centers for Disease Control and Prevention 2022). While the highest rates of suicide are among males over the age of 85 (57.8 per 100,000), the years of potential life lost due to suicide are the highest among males in their 20s. For young people aged 10–29, suicide is the second leading cause of death, preceded only by unintentional injury (Centers for Disease Control and Prevention 2022). For youth and young adults racialized as Black, rates of suicide have increased dramatically (by 36.6%) since 2018 (Stone 2023). These alarming trends amplify the need for further research. Additionally, while rates of suicide are highest in rural areas, far more suicide deaths occur in urban areas due to the concentration of people in cities and metro areas (Centers for Disease Control and Prevention 2022). Understanding risk factors for youth and young adult suicide in urban populations is of critical importance.
Firearms are the most common method of suicide (Centers for Disease Control and Prevention 2022), at least in part due to their lethality; it is estimated that 90% of suicide attempts with a firearm result in death (Conner et al. 2019a). Firearm availability is also associated with increased firearm suicide risk in United States cities (Miller et al. 2013). Additionally, firearm suicide deaths are more likely to occur in areas with high rates of firearm availability (Kaplan et al. 2009).
Research on suicide and firearm suicide tends to focus on individual-level risk factors such as mental illness and mood disorders, substance use disorders, medical conditions, firearm purchases and means availability, and relationship challenges (Boggs et al. 2018; Schleimer et al. 2023, 2021; Conner et al. 2019b; Bilsen 2018). There is far less research on the social or structural factors that influence suicide risk specifically, with a few notable exceptions. Economic contraction and financial strain, which are associated with longer periods of unemployment duration and housing loss, are associated with increased suicide risk (Fowler et al. 2015; Classen and Dunn 2012). These economic factors likely exacerbate the influence of individual level risk factors such as risky alcohol consumption (Kaplan et al. 2015). A systematic review of urban contextual factors and suicide risk concluded urban areas with higher levels of socioeconomic deprivation and social fragmentation were at elevated risk for self-harm and suicidality (Satherley et al. 2022). As one additional example, the availability and quality of the mental health care service network is also critically important for suicide risk (Lang 2013; Stone et al. 2017).
A larger literature has focused on how the built environment and social and economic context might shape the mental health experience of individuals, which is in turn related to suicide risk. The material infrastructure of neighborhoods, the social connectedness of their residents, and economic opportunity can influence psycho-social stress and mental health (Clark et al. 2007; Latkin and Curry 2003). Additionally, neighborhoods with more “problems” such as violence, heavy traffic, and lack of parks or playgrounds, tend to have higher rates of depression, even after controlling for individual-level factors (Echeverría et al. 2008). In a county-level study in the United States, ecological measures of the food environment and access to exercise were associated with fewer average poor mental health days while violent crime and income inequality were associated with more average poor mental health days (Olson-Williams et al. 2023).
Social norms, economic stability, and physical features of the environment are associated with mental health and well-being (Mukherjee et al. 2021). Yet little research has focused on the role of these contextual factors in shaping suicide risk in urban areas. Additionally, the ways in which these factors influence suicide risk may differ across localities and populations. The current study takes place in Detroit, Michigan, a city with a majority Black population (greater than 75%) and a specific set of place-based, social, and economic features (United States Census Bureau: Dicennial Census 2010, 2020). The median household income in Detroit in 2019 was slightly above $30,000, and approximately one-third of the population in Detroit live below the poverty level (United States Census Bureau 2019)—figures that are far beyond the national average. Crime rates in Detroit are also among the highest in the nation, with approximately 1965 violent crimes and 4303 property crimes per 100,000 residents (US Department of Justice, Federal Bureau of Investigation 2019).
Given the alarming rise in the of rate of suicide among Black youth and young adults, and the need to better understand mental health and suicide risk in urban contexts (Núñez-González et al. 2020), this study aimed to identify features of the social and structural environment that are predictive of suicide risk and may point to interventions with potential for wide-ranging benefits. We hypothesized that higher levels of neighborhood instability, indicating lower levels of social cohesion, and greater socioeconomic deprivation would be most predictive of suicide risk.
Data
Our study population included all neighborhoods in Detroit, Michigan over the study period 2011–2019, with outcomes measured 2012-2019 and predictive features measured 2011–2018. The primary unit of analysis was census tracts (N = 297 based on 2010 Census tracts). We used census tracts as our primary unit with the assumption that the people, physical environment, and economic conditions of census tracts are salient contextual features of the neighborhood. However, the true boundaries of neighborhoods recognizable to the people who reside in them are difficult to ascertain (Coulton et al. 2013).
The outcomes of interest included the count of firearm suicides (International Classification of Diseases (ICD) codes X73-X75), youth firearm suicides (age 10–29), all suicides (ICD codes X60-X84), and all youth suicides (age 10–29), obtained from the Michigan Department of Health and Human Services (World Health Organization 2004; Michigan Department of Health and Human Services 2019). We use the term “youth” to include both children and young adults; youth suicide is a clear concern, and, as previously noted, rates of suicide are particularly high among 20–29 year old young adults. These data were geolocated by decedent place of residence and aggregated to census tracts. Because suicide is a rare event in statistical terms, we converted the counts to binary outcomes indicating the presence or absence of each outcome in each tract-year. All covariates were lagged one year with respect to the outcome. For example, firearm suicides occurring in 2012 were predicted using covariates that described the state of census tracts in 2011.
The fifty-eight predictor variables (including year) can be grouped into four main categories: physical features, residential stability, socioeconomic features, and demographic features. We also included the latitude and longitude of each census tract’s centroid to capture some element of the spatial relationship between tracts (Li 2022). Physical features include variables such as road density and walkability; the count of schools, alcohol outlets, and religious buildings; and percent of land that is protected as well as counts of single family homes per square mile, commercial buildings per square mile, historical redlining grade, and the count of buildings with quality grade D or F (Data Driven Detroit 2009; Dynamo Metrics 2020; United States Environmental Protection Agency 2018; National Center for Education Statistics; Centers for Medicare and Medicaid Services; United States Department of Homeland Security, Geospatial Management Office; Esposito et al. 2020; Finlay et al. 2020; United States Environmental Protection Agency 2020; Nelson et al. 2020). The counts of hospitals and federally qualified health centers are included as physical features in an attempt to represent the availability of mental health care. Measures of residential stability include the percentage of housing units that are renter-occupied, recent movers (scaled by GeoLytics from 0 to 1000, where 100 is the national average), the percentage of parcels with arms-length transfers in the previous 5 years, and the maximum number of arms-length transfers for a parcel in the tract (Geolytics Inc. 2019; Dynamo Metrics 2020). An arms-length transfer is a real estate transaction between two independent parties who do not have an existing relationship. Here, arms-length transfers serve as a proxy for a home being in a cycle of high turnover and thus indicative of lower social cohesion. Socioeconomic features include the percentage of the census tract population that was unemployed, the percentage of children enrolled in private schools, and median household income. The percentage of tract residents that were naturalized citizens and racial and ethnic categories are among the demographic features included (Geolytics Inc). We also included two measures of economic and racial segregation calculated based on the Index of Concentrations at the Extremes (ICE), which previous studies have found to be associated with fatal and non-fatal assaults (Krieger et al. 2017). To measure residential economic segregation, the metric is calculated as \(ICE_i=\frac{(A_i-P_i)}{T_i}\), where \(A_i\) is the number of high-income earners, \(P_i\) is the number of low-income earners, and \(T_i\) is the total count of people with income data in tract i. The 20th and 80th U.S. household income percentiles were used as the high- and low-income thresholds (Krieger et al. 2017), which were $25,000 and $100,000 in 2010 (United States Census Bureau 2020). To use ICE as a measure of residential racial segregation, the same formula is used but with \(A_i\) as the number of non-Hispanic White residents, \(P_i\) as the number of non-Hispanic Black residents, and \(T_i\) as the total count of people with race and ethnicity data in the tract i. A complete list of variables can be found in Appendix B.
There were missing values present in three variables. Alcohol outlet data was available for 2010 through 2015 and 2017 only. We used modified interpolation for 2016 and carried forward 2017 values for 2018 and 2019. Missing values occurred in two other variables: the count of poor quality buildings per square mile (15.03% missing) and historical redlining grade (7.74% missing). Because some of our methods did not allow for missing values, we imputed them using random forest proximity (Stekhoven and Buehlmann 2012; Stekhoven 2022).
Prior to modeling, we identified and removed highly correlated variables (defined as an absolute value of correlation greater than 0.6) within each of the four categories of variables described above (physical features, residential stability, socioeconomic features, and demographic features). Among the demographic variables, ICE-race was highly correlated with the racial and ethnic categories (percentage Black, Hispanic, White, and other), the percentage of residents that were naturalized citizens, and the percentage of residents that were not citizens. We removed the variables with which ICE-race was highly correlated and retained ICE-race, population, and the percentage of the population that was 15–29 year old males. In the residential stability category, recent movers and the percentage of residents that were renters were highly correlated; we excluded recent movers. In the socioeconomic features category, ICE-income was highly correlated with median household income and the percentage of the population below 150% of the poverty line, and the percentage of parcels that were tax foreclosed was highly correlated with the percentage of parcels that were tax delinquent; we removed the median household income, percentage below poverty, and percentage tax foreclosed measures. Of the physical feature variables only the acres of land protected from development and the percentage of land protected from development were highly correlated; we excluded the amount in acres of protected land. In total, we removed 11 variables in this process, leaving 47 covariates for use in all models.
For XGBoost, all predictor variables were normalized to generate 0–1 data, and for the regression models, all predictor variables were standardized to facilitate interpretation. We attempted the use of non-standardized variables, but these models did not converge.
Methods
We began by exploring the data descriptively. We used measures of central tendency such as means and percentages to compare tract-years with and without firearm suicides and youth firearm suicides, excluding missing values. To assess significant differences, we estimated a series of generalized linear mixed effects logistic regression models (GLMMs)–modeling each covariate on its own, controlling for population and a random intercept for census tract and using a 0.05 significance level. We repeated this process for each of the outcomes.
For each year in the study period, we also calculated a Moran’s I statistic for each outcome to assess the presence of spatial autocorrelation.
We used machine learning algorithms and mixed effects logistic regression to develop prediction algorithms. Predictive accuracy was assessed for all approaches through performance metrics, including area under the ROC curve (AUC), sensitivity, balanced accuracy, and area under the precision-recall curve (AUPRC), a metric often used for imbalanced data. For the machine learning algorithms, we partitioned our data into training and test sets, such that 80% of census tracts were included in the training set and the remaining 20% were used in the test set.
We implemented random forest, which performs well in a variety of settings, including rare event detection (Muchlinski et al. 2016). Briefly, random forest is an ensemble of decision trees, each built from a random bootstrap sample of data. Each tree uses a recursive process of splits, built with a random sample of mtry variables at each node, with the best one picked to partition the data into two at each node (where “best” means it splits the data into the two most homogeneous groups in the classification setting). This process is repeated until node purity or a minimum node size is reached. Each tree in the forest classifies each observation, and the random forest aggregates across the tree votes (Breiman 2001; Hastie et al. 2009). Individual trees are prone to overfitting, producing low bias but high variance estimates. By aggregating across all trees, the random forest algorithm reduces this variance (Breiman 2001; Hastie et al. 2009). Because each tree consists of only a sample of the data, it uses an out-of-bag error estimate that is equivalent to N-fold cross validation error as the size of the forest increases, making it unnecessary to also implement cross validation when using random forest (Hastie et al. 2009). Additionally, random forest allows for undersampling as a way to address imbalanced data. When creating the bootstrapped sample to build each tree, it can select a stratified sample. This ensures each tree is built using balanced data, and its performance can still be tested with imbalanced data. We employed a stratified random forest on the training data, using the case count in the training set as the strata size, tuning the mtry parameter to maximize area under the ROC curve (AUC).
We also employed extreme gradient boosting, or XGBoost, another ensemble machine learning algorithm. Boosting is a method that recursively combines models together, with each model predicting the residuals of the prior one. It minimizes a regularized objective that consists of a loss function and a complexity penalty using gradient descent. This process continues until some minimum threshold in prediction improvement is met (Chen and Guestrin 2016). We assigned weights to the minority class that were equal to the ratio of the majority class size to the minority class size and kept weights for the majority class at 1. We used 5000 rounds of five-fold cross validation with XGBoost and tuned the hyperparameters using a random search with 50 iterations, with value ranges shown in the first 6 rows of Table 3. With those parameter values, we then tuned the number of rounds as shown in the last row of Table 3. For all tuning purposes, the evaluation metric was AUC to account for the imbalanced data set. We assessed performance on the test set using the metrics described above. All tuning and building was done using R packages xgboost and mlr (Chen et al. 2024; Bischl et al. 2016; Lang et al. 2014; Bischl et al. 2016, 2017; Probst et al. 2017; Casalicchio et al. 2017).
Finally, we estimated a series of generalized linear mixed effects logistic regression models with a random intercept for census tract to identify risk factors for each outcome. The variables in these models are standardized with mean of 0 and standard deviation of 1 to facilitate interpretation. We first used all of the de-correlated variables as independent predictors (denoted Model 1). We then ran a model including only the significant predictors (\(p<0.10\)) from the previous model (denoted Model 2). We repeated this process using population as an offset instead of a covariate, first with all de-correlated variables (denoted Model 3) and then with only significant covariates from Model 3 (denoted Model 4). For the models with an offset, we dropped three census tracts with zero population from the analysis. These models were built using R package glmmTMB (Brooks et al. 2017). To distinguish between outcomes in this paper, an abbreviation is attached to each model number: “FS” for firearm suicide, “YFS” for youth firearm suicide, “S” for all suicide, and “YS” for all youth suicide.
Results
In the next section, we show results of our exploratory analysis and then separately show machine learning and model results for each of the four outcomes. We present detailed results corresponding to the analysis for the firearm suicide outcome in the text below, while tables for the additional outcomes can be found in the appendices. Information on tuning parameters for random forest and XGBoost can also be found in the Appendix (Tables 3 and 4).
Exploratory results
Summaries of all covariates from all census tracts for the duration of our study period can be found in Table 5. Among all tracts, there were 203 tract-years with at least 1 firearm suicide and 2173 tract-years with 0 firearm suicides. With the three zero-population census tracts removed, there were 2149 tract-years with 0 firearm suicides, while the firearm suicide count remained the same. Comparing the values in Table 5, tract-years with firearm suicide deaths tended to have more condominiums per square mile, a higher average percentage of parcels with arms-length transfers in the last 5 years, and fewer vacant lots per square mile. (See also Table 1.)
There were 61 tract-years with youth firearm suicide, and there were no covariates that demonstrated a statistically significant association with the outcome in their individual GLMMs (Table 6).
There were 348 tract-years with any suicide, and four covariates showed significant differences between the tracts with and without suicide (Table 1). Tracts with suicide had a higher average maximum number of arms-length transfers per parcel, a higher average percentage of parcels with an arms-length transfer, a higher average percentage of children aged three and older in preschool, and a lower average number of vacant lots per square mile than tracts without suicide.
There were 113 tract-years with youth suicide, and two covariates with significant differences; tracts with youth suicide showed a higher percentage of parcels with an arms-length transfer and a higher percentage of children aged three and older in preschool than tracts without youth suicide (Table 1).
Using Moran’s I test, we did not detect spatial autocorrelation in the majority of years, with the exception of counts of firearm suicide in 2012 and youth firearm suicide in 2014 (Table 7).
Firearm suicide
Both machine learning algorithms demonstrated poor prediction performance for firearm suicide. Using the test set, the predictions from the stratified random forest yielded sensitivity of 0.10, AUC of 0.54, and AUPRC of 0.09 (Table 2). XGBoost showed similar performance to random forest: sensitivity of 0.05, AUC of 0.59, and AUPRC of 0.10. All model performance metrics are shown in Table 2.
Model 1-FS used all de-correlated variables as predictors. In Model 2-FS, we used only the significant (at \(\alpha =0.10\)) covariates from Model 1-FS: latitude, large multi-family units per square mile, vacant lots per square mile, count of hospitals, count of religious buildings, percent of children aged 3+ in preschool, and historical redlining grade. Model 3-FS used all de-correlated variables except population as a predictor, instead including population as an offset (where population remained on its original scale). Model 4-FS used the significant covariates from Model 3-FS with a population offset: small multi-family units per square mile, large multi-family units per square mile, vacant lots per square mile, percent of parcels with vacant buildings, count of hospitals, percent of housing occupied by renters, percent of children aged 3+ in preschool, and historical redlining grade. All four models also do a poor job of predicting firearm suicide tract-years (Table 2); predictions from these models are all zero (i.e. predict no firearm suicide in all tract-years).
Youth firearm suicide
The predictions of youth firearm suicide from the stratified random forest yielded a sensitivity of 0.25, AUC of 0.67, and AUPRC of 0.05 (Appendix Table 8). XGBoost showed worse performance than random forest: sensitivity of 0.17, AUC of 0.43, and AUPRC of 0.024.
For youth firearm suicide, Model 2-YFS and Model 4-YFS included the only significant variables from their full model counterparts: industrial buildings per square mile and road network density, plus population (as covariate in Model 2-YFS and offset in Model 4-YFS). The performance of these models was poor, with no predictions of the minority class (Table 8).
Suicide
The predictions of suicide from the stratified random forest displayed a sensitivity of 0.25, AUC of 0.59, and AUPRC of 0.19 (Table 9). XGBoost here performed similarly to random forest: sensitivity of 0.26, AUC of 0.53, and AUPRC of 0.20.
Model 2-S used the covariates vacant lots per square mile, percent of parcels undergoing arms-length transfer in previous 5 years, percent of children aged 3 and older enrolled in preschool, historic redlining grade, racial residential segregation, and population. Model 4-S included vacant lots per square mile, percent of parcels with vacant buildings, percent of parcels undergoing an arms-length transfer in previous 5 years, percent of children aged 3 and older enrolled in preschool, historic redlining grade, and population offset. These models performed poorly, with no predictions of the minority class (Table 9).
Youth suicide
The predictions of youth suicide from the stratified random forest displayed a sensitivity of 0.45, AUC of 0.69, and AUPRC of 0.09. XGBoost performed slightly worse than random forest: sensitivity of 0.09, AUC of 0.61, and AUPRC of 0.06. More performance metrics for these algorithms are shown in Table 10.
For all youth suicide, Model 2-YS used the covariates industrial buildings per square mile, percent of parcels with vacant buildings, percent of renter-occupied housing units, percent of children aged 3 and older enrolled in preschool, percent of tax-delinquent parcels, racial residential segregation, and population. Model 4-YS was similar, excluding racial segregation and instead including population as an offset. These models performed poorly, with no predictions of the minority class (Table 10).
Discussion
This study aimed to identify the physical, social, and economic features of the neighborhood environment that contribute most to the prediction of suicide and firearm suicide overall and among youth and young adults in Detroit, Michigan from 2012 to 2019. Identifying features of the social and structural environment that are predictive of suicide risk may point to interventions with potential for wide-ranging benefits. Using the machine learning algorithms random forest and XGBoost, as well as mixed effects logistic regression, we found that the variables we used were unable to predict firearm suicide well at the census tract level. Given that machine learning algorithms like XGBoost and random forest typically perform well with rare events, especially compared to traditional models, our results suggest the included variables provide little to no predictive insight at this level of granularity in our study site.
While model performance was poor overall, there were a few patterns that emerged from this analysis. The percentage of parcels sold in arms-length transfers in the previous 5 years exhibit a significant positive association in exploratory analyses with three outcomes (firearm suicide, suicide, and youth suicide), as well as appearing statistically significant in both full GLMMs for the suicide outcome (Models 1-S and 3-S). A higher percentage of arms-length transfers can indicate higher neighborhood turnover and potentially lower social cohesion. Several other studies have found a relationship between social isolation and adverse mental health effects (Fedina et al. 2023; Lim et al. 2017). The count of vacant lots per square mile showed a negative association with firearm suicide and suicide, and it also appeared as significant in Models 1 and 3 for firearm suicide and suicide. We hypothesize that the count of vacant lots per square mile may simply be a proxy for population at risk that is not otherwise captured by our variables: fewer vacant lots may indicate greater population density. Finally, the percentage of children aged three and older enrolled in preschool showed a significant positive association with suicide and youth suicide; it was also significant in Models 1 and 3 for firearm suicide, suicide, and youth suicide. This finding is unexpected, and we are unaware of a plausible explanation for this relationship.
Differences in our findings and those of some prior studies may in part be explained by our relatively unique study population. Whereas other studies that used statewide data found that non-metropolitan areas had higher suicide risks than metropolitan areas and that predominantly Black neighborhoods had lower suicide risk than neighborhoods that were not predominantly Black, Detroit as a city has relatively low variation within its boundaries with respect to race, urbanicity, and economics (Sugg et al. 2023; Fontanella et al. 2018). We posit that the lack of variation in citywide data compared to statewide data may account for our different findings. It is possible we may have detected associations if we were making comparisons across a more diverse area with respect to those three variables.
By limiting the study to neighborhoods in Detroit, we are looking at a restricted area of the full distribution of the studied variables; socioeconomic features in much of the city of Detroit tend to indicate reduced wealth and lower social and economic opportunity. Perhaps in this context, neighborhood features are less important than city features, and policies or programs that improve conditions for the city as a whole would be of greatest benefit. Additionally, this study did not include individual-level data and as such, cannot shed light on individual risk factors, though previous research indicates that programs tailored to individuals at greater risk are a critical element of prevention (Stone et al. 2017).
As a predominantly Black metropolis, the city’s population overall is at lower risk for suicide death than among other groups, especially when compared to a statewide population. However, it is important to emphasize that the suicide rate among Black Americans has increased significantly in recent years (Stone 2023). This coincides with an increase in Black gun ownership. During the 2020 surge in gun purchasing, Black individuals were 1.7 times more likely than white individuals to become first time gun owners (Simonson et al. 2021). These are concerning trends that deserve more attention and further study. Given how recently firearm ownership trends have changed among Black individuals, future studies are needed that also include death data from after 2020.
In addition to adding to the body of work about suicide in a primarily Black city, this study also contributes further research on the role of environmental factors and suicides and firearm suicides, particularly at the census tract level. Given that previous studies have shown individual-level factors are important but found mixed results for contextual variables measured at the area level, we hypothesized that analysis at the census tract-level, a relatively small areal unit, could provide useful insights (Rehkopf and Buka 2006). This study considers a large number of variables, including commonly used demographic and socioeconomic variables as well as the somewhat novel addition of building and land information; this better captures the neighborhood environments in which people live compared to previous studies with fewer such variables. Finally, it utilizes both traditional regression modeling techniques and novel machine learning algorithms, which handle rare outcomes and collinearity well and do not necessitate distributional assumptions.
This study should be interpreted with consideration of the following limitations. First, our proxies for mental health care availability (hospitals and federally qualified health centers) do not fully capture the range of mental health services available in Detroit; more explicit measures of mental health care availability (e.g. provider density) were found to be associated with the presence of suicide clusters in previous studies (Sugg et al. 2023; Fontanella et al. 2018). We do not expect that our measures biased the results, but they do not allow us to fully explore this specific aspect of the neighborhood environment. Also, this study does not explicitly account for spatial and temporal effects, though we did include year, latitude, and longitude. The primarily null results found in this paper, as well as conflicting results in previous works on this topic, may be an artifact of the modifiable areal unit problem (Buzzelli 2020). Finally, this study of Detroit may not be generalizable to other metropolitan areas, especially those that differ with respect to race and economic trends.
Conclusions
Place factors alone were not predictive of suicide or firearm suicide in our study of Detroit, Michigan at the census tract level, where variation in socioeconomic status, a commonly identified predictor, is limited. It may be that spatial measures characterizing the places where people live are less useful for explaining suicide risk in this setting. Future practice and study should consider focusing on both larger and smaller areas, including city and individual-level factors. For example, studies identifying features associated with suicide and firearm suicide risk might consider combining place-based variables with individual-level risk factors to understand their relative importance as well as the potential for interaction effects.
Data availability
The data used in this study are not openly availability due to sensitivity. Mortality data is owned and maintained by the Michigan Department of Health and Human Services.
Abbreviations
- AUC:
-
Area under ROC curve
- AUPRC:
-
Area under precision-recall curve
- GLMM:
-
Generalized linear mixed model
- ICD:
-
International classification of diseases
- ICE:
-
Index of concentrations at the extremes
- ROC:
-
Receiver operating characteristic curve
- XGBoost:
-
Extreme gradient boosting
References
Bilsen J. Suicide and youth: risk factors. Front Psychiatry. 2018;9:540.
Bischl B, Lang M, Kotthoff L, Schiffner J, Richter J, Studerus E, Casalicchio G, Jones ZM. mlr: Machine learning in r. J Mach Learn Res. 2016;17(170):1–5.
Bischl B, Kuehn T, Szepannek G. On class imbalance correction for classification algorithms in credit scoring. In: Operations Research Proceedings 2014, pp. 37–43. Springer, 2016.
Bischl B, Richter J, Bossek J, Horn D, Thomas J, Lang M. mlrmbo: A modular framework for model-based optimization of expensive black-box functions; 2017, arXiv preprint arXiv:1703.03373
Boggs JM, Beck A, Hubley S, Peterson EL, Hu Y, Williams LK, Prabhakar D, Rossom RC, Lynch FL, Lu CY, et al. General medical, mental health, and demographic risk factors associated with suicide by firearm compared with other means. Psychiatric Serv. 2018;69(6):677–84.
Breiman L. Random forests. Mach Learn. 2001;45:5–32.
Brooks ME, Kristensen K, van Benthem KJ, Magnusson A, Berg CW, Nielsen A, Skaug HJ, Maechler M, Bolker BM. glmmTMB balances speed and flexibility among packages for aero-inflated generalized linear mixed modeling. R J. 2017;9(2):378–400. https://doi.org/10.32614/RJ-2017-066.
Buzzelli M. Modifiable areal unit problem. Int Encycloped Hum Geogr, 2020;169.
Casalicchio G, Bossek J, Lang M, Kirchhoff D, Kerschke P, Hofner B, Seibold H, Vanschoren J, Bischl B. Openml: an r package to connect to the machine learning platform openml. Comput Statist. 2017;34:977–91.
Centers for Medicare and Medicaid Services: Provider Services. Accessed 2023 March 1, https://data.cms.gov/provider-characteristics/hospitals-and-other-facilities/provider-of-services-file-hospital-non-hospital-facilities.
Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, 2016:785–794.
Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, Chen K, Mitchell R, Cano I, Zhou T, Li M, Xie J, Lin M, Geng Y, Li Y, Yuan J. Xgboost: extreme gradient boosting; 2024. R package version 1.7.7.1. https://CRAN.R-project.org/package=xgboost
Clark C, Myron R, Stansfeld S, Candy B. A systematic review of the evidence on the effect of the built and physical environment on mental health. J Public Mental Health. 2007;6(2):14–27.
Classen TJ, Dunn RA. The effect of job loss and unemployment duration on suicide risk in the United States: a new look using mass-layoffs and unemployment duration. Health Econ. 2012;21(3):338–50.
Conner A, Azrael D, Miller M. Suicide case-fatality rates in the United States, 2007 to 2014: a nationwide population-based study. Ann Intern Med. 2019a;171(12):885–95.
Conner KR, Bridge JA, Davidson DJ, Pilcher C, Brent DA. Metaanalysis of mood and substance use disorders in proximal risk for suicide deaths. Suicide Life-Threaten Behav. 2019b;49(1):278–92.
Centers for Disease Control and Prevention: Web-based Injury Statistics Query and Reporting System (WISQARS). 2022. http://www.cdc.gov/ncipc/wisqars. Accessed 13 Nov 2023.
Coulton CJ, Jennings MZ, Chan T. How big is my neighborhood? Individual and contextual effects on perceptions of neighborhood scale. Am J Commun Psychol. 2013;51(1–2):140–50.
Data Driven Detroit: Residential Parcel Survey; 2009. Accessed 2023 February 28, https://portal.datadrivendetroit.org/datasets/9b2f15c003dc4652a468358958151fac_0
Dynamo Metrics: Unpublished Report; 2020.
Echeverría S, Diez-Roux AV, Shea S, Borrell LN, Jackson S. Associations of neighborhood problems and neighborhood social cohesion with mental health and health behaviors: the multi-ethnic study of atherosclerosis. Health Place. 2008;14(4):853–65.
Esposito M, Li M, Finlay J, Gomez-Lopez I, Khan A, Clarke P, Chenoweth M. National Neighborhood Data Archive (NaNDA): Eating and Drinking Places by Census Tract, United States, 2003–2017. Ann Arbor, MI: Inter-university Consortium for Political and Social Research, 2020;2. https://www.openicpsr.org/openicpsr/project/115404/version.
Fedina L, Moss L, Rousson AN, Smith ME, Bright C, Herrenkohl TI, DeVylder J. Effects of neighborhood disconnection on psychological distress and suicide risk associated with interpersonal violence within racial groups. J Child Adolesc Trauma. 2023;16(3):681–97.
Finlay J, Li M, Esposito M, et al. National Neighborhood Data Archive (NaNDA): liquor, tobacco, and convenience stores by census tract, United States, 2003–2017. Ann Arbor, MI: Inter-university Consortium for Political and Social Research, 2020. https://www.openicpsr.org/openicpsr/project/123541/version, 1.
Fontanella CA, Saman DM, Campo JV, Hiance-Steelesmith DL, Bridge JA, Sweeney HA, Root ED. Mapping suicide mortality in Ohio: a spatial epidemiological analysis of suicide clusters and area level correlates. Prevent Med. 2018;106:177–84.
Fowler KA, Gladden RM, Vagi KJ, Barnes J, Frazier L. Increase in suicides associated with home eviction and foreclosure during the US housing crisis: findings from 16 national violent death reporting system states, 2005–2010. Am J Public Health. 2015;105(2):311–6.
Geolytics Inc.: 2011-2019 Annual Estimates Premium, Brunswick, New Jersey; 2019.
Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction, vol. 2. Cham: Springer; 2009.
Kaplan MS, McFarland B, Huguet N. Characteristics of adult male and female firearm suicide decedents: findings from the national violent death reporting system. Injury Prevent. 2009;15(5):322.
Kaplan MS, Huguet N, Caetano R, Giesbrecht N, Kerr WC, McFarland B. Economic contraction, alcohol intoxication and suicide: analysis of the national violent death reporting system. Injury Prevent. 2015;21(1):35–41.
Krieger N, Feldman JM, Waterman PD, Chen JT, Coull BA, Hemenway D. Local residential segregation matters: stronger association of census tract compared to conventional city-level measures with fatal and non-fatal assaults (total and firearm related), using the index of concentration at the extremes (ice) for racial, economic, and racialized economic segregation, Massachusetts (US), 1995–2010. J Urban Health. 2017;94:244–58.
Lang M. The impact of mental health insurance laws on state suicide rates. Health Econ. 2013;22(1):73–88.
Lang M, Kotthaus H, Marwedel P, Weihs C, Rahnenfuehrer J, Bischl B. Automatic model selection for high-dimensional survival analysis. J Statist Comput Simul. 2014;85(1):62–76.
Latkin CA, Curry AD. Stressful neighborhoods and depression: a prospective study of the impact of neighborhood disorder. J Health Soc Behav. 2003;44:34–44.
Li Z. Extracting spatial effects from machine learning model using local interpretation method: an example of shap and xgboost. Comput Environ Urban Syst. 2022;96: 101845.
Lim S, Chan PY, Walters S, Culp G, Huynh M, Gould LH. Impact of residential displacement on healthcare access and mental health among original residents of gentrifying neighborhoods in New York City. PloS One. 2017;12(12):0190139.
Michigan Department of Health and Human Services: Mortality Data; 2019.
Miller M, Warren M, Hemenway D, Azrael D. Firearms and suicide in US cities. Injury Prevention; 2013.
Muchlinski D, Siroky D, He J, Kocher M. Comparing random forest with logistic regression for predicting class-imbalanced civil war onset data. Polit Anal. 2016;24(1):87–103.
Mukherjee S, Frimpong Boamah E, Ganguly P, Botchwey N. A multilevel scenario based predictive analytics framework to model the community mental health and built environment nexus. Sci Rep. 2021;11(1):17548.
National Center for Education Statistics: School Geocodes and Geoassignments, 2015-2019. Accessed 2023 March 1, https://nces.ed.gov/programs/edge/geographic/schoollocations.
Nelson RK, Winling L, Marciano R, Connolly N, Ayers EL. Mapping inequality: redlining in new deal America. American Panorama: an atlas of United States history. University of Richmond: Digital Scholarship Lab 17, 19; 2020.
Núñez-González S, Delgado-Ron JA, Gault C, Lara-Vinueza A, Calle-Celi D, Porreca R, Simancas-Racines D. Overview of “systematic reviews’’ of the built environment’s effects on mental health. J Environ Public Health. 2020;2020(1):9523127.
Olson-Williams H, Grey S, Cochran A. Ecological study of urbanicity and self-reported poor mental health days across US counties. Commun Mental Health J. 2023;59(5):986–98.
Probst P, Au Q, Casalicchio G, Stachl C, Bischl B. Multilabel classification with R package mlr; 2017, arXiv preprint arXiv:1703.08991
Rehkopf DH, Buka SL. The association between suicide and the socio-economic characteristics of geographical areas: a systematic review. Psychol Med. 2006;36(2):145–57.
Satherley R-M, Hazell CM, Jones CJ, Hanna P. A systematic review of the effects of urban living on suicidality and self-harm in the UK and Ireland. J Urban Health. 2022;99(3):385–408.
Schleimer JP, Wright MA, Shev AB, McCort CD, Asif-Sattar R, Sohl S, Stewart SL, Wintemute GJ, Kagawa RM. Alcohol and drug offenses and suicide risk among men who purchased a handgun in California: A cohort study. Prevent Med. 2021;153:106821.
Schleimer JP, Kagawa RM, Laqueur HS. Association of medical conditions and firearm suicide among legal handgun purchasers in California: a case-control study. Injury Epidemiol. 2023;10(1):26.
Simonson MD, Lazer D, Perlis R, Baum M, Green J, Gitomer A, Ognyanova K, Druckman J, Lin J, Santillana M, et al. The COVID States Project# 37: Gun Purchases During the COVID-19 Pandemic. OSF, 2021.
Stekhoven DJ. missForest: nonparametric missing value imputation using random forest; 2022. R package version 1.5
Stekhoven DJ, Buehlmann P. Missforest-non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112–8.
Stone DM, Holland KM, Bartholow BN, Crosby AE, Davis SP, Wilkins N. Preventing Suicide: A Technical Package of Policies, Programs, and Practice; 2017.
Stone DM. Notes from the field: Recent changes in suicide rates, by race and ethnicity and age group—United States, 2021. MMWR. Morbidity and Mortality Weekly Report, 2023;72.
Sugg MM, Runkle JD, Andersen LM, Desjardins MR. Exploring place-based differences in suicide and suicide-related outcomes among North Carolina adolescents and young adults. J Adolesc Health. 2023;72(1):27–35.
United States Census Bureau. Table H-1: Income limits for each fifth and top 5 percent. Census Bureau: All Races. U.S; 2020.
United States Census Bureau: American Community Survey 5-Year Estimates. Accessed 2024 August 7; 2019, https://data.census.gov/table/DECENNIALPL2020.P1
United States Census Bureau: Dicennial Census, 2010 - Detroit, Michigan. Accessed 2024 August 7; 2010, https://data.census.gov/table/DECENNIALPL2010.P1
United States Census Bureau: Dicennial Census, 2020 - Detroit, Michigan. Accessed 2024 August 7; 2020, https://data.census.gov/table/DECENNIALPL2020.P1
United States Department of Homeland Security, Geospatial Management Office: Homeland Infrastructure Foundation-Level Data: All Places of Worship. Accessed 2023 March 1, https://hifld-geoplatform.opendata.arcgis.com/datasets/geoplatform::all-places-of-worship/explore
United States Environmental Protection Agency: National Walkability Database; 2020. Accessed 2023 March 1, https://catalog.data.gov/dataset/walkability-index1
United States Environmental Protection Agency: Smart Location Database, (2018). Accessed 2023 March 1, https://catalog.data.gov/dataset/smart-location-database1
US Department of Justice, Federal Bureau of Investigation: Uniform Crime Reporting Handbook. Accessed 2024 August 7; 2019, https://ucr.fbi.gov/crime-in-the-u.s/2019/crime-in-the-u.s.-2019
World Health Organization: International Statistical Classification of Diseases and Related Health Problems: Alphabetical Index vol. 3. World Health Organization, 2004.
Acknowledgements
The authors would like to acknowledge Benjamin Calnin for his contributions to data acquisition.
Funding
This project was supported by Award No. K01CE00322402, awarded by the Centers for Disease Control. Additional funding was provided by the California Firearm Violence Research Center.
Author information
Authors and Affiliations
Contributions
CS: design of the work, data acquisition, data analysis, interpretation of data, initial draft of work; VAP: conception and design of the work, data acquisition, substantive revisions; RMCK: conception and design of the work, data acquisition, initial draft, substantive revisions.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study was reviewed by the Institutional Review Board of University of California Davis and received a determination of exempt on November 23, 2020. All authors certify that the study was performed in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards. Written informed consent was not required for this study as we did not gather data firsthand from participants but rather procured data from various government agencies and databases.
Consent for publication
Not applicable.
Competing interests
The authors have no Conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Machine learning algorithm tuning
Appendix B: Exploratory analyses
Appendix C: Algorithm and model performance metrics for additional outcomes
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Smirniotis, C., Pear, V.A. & Kagawa, R.M.C. Neighborhood predictors of suicide and firearm suicide in Detroit, Michigan. Inj. Epidemiol. 11, 53 (2024). https://doi.org/10.1186/s40621-024-00530-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s40621-024-00530-4