Analyzing Quality of Food Across Economic and Racial Groups in Chicago
Team NKMG
Gabriel Lima, Kenyu Kobayashi, Murat Topak, & Niko Masiero
Applied Data Analysis, EPFL, Autumn 2019
Racism is arguably institutionalized in American society. Result of almost 100 years of slavery and consequent lack of opportunity for the black population to get back on their feet has created various barriers to this community. This situation has led to the exclusion of the black population from health, educational, social, and economic resources. Data suggests that black Americans are more likely to suffer from mental illness, can live up to 20 years less than an average white American, and do not have the same higher level education opportunities.
Another important problem in the current world landscape is the precarious food distribution across different regions. For instance, diabetes has hit minorities the hardest and many layers of society worldwide suffer of poverty and hunger. However, there has not been any analysis on the quality of food provided to different social and racial groups in a certain location. Following the trend of minorities receiving lower quality services and products, we hypothesize that the same would follow in terms of the quality of the food available to the black and poorest population. Therefore, we ask the following research question:
In order to answer our question, we utilized a dataset of food inspections in the city of Chicago. We use the results of the inspections as a proxy for a measure of food quality in a certain location. If a certain neighborhood has a higher failure rate, we can infer that the food available there is of lower quality. As a form of measuring economic and racial variables in the city of Chicago, we also obtained information about the racial distribution across neighborhoods, the city of Chicago’s measurement of social and economic hardship , and data regarding crimes committed in the city. With these datasets, we divide our main research question in three different sub-questions:
The dataset provided by the city government identifies in which restaurant the inspection was conducted, the final result, violations, risk, location, and date. With this information, we calculate 6 measures of food quality and food inspections for each neighborhood.
Among all the variables obtained in the US nationwide census, we only used two factors for our project. First, we obtained each neighborhood's population so we can normalize our measurements according to the population size of a neighborhood, and its racial distribution, as a measure of how many citizens belong to racial minorities.
The city of Chicago also makes available the economic hardship of all its community areas. The economic hardship index is a function of:
This dataset was also provided by Chicago's government. It reflects reported incidents of crime of various types. We have chosen to remove crimes that do not directly correlate with the level of violence of a neighborhood. For instance, we remove non-violent crimes and records of car violations as the vehicle might not belong to the neighborhood where the crime was reported. We present the number of crimes per capita in the map on the right.
Our results indicate that there does not exist a strong correlation between the food quality and our tackled socio-economic variables. With the exception of a small non-linear correlation between the percentage of African Americans in a community and the number of inspections, both Pearson and Spearman correlation indexes are not statistically significant.