Restaurant Violations in NYC, 2022

Food poisoning is a real public health threat. Every year, approximately 48 million people get sick from food poisoning, 128 thousand are hospitalized, and 3,000 die from foodborne illnesses. According to estimates published by the Center for Science in the Public Interest, about two-thirds of food poisonings occur through restaurants. To keep this threat in check, the City of New York’s Department of Health and Mental Hygiene operates a restaurant inspection system that monitors all food vendors – from street trucks to five-star restaurants – regularly, and their grades must be posted publicly. A vendor with too many violations, or violations that are too serious, will be shut down. The system tries to safeguard the public from restauranteurs who do not take food safety seriously, and allows consumers to weigh food safety when choosing restaurants.

This analysis looks for patterns in food safety by geographic area. It asks whether there are neighborhoods in which food safety seems generally more or less safe than others. In terms of geography, there is no immediately obvious pattern, except perhaps that inspection failures might be more infrequent in low-density areas. This may be partly due to lower rates of rodent infestation, which is a major source of problems in New York City. Further analysis is required before being able discern finer differences.

It should be noted: Readers should avoid the ecological fallacy of assuming that general patterns observed in neighborhoods apply to all restaurants in that area. Restaurants are rated individually, and should be judged as such. If anything, an “A” rating in a generally “C” neighborhood or cuisine is a gem that stands out from its peers.

Data

The analysis uses violations data collected by the New York City Department of Health and Mental Hygiene, and distributed by NYC Open Data. In this analysis, we will look at all restaurants that were inspected at least once in 2022. Our analysis will focus on every restaurant’s lowest overall grade for that year. Restaurants are evaluated on a demerit points system, [described here] (https://www.nyc.gov/assets/doh/downloads/pdf/about/healthcode/health-code-chapter23.pdf).1

Analysis

Below, we examine geographic patterns in both high- and low-scoring restaurants.

C-Grade Restaurants

Which zip codes have the highest proportion of “C” rated restaurants, relative to their overall restaurant population? We only consider zip codes with at least 10 inspections. Note that we have top-coded the proportions at 30% to improve legibility. These outliers are described below:

I could find no immediate patterns in the map, except that the incidence of C-grades is low in parts of the city that I know to be lower density, at least for the most part. First, consider Table 1 (below), which describes the zip codes in which C grades were most common:

NeighborhoodBoroughZipcodeN% C Grade
HarlemManhattan10030230.421
MidtownManhattan10169160.313
Fresh MeadowsQueens11366440.306
South Richmond HillQueens11419770.306
East HarlemManhattan10035660.300
Springfield GardensQueens11413440.297
GravesendBrooklyn112291110.294
Borough ParkBrooklyn11219900.286
CanarsieBrooklyn112361080.280
MidtownManhattan10121120.273

I could not immediately discern a pattern here. The offending neighborhoods are spread across boroughs, and include zip codes that I know to be higher and lower income. Interestingly, the third-worst zip code in New York is just outside of Queens College campus. Make of that what you will.

Table 2 (below) describes the zip codes with the lowest incidence of C-grades.

NeighborhoodBoroughZipcodeN% C Grade
MidtownManhattan10118100.000
TottenvilleStaten Island10307250.000
JamaicaQueens11433200.000
JamaicaQueens11430520.020
Princes BayStaten Island10309640.032
Rockaway ParkQueens11694300.036
ArverneQueens11693230.046
Cambria HeightsQueens11411180.059
East New YorkBrooklyn11239170.063
BaysideQueens11426190.063

From these figures, and across the least-offending neighborhoods, the only pattern that I could discern was that there was a plurality of neighborhoods that I know to be low-density, for example the neighborhoods in Staten Islands, around the Rockaways, and in the uppermost reaches of the Bronx. One reason that this pattern could exist is that these neighborhoods might be lower density, which may cause a lower incident of rodent infestations. Along with improper temperature storage, rodent infestation is a leading cause of restaurant violations in the city.

Most A-Grade Restaurants

Note that the proportions on this figure are top-coded at 80%, with outliers described in the table below.

Table 3 (below) describes the zip codes with the highest incidence of “A” grades.

BoroughNeighborhoodzipcodeN% A Grade
QueensJamaica Estates11430520.90
ManhattanGarment District10118100.89
QueensFar Rockaway / Arverne11693230.86
BronxCity Island / Fordham10464290.85
QueensBeechhurst11360120.83
QueensEast Elmhurst11370190.83
ManhattanRockefeller Center / Diamond10020290.82
QueensRockaway Peninsula11694300.82
BrooklynBrownsville / Ocean Hill / E.11239170.81
QueensJamaica11436210.81

The main pattern observable to the naked eye is proximity to airports and low population density. Airports have many restaurants, and are surrounded by hotels with in-house or nearby restaurants. I speculate that these areas are likely to have many restaurants set in high-quality facilities.

Table 4 (below) describes the zip codes with the lowest incidence of “A” grades.

NeighborhoodBoroughZipcodeN% A Grade
HarlemManhattan10039230.4211
Upper East SideManhattan10128910.4217
Queens VillageQueens11428310.4286
WilliamsbridgeBronx10466670.4340
CanarsieBrooklyn112361080.4516
HarlemManhattan10030230.4737
Battery Park CityManhattan10280220.4737
GravesendBrooklyn112291110.4804
WilliamsbridgeBronx104671260.4825
South Richmond HillQueens11419770.4861

Some of the same neighborhoods appear in Table 1, which depicted the highest incidence of failing grades: Harlem, South Richmond Hill, and Canarsie. Aside from these high-offending areas, there appears to be a mixutre of neighborhoods.

Findings

This analysis sough geographic patterns in the distribution of high and low grades in restaurant inspections. The data did not give us many clear and decisive guidelines about neighborhoods that are safe or unsafe for dining. It did suggest that dining might be safer around airports, which may be driven by high cleanliness scores in airports and hotels. In terms of unsafe areas, it seems that risk is spread across boroughs, and can occur in higher- and lower-income areas. It may be safer in lower density areas that are further from the city center, which may be due to fewer rodent problems. Further research is required for these speculations to be confirmed. The data do suggest that many restaurants fare poorly in Harlem, South Richmond Hill, or Canarsie, though this analysis does not discern why these problems exist.


  1. I am grateful for Seth Mandel’s help in finding these documents.↩︎

Welcome to the New DataBlog

The QC DataBlog is a publication outlet for data analyses performed in the Queens College Data Analytics community.  It is intended to be a forum for original student- and faculty-produced data analyses.

How It Works

Professors can assign, and students can elect to submit, blog posts of 500 – 1500 word to QC DataBlog.  Submissions should use original data analysis to generate an insight that is of use to some group or community.  Submissions are otherwise open to topic, data, or analytical method.

The blog has several goals:

  • A Publication Outlet for Student Work.  The DataBlog intends to give students a platform to show the fruits of their skill sets.  Students can link to these posts on their web sites or LinkedIn pages to show potential collaborators or employers what they can do.  The students’ work will be published on an *.edu address with inward links from well-trafficked sites (e.g., the College site), and can include links to the student’s personal sites.
  • Learning Opportunities.  DataBlog publications can add meaning to class exercises, because they offer students an opportunity to showcase their skills publicly.  It may encourage more engagement and investment in assignments.  Moreover, students can be enlisted in the editorial, publication, or promotional operations of this publication as a means of learning other market-valued skills, like web development, research evaluation, social media marketing, mass communications, and much else.
  • Contribute Information to Public Discussion.  DataBlog entries have a good chance of appearing when people perform web searches for empirical analyses.  Publishing useful or meaningful analyses contributes information to the public, and allows students to contribute to larger societal discussions by donating their professional services.
  • Professional Identity Development.  Students can post under pseudonyms, but can also post work as a means of building an online professional identity.  It is likely that the posts on DataBlog will be among the higher-registering search returns when someone Googles the student.  Students may post to DataBlog to ensure that, when they are searched online, displays of professional skill and technical competency are what comes up.

Opportunities to Participate

Students are invited to participate in the development of the QC DataBlog.  Benefits include:

  • You will receive a listing on the DataBlog’s masthead and personnel page to which you can link on your web page and social media.
  • You will receive training and help in developing a personal website and social media ecosystem, in addition to your work on that of the DataBlog.
  • Experience publishing, reviewing and editing research, and opportunities to link to those products when telling your story online.
  • An opportunity for hands-on learning experiences related to your career aspirations in research evaluation, media enterprise management, web development, marketing and public relations, or content creation (including data journalism).

Roles available include:

  • Webmasters: Manage and develop the QC DataBlog site by managing user-interface design, website features, and content development.
  • Communications: Management of content delivery, publicity, and social media operations.  Writes content for web, reaches out to College and outside agencies to promote articles, and writes and schedules social media posts.
  • Editors: Will spearhead efforts to review and make publication decisions related to submissions.  This will include interfacing with authors and reviewers.
  • Reviewers:  Review and write a report on the rigor of the submission for editors.
  • Authors: Write original posts for the blog