-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathabout.html
95 lines (73 loc) · 7.83 KB
/
about.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/>
<link rel="stylesheet" type="text/css" href="/css/reset.css" />
<link rel="stylesheet" type="text/css" href="/css/style.css" />
<link rel="stylesheet" type="text/css" href="/css/colorbox.css" />
<link rel="icon" href="/images/favicon.ico" type="image/gif"/>
<title>Roach Map - Where are the roaches in NYC? - The Great Urban Hack</title>
<meta name="description" content="A map of New York City cockroach reports in restaurants recorded by the Health Department." />
<script src="//ajax.googleapis.com/ajax/libs/jquery/1.4.3/jquery.min.js"></script>
<script type="text/javascript" src="http://use.typekit.com/vbg6ymp.js"></script>
<script type="text/javascript">try{Typekit.load();}catch(e){}</script>
<script type="text/javascript" src="/jquery/jquery.colorbox-min.js"></script>
<script>
$(document).ready(function(){
$(".image").colorbox();
});
</script>
<script type="text/javascript">
var _gaq = _gaq || [];
_gaq.push(['_setAccount', 'UA-19554555-1']);
_gaq.push(['_trackPageview']);
(function() {
var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true;
ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js';
var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s);
})();
</script>
</head>
<body>
<div id="container">
<h1>About the New York City Roach Map</h1>
<a href="http://www.flickr.com/photos/k790i/3639234792/"><img src="/images/roach.png" class="right" padding-top="55px" padding-left="5px"></a><p padding="40px">They may not disrupt your life as much as bedbugs, but nobody wants to live in a place where cockroaches roam free. They can cause the spread of asthma, they can spread dangerous germs and they're just plain ugly.</p>
<p>Each week, New York City restaurant inspectors visit hundreds of restaurants and one of their jobs is to document sightings of live cockroaches. The inspectors add their findings to the publicly available restaurant reports, creating the only official database of roach sightings.</p>
<p>This website shows the zip codes around the city that had the highest number of roach sightings in the past week based on this data. We update the map every Monday, and every month, we send out an email with the ZIP codes and neighborhoods that had the most roach sightings in the most recent four-week period.</p>
<p>We came up with this idea at <a href="http://hackshackers.com/2010/11/08/open-all-night-the-great-urban-hack-nyc/">The Great Urban Hack NYC</a>, a two-day, overnight hackathon that brought together journalists, data scientists and developers on Nov. 6 and 7, 2010.</p>
<p>Read on for the technical details of how this works …</p>
<p>After brainstorming and chucking several ideas, five of us settled on creating a map of roach reports culled from city restaurant health inspection data. Since this site and the map were produced during a hackathon, think of it as a work in progress, one that's <a href="https://github.com/macdiva/RoachMap">free to fork</a>.</p>
<p>We used the <a href="http://www.nyc.gov/html/datamine/html/home/home.shtml">NYC Data Mine</a> “restaurant inspection results” raw data set, which describes almost 400,000 results from health inspections performed in restaurants across all of NYC. The most recent report can be downloaded by searching "restaurant inspection results" in the Data Mine <a href="http://www.nyc.gov/html/datamine/html/data/raw.shtml">raw data catalog</a>.</p>
<p>We then went through Violation.txt file within the data set to find the violation code, 04M, that corresponds to finding “live roaches present in facility's food and/or non-food areas.” This code’s current meaning went into effect July 26, 2010, so we choose to only analyze a 90-day window of data from the WebExtract.txt file.</p>
<p>We wrote a Python script that parses the data set’s WebExtract.txt file line-by-line, counting every single result for each ZIP code in New York in our window and every result specifically related to roaches. This script is called <a href="https://github.com/macdiva/RoachMap/blob/master/roach_parser.py">roach_parser.py</a>.</p>
<p>This count data tells us what percentage of inspections resulted in a violation for roaches. For that reason, we chose to analyze the data as draws from a binomial model. To estimate the parameters of this model, we used a hierarchical Bayesian model implemented both as an empirical Bayes approach in NumPy (<a href="https://github.com/macdiva/RoachMap/blob/master/draw_map.py">draw_map.py</a>) and using Gibbs sampling (<a href="https://github.com/macdiva/RoachMap/blob/master/jags_analysis.R">jags_analysis.R</a>) to correct for the large discrepancies in inspections across ZIP codes. Some ZIP codes only have a few inspections, while other ZIP codes have a very large number of inspections: the Bayesian approach allows us to pool information across all ZIP codes to make up for this discrepancy.</p>
<p>With that information, we plot the data on a map using a NYC shapefile using Matplotlib. This is done in draw_map.py.</br>
<div class="byline">— The Roach Map team, a.k.a. "Drawing Conclusions"</div></p>
<div class="clear"></div>
<p><b>Drawing Conclusions</b> is...<br/>
<ul>
<li><b><a href="http://www.cleanplates.com">Niles Brooks</a></b></li>
<li><a href="http://www.thetakeaway.org/people/jim-colgan/"><b>Jim Colgan</b></a>, an interactive producer at WNYC Radio. He is currently the digital editor for the national show, The Takeaway. Before that, he was at the Brian Lehrer Show, where he pioneered the station's first crowdsourcing efforts.</li>
<li><a href="http://max.shron.net"><b>Max Shron</b></a>, Data Scientist at OkCupid, where he works on the blog and general number crunching visualization.</li>
<li><a href="http://johnmyleswhite.com"><b>John Myles White</b></a>, a Ph.D student in psychology at Princeton, where he studies behavioral economics and machine learning.</li>
<li><a href="http://twitter.com/MacDivaONA"><b>Chrys Wu</b></a>, journalist, strategist, maker, cook, and co-organizer of Hacks/Hackers NYC.</li>
</ul>
</p>
<p>The Roach Map is a work in progress. Our files are <a href="https://github.com/macdiva/RoachMap">on Github</a>.</p>
<p>If you're concerned about cockroaches, the city has information about <a href="http://www.nyc.gov/html/doh/html/ehs/ehscroach.shtml">what to do</a>.</p>
<p>The Roach Map was one of a dozen Great Urban Hack NYC projects. See the others on <a href="http://hackshackers.com/2010/11/08/open-all-night-the-great-urban-hack-nyc/">HacksHackers.com</a>.</p>
<p>Ready for more? Go back to the <a href="index.html">Roach Map</a>.</p>
<br style="clear:both"/>
<div id="footer">
<a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/"><img alt="Creative Commons License" style="border-width:0; padding: 0 10px 0 40px; float:left;" src="http://i.creativecommons.org/l/by-sa/3.0/80x15.png" /></a><p><span xmlns:dct="http://purl.org/dc/terms/" href="http://purl.org/dc/dcmitype/InteractiveResource" property="dct:title" rel="dct:type"><a xmlns:cc="http://creativecommons.org/ns#" href="http://roachmap.com" property="cc:attributionName" rel="cc:attributionURL">Roach Map</a></span> by Niles Brooks, Jim Colgan, Max Shron, John Myles White and Chrys Wu is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/">Creative Commons Attribution-ShareAlike 3.0 Unported License</a>.<br/>
Photo by <a href="http://www.flickr.com/photos/k790i/3639234792/">Anil Jedhav/Flickr</a>, Creative Commons Attribution 2.0 Generic License</p>
</div>
</div>
<div class="clear"></div>
<div id="archives">Archives:
<ul>
<li><a href="/">Week of Nov. 7, 2010</a></li>
</ul>
</div>
</body>
</html>