Introduction TxDOT makes detailed data on traffic collisions throughout the state available online. The data must be queried and downloaded manually as CSV files, but that is not too bad. I downloaded the data for Harris County from 2010 to 2018. The database is documented at https://www.txdot.gov/inside-txdot/division/traffic/data-access.html and access is from https://cris.txdot.gov/secure/Share where you log on and download one year at a time. The zip files require the login password to open.
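Because the downloads come one year at a time, the yearly files have to be stacked before any analysis. Here is a minimal sketch of that step in Python, assuming the zip files have already been extracted by hand and that every yearly CSV carries the same header row. The function name `combine_yearly_csvs` and the file names in the usage note are hypothetical, not anything defined by the TxDOT site.

```python
import csv
from pathlib import Path


def combine_yearly_csvs(csv_paths, out_path):
    """Stack several CSV files that share a header into one file.

    Assumption: each input file starts with the same header row.
    Returns the number of data rows written.
    """
    rows_written = 0
    header = None
    with open(out_path, "w", newline="", encoding="utf-8") as out_file:
        writer = csv.writer(out_file)
        # Sort by path so the output order does not depend on input order.
        for path in sorted(Path(p) for p in csv_paths):
            with open(path, newline="", encoding="utf-8") as in_file:
                reader = csv.reader(in_file)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    # First non-empty file defines the header.
                    header = file_header
                    writer.writerow(header)
                elif file_header != header:
                    raise ValueError(f"{path} has a different header")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

With hypothetical file names, a call might look like `combine_yearly_csvs(["harris_2010.csv", "harris_2011.csv"], "harris_all.csv")`. Raising on a header mismatch is a deliberate choice here: if the column layout changes between years, it seems safer to stop than to silently misalign columns.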
Introduction Houston is one of the worst places in the country for allergies. Since reasonably good data is available, I thought I should analyze the pollen and mold data with an eye towards prediction on both short- and mid-range time scales. As with any project like this, step one is reading in and cleaning up the raw data. The data is available online as artisanal spreadsheets at https://www.
Introduction In late 2017 I did an analysis of crime data in my neighborhood (The Heights) using the online Houston Police Department data. This was so interesting that I foolishly decided to expand the effort to cover the whole city. After all, how hard could it be to go from analyzing one police beat with about 13,000 records to analyzing 109 beats, with a corresponding increase in volume? This effort is still ongoing, in fits and starts, but I thought it would be useful to start documenting the journey now before the pain fades away.
Houston Crime Statistics A debate began on my neighborhood list. “Crime is increasing in the Heights!”, “Crime is not going up in the Heights!” and so on, back and forth. Then someone noted that all the crime data is on the Houston Police Department website, so someone could just download the data and answer the question. I thought to myself, “I could be that someone”, so I started working on it.