The 2008 to 2013 NYC Taxi Trip Data set comes courtesy of a FOIL request to the Taxi & Limousine Commission. The data is currently hosted on Google's BigQuery service, where you can run SQL queries and batch jobs on it. There are nearly 850,000,000 rows and the data requires 98 gigabytes of disk space.