Bicycling
Capital Bikeshare data already yields interesting facts
Reader Corey H has already taken the Capital Bikeshare anonymous trip data, released just a few days ago, and crunched the numbers to come up with some fascinating nuggets of information:
1) Downhill flow. Average trip is -1.94 meters, or over 2,632 kilometers in elevation change total. The average ride from Wisconsin and Macomb loses 55 meters in elevation.Fairfax Village has the highest start station:end station ratio (71 trips started, only 29 ended).
2) Last mile usage. The four most common one-way trips are Adams Mill/Columbia to Calvert/Woodley and back as well as Eastern Market Metro to Lincoln Park and back.
3) Tourists like to use it to sight-see. The 6th most common one-way trip is from the Smithsonian station back to the Smithsonian Station at 3,586 trips.
The average [such] trip is 2 hours, 48 minutes. 76.1% of [these] trips generate usage fees. Breaking that down between casual and members, 86.0% of casual incurred fees on these rides while 18.8% of members incurred fees.Corey added in an email, "I've already taken the time to clean the data and get it into a usable database. So if there are specific questions you'd like to be answered I can easily put together a query to get those answers (and I'm sure the others can as well)."4) Casual vs. members usage fees. 40.7% of casual rides incur fees.
3.3%3.3% of member rides incur fees.*Using GPS elevation data so all caveats apply. And it only factors in station-to-station elevation change.
What would you like to know about Capital Bikeshare usage? He can't necessarily investigate everyone's questions, but if anyone posts some interesting questions that catch Corey's eye, maybe he will analyze them for us.
Comments
- Bikeshare is a gateway to private biking, not competition
- Short-term Washingtonians deserve a voice, too
- Judge denies injunction against closing schools
- Public land deals have both benefits and pitfalls
- Long-term closures: A solution to single-tracking?
- DC Council makes major policy changes overnight
- PG planners propose bold new smart growth future







by tom veil on Jan 13, 2012 1:46 pm • link • report
Thanks!
by AR on Jan 13, 2012 1:47 pm • link • report
by Kevin Beekman on Jan 13, 2012 1:55 pm • link • report
I also worked out an estimate for how much CaBi might have made in overage fees for 2011. Again, excluding trips over 24 hours, I came up with more than $1.3 million, mostly from casual users (87.9% of the money). The average overage fee for casual users was also much higher than for registered users.
I put together a blog post this morning with my numbers at bikesharestation.com/capital-bikeshare-data/, with some more numbers probably coming soon.
by Eric O on Jan 13, 2012 2:03 pm • link • report
by Brad on Jan 13, 2012 2:10 pm • link • report
by Falls Church on Jan 13, 2012 2:18 pm • link • report
by David C on Jan 13, 2012 2:19 pm • link • report
I've also cleaned up the data, because there's not a whole lot you can do with the CSVs up on the web, because the date and time stamps are stored as character strings. It also doesn't help that the station variable doesn't have the ID parsed out from the huge long station name.
I have the data stored as SAS datasets and could answer many questions as well - it's just a matter of a) finding the time to sit down and code it and b) figuring out where to begin.
by Rob P on Jan 13, 2012 2:28 pm • link • report
The 10 most common one-way trips on weekdays by registered memebers:
5:00 AM to 9:59 AM:
1) Lincoln Park to Eastern Market
2) 13th/D NE to Union Station
3) 18th/Bell (Crystal City Metro) to 27th/Crystal Drive
4) 14th/V NW to Dupon Circle Metro
5) 17th/Corcoran to 17th/K
6) 18th/Bell to 23rd/Crystal
7) 13th/H St to Union Station
8) Union Station to L'Enfant Plaze
9) Adams Mill/Columbia to Calvert/Woodley
10) 15th/P to Dupont Circle
3:00 PM to 7:59 PM
1) Eastern Market to Lincoln Park
2) Calvert/Woodley to Adams Mill/Columbia
3) Union Station to 13th/D
4) Dupon Circle to 15th/P
5) Union Station to 13th/H
6) 27th/Crystal to 18th/Bell
7) Adams Mill/Columbia to Calvert/Woodley
8) Dupont Circle to 16th/U
9) Dupont Circle to 14th/V
10) Eastern Market to 13th/D
@Falls Church
The data are for trips only. It's difficult to find whether these stations were dockblocked/empty at any time. www.cabitracker.com has good data on that.
@Rob P, David C, Eric O
Basically I normalized the five CSVs, then removed the trips from non-standard stations, removed trips to and from the same station less than 60 seconds, removed incomplete entries, and removed trips over 24 hours. I also made the personal decision to remove the White House station trips, but that's a judgement call. Once I did that, I added elevation change and station-to-station distance.
GPS Coordinates came from http://www.capitalbikeshare.com/stations/bikeStations.xml
Elevations came from http://www.gpsvisualizer.com/elevation
Distance using the GPS coordinates and the Great Circle Distance (http://en.wikipedia.org/wiki/Great-circle_distance)
by Corey H. on Jan 13, 2012 2:45 pm • link • report
by Rob P on Jan 13, 2012 3:08 pm • link • report
by charlie on Jan 13, 2012 3:26 pm • link • report
by Brad on Jan 13, 2012 3:26 pm • link • report
Lincoln Park to Eastern Market is by no means an insurmountable distance to walk. However, it'd be annoying to have to do every day. Bikeshare adds a huge convenience factor.
Ditto for Crystal City. There are a few points in the area that are irritatingly far from the Metro. My only gripe is that CC still has a long way to come in terms of bicycle friendliness. Biking from the Metro to the 23rd St restaurants should be an no-brainer, although it's one of the few Bikeshare trips that I'll *never* attempt at night or without a helmet.
by andrew on Jan 13, 2012 3:31 pm • link • report
They're the most common one-way routes, not necessarily the heaviest-used stations. Those top 10 trips are almost exclusively last-mile connections to transit.
But look at 14th/V, with the second most originations in the AM rush, and the trips go all over the city, Dupont Cirlce, 19th/L, 17th/K, 18th/M, 21st/M, 14th/H are the most popular destinations.
by Corey H. on Jan 13, 2012 3:37 pm • link • report
by tour guide on Jan 13, 2012 3:53 pm • link • report
But one could definitely look into pairs of stations that are connected by bike lanes, and assume that a fair amount of riders would choose the bikelane routes.
Similarly, when DDOT gets around to adding new bike lanes in the city, it might be possible to do a before/after comparison (I think a time-series, although it's been a long time since I took stats) to see if the lanes impact overall usage in stations along those routes.
I would be interested in seeing how bad weather days impact CaBi usage at various stations. I would hypothesize that stations near Metro (or trips originating/ending at metro stations) don't have as much of a drop in rainy-day usage as station pairs in which Metro is not involved.
by Jacques on Jan 13, 2012 4:01 pm • link • report
Thanks again for doing this.
I'd love to look at the EOTR data. I know it has picked up in recent months, but it might be interesting to see where it goes.
Another point. David's bllboard concect -- I suggested on the bikehare side you have station availability and the 5 most used station from that station. With that data, you could do that.
by charlie on Jan 13, 2012 4:07 pm • link • report
by Adam L on Jan 13, 2012 4:16 pm • link • report
@Jacques, it would be easy enough to add historic
temperatures, humidity, and precipitation to the database. However, you'd have to factor in the cyclical trends of usage to make any real insights. (http://goo.gl/J3B5f for historic weather data)
@Rob P, David C, Eric O
For clarification on the cleaned data set, starting with 1,361,007 total records, here are the records I removed
18 incomplete records (no bike or member type)
169 record with non-standard start stations
807 non-standard or empty end stations
2,224 trips that started or ended at the white house station
16,027 trips that started AND ended at the same station and lasted 60 seconds or less
287 trips that lasted 24 hours or longer
That leaves 1,341,478 usable records. I then added station-to-station distance in miles, and elevation change in feet while converting the duration to seconds.
The CSV is at:
www.coreyholman.com/flat.csv
The Station IDs are at:
www.coreyholman.com/stations.txt
Member Type 1 = Registered, 2 = Casual
by Corey H. on Jan 13, 2012 4:42 pm • link • report
Of the trips origination from Ward 7 and Ward 8, not surprisingly, given geography, most of them are either self-contained or go to Ward 6.
Start Node - End Node - Percent of Total
DC7-DC1 1.0%
DC7-DC2 3.8%
DC7-DC5 2.5%
DC7-DC6 41.6%
DC7-DC7 38.6%
DC7-DC8 12.3%
DC7-RB 0.1%
DC8-CC 0.3%
DC8-DC1 0.6%
DC8-DC2 8.1%
DC8-DC3 0.1%
DC8-DC4 0.2%
DC8-DC5 0.8%
DC8-DC6 34.3%
DC8-DC7 4.6%
DC8-DC8 50.1%
DC8-RB 0.9%
(CC = Crystal City cluster, RB = Rosslyn-Ballston cluster)
@Adam L.
You've highlighted one of the biggest issues with Bikeshare data collection is that there's no way to tell rides not taken or rides that went to multiple stations before finding a dock.
by Corey H. on Jan 13, 2012 5:14 pm • link • report
Something else that would be neat - and difficult - to correlate is how CaBi use changes on days when there are Metro service outages.
by David C on Jan 13, 2012 5:28 pm • link • report
by Brad on Jan 13, 2012 5:43 pm • link • report
by David C on Jan 13, 2012 5:48 pm • link • report
by beatbox on Jan 13, 2012 5:54 pm • link • report
http://g.co/maps/d9wnc
38.8975160356978 -77.0273450932033
Closest station: Metro Center/12th & G (terminalName 31230)
by akg on Jan 13, 2012 6:15 pm • link • report
by RE on Jan 14, 2012 8:30 am • link • report
by Richard Layman on Jan 14, 2012 12:53 pm • link • report
My three most-used routes are 1.4x, 1.4x, and 1.2x longer than CABI's point-to-point calculations.
Obviously you can't know the actual route that any rider takes, but we do know that unless someone simply rides in a straight line from station to station on the same road, that rides are longer than the point-to-point numbers. Probably nearly all of them will be.
It shouldn't be too hard to approximate the real distances for say the Top 25 trips: it's probably safe to guess that if a route is that popular then there is probably a best path that most reasonable people would take.
by Mike C on Jan 14, 2012 3:58 pm • link • report
by Ghosts of DC on Jan 14, 2012 9:02 pm • link • report
http://oliverobrien.co.uk/2012/01/bike-share-route-fluxes/
by Corey H. on Jan 17, 2012 4:03 pm • link • report
by MLD on Jan 17, 2012 6:04 pm • link • report
by JDAntos on Jan 17, 2012 10:43 pm • link • report
by John on Jan 18, 2012 11:53 am • link • report
by Dmitry Kachaev on Jan 26, 2012 1:31 pm • link • report
by Matt Caywood on Feb 3, 2012 10:36 am • link • report
Add a Comment