The EGE
Because ODX data is so detailed, it is not publicly accessible due to privacy concerns. Even though the actual fare card numbers are anonymized (except when using known trips to verify), all trips made with a given fare card are still grouped together because of the way the algorithm works. When I was working with the data as a student, I could find myself just by knowing my regular commute, as I was the only person in the system who regularly commuted between those two specific endpoints. Even a rider with a more common pattern would be easy to find just by knowing one or two less common single trips they made.
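To make the re-identification risk concrete, here is a rough sketch in Python. Everything in it is made up for illustration: the record layout, the card IDs, the station names, and the `cards_matching` helper are assumptions, not the actual ODX schema or tooling. It only shows that once trips stay linked to one pseudonymous ID, knowing a few of someone's trips can narrow the data down to a single card.

```python
from collections import defaultdict

# Hypothetical anonymized records: (pseudonymous_card_id, origin, destination).
# The IDs and station names are invented; real ODX records look different.
trips = [
    ("a1", "Station A", "Station B"),
    ("a1", "Station A", "Station B"),
    ("a1", "Station C", "Station D"),
    ("b2", "Station C", "Station D"),
    ("b2", "Station C", "Station D"),
    ("c3", "Station C", "Station D"),
    ("c3", "Station E", "Station A"),
]

def cards_matching(trips, known_trips):
    """Return the pseudonymous IDs whose history contains every known (origin, destination) pair."""
    history = defaultdict(set)
    for card, origin, dest in trips:
        history[card].add((origin, dest))
    wanted = set(known_trips)
    return sorted(card for card, pairs in history.items() if wanted <= pairs)

# A rare commute identifies its rider outright.
print(cards_matching(trips, [("Station A", "Station B")]))

# A common trip alone matches several cards...
print(cards_matching(trips, [("Station C", "Station D")]))

# ...but adding one less common trip narrows it to a single card.
print(cards_matching(trips, [("Station C", "Station D"), ("Station E", "Station A")]))
```

Once a single ID is left, every other trip attached to that ID is exposed too, which is why hiding the card number by itself is not enough.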
More aggregated data can be safely shared. Group by hour, by line, by region, whatever, and there's no longer personally identifiable data. For example, BART (which is tap-in/tap-out) publishes station ridership data as an origin/destination matrix.
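As a rough illustration of that kind of aggregation, the Python sketch below collapses per-card records into hourly origin/destination counts and drops the card ID entirely. The record layout, sample values, and the `od_matrix_by_hour` name are assumptions made up for this example; they are not BART's published format or the ODX schema.

```python
from collections import Counter
from datetime import datetime

# Hypothetical tap-in/tap-out records: (card_id, tap_in_time, origin, destination).
taps = [
    ("a1", "2024-03-04T08:05:00", "Station A", "Station B"),
    ("b2", "2024-03-04T08:40:00", "Station A", "Station B"),
    ("c3", "2024-03-04T08:15:00", "Station C", "Station D"),
    ("a1", "2024-03-04T17:30:00", "Station B", "Station A"),
]

def od_matrix_by_hour(taps):
    """Count trips per (hour, origin, destination); the card ID is discarded."""
    counts = Counter()
    for _card, timestamp, origin, dest in taps:
        hour = datetime.fromisoformat(timestamp).hour
        counts[(hour, origin, dest)] += 1
    return counts

matrix = od_matrix_by_hour(taps)
for (hour, origin, dest), n in sorted(matrix.items()):
    print(f"{hour:02d}:00  {origin} -> {dest}: {n}")
```

The output no longer says which trips belong to the same card, so the morning and evening trips made by card `a1` can't be linked back together.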