U.S. DOT ITS Connected Vehicle Pilot Sandbox
42

This S3 bucket contains sanitized raw data from the Connected Vehicles (CV) Pilot Programs. Each file is a newline JSON file containing multiple messages. You may access the files in this this bucket by downloading them in the file explorer below or via command line or code. An overview of open data resources related to the Connected Vehicle Pilot can be found in this data story.

Please see change notes relating to any major changes to the data at the cv_pilot_ingest GitHub repository wiki page and consult our detailed data dictionaries and log of known data pipeline downtime and caveats prior to using the data. The data held here are not affected by the interventions of the CV pilot, and thus the data can be analyzed without accounting for the baseline/test periods.

Retrieving the Data

Data is currently stored in the following folder hierarchy based on when the data is generated:

{Source_Name}/{Data_Type}/{Year}/{Month}/{Day}/{Hour}

  • {Source_Name}: The data producer of the pilot. Acceptable values: wydot, wydot_backup, thea, nycdot.
  • {Data_Type}: The message type of the data. Acceptable values: BSM, TIM, SPAT, EVENT. SPAT is only available when the Source_Name is thea. EVENT is only available when the Source_Name is nycdot.
  • {Year}: Four-digit year value based on the metadata.recordGeneratedAt field in the record (e.g., 2019). Based on UTC time. When Source_Name is nycdot, the year value is based on the eventHeader.eventTimeBin field in the record, which uses the NYC (EST/EDT) time zones.
  • {Month}, {Day}, {Hour}: Two-digit month/day/hour value based on the metadata.recordGeneratedAt field in the record(e.g., 01). Based on UTC military time. When Source_Name is nycdot, the month, day, and hour value is based on the eventHeader.eventTimeBin field in the record, with the day being day-of-week bins (MON, TUE, WED, THU, FRI, SAT, SUN, NA) and hour being time-of-day bins (AM, PM, MD, EV, NT, NA). Records are assigned to the day-of-week and time-of-day bins based on the NYC (EST/EDT) time zones.
Sanitized Data
Object Folder Last Modified Size
AWS S3 Explorer 
42
Sanitize Data
Object Folder Last Modified Size