Skip to main content

EDGAR Log File Data Set

The Division of Economic and Risk Analysis (DERA) has assembled information on internet search traffic for EDGAR filings through SEC.gov generally covering the period February 14, 2003 through December 31, 2016.  The data is intended to provide insight into the usage of publicly accessible EDGAR company filings in a simple but extensive manner. 

The EDGAR Log File Data Set contains information in CSV format extracted from Apache log files that record and store user access statistics for the SEC.gov website.  Due to certain limitations, including the existence of lost or damaged files, the information assembled by DERA may not capture all SEC.gov website traffic.  Given the large size of the data files which can include more than a million entries, for best results users should avoid using software that limits the amount of data that can be read.

DERA intends to update this data set on a quarterly basis. Data is posted with a six-month delay.

Notes:

  • A revision was posted on December 23, 2015 to include updates to the data from May 1, 2012 through August 31, 2012 (123 files total) which addressed issues with inconsistent IP address formats.
  • On May 16, 2017 the file log20121231.zip was replaced. The replaced file was incomplete.

Logfile List (HTML, 57 KB)

List of Variables (PDF, 78 kb)

EDGAR Log File Data Set FAQs