We are always on the lookout for interesting sources of financial data beyond the usual suspects. One of the best sources of raw data we know of is the U.S. federal government. You see a lot of data from the Securities and Exchange Commission (SEC) made available through financial data brokers, but plenty of other government data is published that you just don’t see anywhere else. Government data can be a little harder to work with, as it tends to be published in less-than-friendly formats, and you have to wade through a variety of historical data sources before you find the gems.
We have begun working through the data sources available at the federal level, looking for ones that would benefit from being made available in a more streaming, real-time format, and we thought it would be worth sharing some of the best sources of financial data we’ve found so far. Here are fourteen of the most interesting data portals for discovering open data.
– U.S. Department of Housing and Urban Development
– U.S. Department of Agriculture
– Department of Commerce
– Federal Deposit Insurance Corporation
– United States Census Bureau
– Pension Benefit Guaranty Corporation
– Bureau of Labor Statistics
– Department of Energy
– U.S. Department of the Treasury
– Small Business Administration Data
– General Services Administration
– Federal Trade Commission
– Securities and Exchange Commission
– Federal Reserve Bank
Most of these sources of data are available as downloads, but there are also a number of web APIs available. You’ll have to look through the sites to find the most valuable datasets, just like us. We’ll keep publishing stories as we find interesting datasets that complement the other APIs we are publishing to the Streamdata.io API Gallery and working to make available to our financial-data-hungry customers. There are thousands of datasets available across these fourteen federal agencies, making them quite a gold mine for financial data.
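To give a feel for what turning a plain download into something more stream-like might involve, here is a minimal sketch in Python. It compares two snapshots of the same CSV file and emits only the rows that were added, updated, or removed. The column name `series_id`, the sample values, and the helper names `parse_snapshot` and `diff_snapshots` are all hypothetical and made up for illustration; they do not come from any particular agency dataset, and this is not a description of how any specific product works.

```python
import csv
import io


def parse_snapshot(csv_text, key_field):
    """Index the rows of one CSV snapshot by the value of a key column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row[key_field]: row for row in reader}


def diff_snapshots(old, new):
    """Return a list of change events between two keyed snapshots."""
    events = []
    for key, row in new.items():
        if key not in old:
            events.append({"op": "add", "key": key, "row": row})
        elif old[key] != row:
            events.append({"op": "update", "key": key, "row": row})
    for key in old:
        if key not in new:
            events.append({"op": "remove", "key": key})
    return events


# Hypothetical sample data standing in for two downloads of the same file.
yesterday = "series_id,value\nA1,100\nB2,200\n"
today = "series_id,value\nA1,100\nB2,250\nC3,300\n"

events = diff_snapshots(
    parse_snapshot(yesterday, "series_id"),
    parse_snapshot(today, "series_id"),
)
for event in events:
    print(event["op"], event["key"])
```

In practice you would fetch each snapshot on a schedule and publish the resulting events instead of printing them, and you would need to pick a key column that is actually stable for the dataset in question.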
We’ve also begun mining city, county, and state sources of data, looking at the different types of data being published. Once we have documented the financial data available across lower levels of government, we’ll share some more details here on the blog. So far, we have targeted over 250 independent sources of data, with well over 100K datasets. That makes for some pretty interesting sources of commercial and financial data for use in training machine learning algorithms and enriching other existing streams of data our customers are either providing or tapping into. If there is any particular area you are most interested in, please let us know, as we are happy to focus our work on specific areas to help you find what you are looking for.