I will write a more detailed post about the parsing and database import, but suffuce it to say that it was not as straightforward as one might imagine. A number of helpful sources exist on the web:
- XBRL Specification
- Financial Accounting Standards Board – taxonomy viewer
- SEC Company filings viewer
The 1st on the list is invaluable as it has explained how the data is structured and I used this alot in the parser which I originally wrote a few months back. The parsers have been re-written a number of times as my knowledge and understanding has improved.
In the end I have completed the import of only a subset of the XBRL data. I have limited the scope to only the 500 companies of the S&P 500.