FinSight - Financial Analysis Pipeline
End-to-end business intelligence architecture centered on an ETL pipeline that extracts, prepares, visualizes, and analyzes 10-40 thousand facts per SEC or EU ESEF filing. I built FinSight as a portfolio project to gain experience in pipeline design, data engineering, and analysis, and because, to be honest, it's just kind of fun 😊 Want to analyze a publicly listed company? Give it a go!
Pipeline Stages
- Ingestion: Download 10-K/20-F filings from SEC EDGAR
- XBRL Parsing: Extract ALL facts using Arelle (10k-40k per company)
- Normalization: Standardize units, currencies, taxonomies (US-GAAP, IFRS)
- Validation: Verify accounting identities and cross-statement consistency
- Storage: Load into PostgreSQL data warehouse with full provenance
- Analysis: Query, visualize, and export for downstream use
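As a rough illustration of the Validation stage, here is a minimal sketch of one accounting-identity check (Assets = Liabilities + Equity). It is not FinSight's actual code: the function name `check_balance_sheet_identity`, the dict-of-strings input shape, and the 0.5% relative tolerance are all assumptions made for the example. The concept names follow US-GAAP element naming; IFRS filings would need their own mapping.

```python
from decimal import Decimal


def check_balance_sheet_identity(facts, tolerance=Decimal("0.005")):
    """Check Assets = Liabilities + Equity within a relative tolerance.

    `facts` is a hypothetical mapping of concept name -> value as a string,
    already normalized to a single currency and scale.
    Returns (ok, difference).
    """
    assets = Decimal(facts["Assets"])
    liabilities = Decimal(facts["Liabilities"])
    equity = Decimal(facts["StockholdersEquity"])

    # Positive difference means reported assets exceed liabilities + equity.
    difference = assets - (liabilities + equity)

    # Allow small rounding gaps, scaled to the size of the balance sheet.
    allowed = abs(assets) * tolerance
    return abs(difference) <= allowed, difference


balanced = {"Assets": "1000", "Liabilities": "600", "StockholdersEquity": "400"}
unbalanced = {"Assets": "1000", "Liabilities": "600", "StockholdersEquity": "300"}

ok_balanced, diff_balanced = check_balance_sheet_identity(balanced)
ok_unbalanced, diff_unbalanced = check_balance_sheet_identity(unbalanced)
print(ok_balanced, diff_balanced)
print(ok_unbalanced, diff_unbalanced)
```

Using `Decimal` rather than `float` avoids binary rounding noise when comparing reported monetary values, and a relative tolerance keeps the check meaningful for both small and very large filers.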
Limited Company Coverage: This is a free-to-use portfolio project with limited database storage, so pre-loaded data covers only a curated set of companies. You can request custom analyses for additional companies (up to 10 per month).
Example Analyses
See what FinSight can produce with Apache Superset dashboards built on this pipeline:
Built with Arelle • PostgreSQL • Flask • Next.js • Apache Superset