FinSight - Financial Analysis Pipeline

End-to-end business intelligence architecture centered around an ETL pipeline to extract, prepare, visualize and analyze 10-40 thousand facts per SEC and EU ESEF filing. I built FinSight as a portfolio project to gather pipeline, data engineering, and analysis experience. And because it's just kind of fun to be honest 😊 Want to analyze a publicly listed company? Give it a go!

Pipeline Stages

  1. Ingestion: Download 10-K/20-F filings from SEC EDGAR
  2. XBRL Parsing: Extract ALL facts using Arelle (10k-40k per company)
  3. Normalization: Standardize units, currencies, taxonomies (US-GAAP, IFRS)
  4. Validation: Verify accounting identities and cross-statement consistency
  5. Storage: Load into PostgreSQL data warehouse with full provenance
  6. Analysis: Query, visualize, and export for downstream use

Example Analyses

See what FinSight can produce with Apache Superset dashboards built on this pipeline:

📊
Novo Nordisk - Pharma Industry Analysis
5-year trend analysis, peer comparison, comprehensive financial metrics
📊
NVIDIA - Tech Sector Analysis
Coming soon
📊
Apple - Consumer Tech Analysis
Coming soon

Built with Arelle • PostgreSQL • Flask • Next.js • Apache Superset

View source code on GitHub →