Data Sources
We use a mix of primary regulatory filings and structured financial feeds. Every number in a company analysis report can be traced back to a source listed here.
Primary sources
SEC EDGAR
- What: official 10-K (annual) and 10-Q (quarterly) filings for US-listed companies
- Why: authoritative, audited, public-record source
- Coverage: all US-listed companies (NYSE, NASDAQ)
- Update frequency: new filings appear within minutes of SEC acceptance
Structured financial feeds
- US market normalization: SEC Company Facts API and related SEC public datasets
- China A-share / Hong Kong normalization: Eastmoney Datacenter endpoints
- Why we use them: they let us turn filings into a normalized cross-company financial dataset faster than manual extraction
- How we treat them: as convenience layers, not final authority. If a structured field conflicts with the filing, the filing wins.
How the pipeline works
- We fetch raw financial statement data from the relevant primary source or structured feed.
- We normalize it into a shared schema so the same framework can work across markets.
- We cache the normalized dataset for repeat analysis.
- We generate the report against that normalized dataset.
- On US company pages, we expose filing links back to SEC EDGAR wherever possible.
Source hierarchy
When sources disagree, this is the order we trust:
- Original regulatory filing
- SEC structured company facts for US issuers
- Eastmoney structured market data for CN/HK issuers
- Our own derived calculations
Coverage notes
- US stocks: full coverage
- A-shares / HK: planned, not yet shipped
- Private companies: out of scope
Update cadence
- Company analysis reports regenerate when a new 10-K or 10-Q is filed
- Ranking pages refresh quarterly with explicit "Last updated" timestamps
- Methodology changes are logged on the methodology page
