Data Sources

We use a mix of primary regulatory filings and structured financial feeds. Every number in a company analysis report can be traced back to a source listed here.

Primary sources

SEC EDGAR

What: official 10-K (annual) and 10-Q (quarterly) filings for US-listed companies
Why: authoritative, audited, public-record source
Coverage: all US-listed companies (NYSE, NASDAQ)
Update frequency: new filings appear within minutes of SEC acceptance

Structured financial feeds

US market normalization: SEC Company Facts API and related SEC public datasets
China A-share / Hong Kong normalization: Eastmoney Datacenter endpoints
Why we use them: they let us turn filings into a normalized cross-company financial dataset faster than manual extraction
How we treat them: as convenience layers, not final authority. If a structured field conflicts with the filing, the filing wins.

How the pipeline works

We fetch raw financial statement data from the relevant primary source or structured feed.
We normalize it into a shared schema so the same framework can work across markets.
We cache the normalized dataset for repeat analysis.
We generate the report against that normalized dataset.
On US company pages, we expose filing links back to SEC EDGAR wherever possible.

Source hierarchy

When sources disagree, this is the order we trust:

Original regulatory filing
SEC structured company facts for US issuers
Eastmoney structured market data for CN/HK issuers
Our own derived calculations

Coverage notes

US stocks: full coverage
A-shares / HK: planned, not yet shipped
Private companies: out of scope

Update cadence

Company analysis reports regenerate when a new 10-K or 10-Q is filed
Ranking pages refresh quarterly with explicit "Last updated" timestamps
Methodology changes are logged on the methodology page