Skip to content
/api/v1/scrapping/public/{filepath}

Public scraped-file fetch by file path.

Public scraped-file fetch by file path. Reads pre-cached scrape outputs from the public scrapping cache. Used by clients that have a cached file URL from a prior scrape call.

10 tokensSince v1.0.0

Why use this

Fetch a previously-cached SEC filing artifact (raw HTML body, exhibit attachment, XBRL document, image) from FinRadar's gateway cache instead of hitting SEC EDGAR directly. The gateway caches artifacts with a 24-hour TTL — useful for clients that want to avoid SEC's rate limits (10 req/sec per IP) or want deterministic latency for repeated reads of the same artifact. Returns the original SEC `Content-Type` header so binary artifacts (PDFs, images) round-trip correctly. The cache is populated by prior scrape calls — pass a `filepath` you obtained from a previous `/scrapping/extractor` or `/scrapping/form-10` response. For structured item extraction prefer `/scrapping/extractor`; for PDF rendering use `POST /api/v1/scrapping/pdf`.

Serves a public-cached copy of a filing artifact (raw HTML, exhibit attachment, primary document). Useful when you want to avoid hammering EDGAR directly — FinRadar caches the artifact at the gateway with a 24h TTL. Returns the original SEC content-type header. For structured extraction prefer GET /api/v1/scrapping/extractor; for PDF rendering use POST /api/v1/scrapping/pdf.

Parameters

NameInRequiredDefaultAllowedDescriptionExample
filepathpathrequiredCache-relative path to the artifact (typically mirrors the SEC EDGAR archive path: `edgar/data/{cik}/{accession-no-dashes}/{filename}`). Returned by prior scrape calls in their `cached_filepath` field. URL-encode forward slashes if your HTTP client doesn't auto-encode path params. Returns 404 if the artifact is not cached or has expired past the 24h TTL.edgar/data/320193/000032019324000123/aapl-20240928.htm

Response schema

FieldTypeNullableDescription
contentstringnoRaw artifact contents. For text content types (`text/html`, `text/plain`, `application/xml`) returned as decoded UTF-8 text. For binary content types (`application/pdf`, `image/*`) returned as base64-encoded string — decode client-side before consuming. Always non-empty (404 returned when cache miss; never empty-string).
content_typestringnoOriginal SEC `Content-Type` header preserved verbatim (e.g. `text/html; charset=utf-8`, `application/pdf`, `application/xml`, `image/png`). Use this to dispatch the correct decoder client-side — base64 vs UTF-8 — based on the MIME prefix.
cached_atstringnoISO-8601 UTC timestamp when this artifact was first cached at the gateway. The 24-hour TTL is computed from this timestamp — values older than 24h indicate the cache will be revalidated against SEC on the next miss. Use to detect freshness for time-sensitive workflows.
size_bytesintegernoDecoded artifact size in bytes. For text artifacts this is the UTF-8 byte length; for binary artifacts this is the post-base64-decode byte length (i.e. the actual file size, not the inflated base64-string length). Useful for client-side size budgeting (e.g. reject artifacts > 50 MB before downloading).

Sample response

·
  • "content": "<html><head><title>Form 10-K</title></head>..."
  • "content_type": "text/html"
  • "cached_at": "2026-05-01T20:55:12.000Z"
  • "size_bytes": 481032
}

Errors

StatusLabelDescription
200OKRequest succeeded.
400Bad RequestInvalid query, body, or path parameter.
401UnauthorizedMissing or invalid Authorization header / api_Token.
402Payment RequiredInsufficient token balance for this call. Top up
429Too Many RequestsRate limit exceeded for your tier (see /pricing for tier limits). Tier limits
500Server ErrorUnexpected server-side failure. Retry with backoff; report if persistent.

Code samples

curl "https://api.finradar.ai/api/v1/scrapping/public/{filepath}?api_Token=YOUR_API_KEY" \
  -H "Authorization: Bearer YOUR_JWT_TOKEN"

Generate an API key in /account/credentials to run live queries (literal YOUR_API_KEY placeholder shown until then).

Try it

Related endpoints