Endpoints
Get extraction results
Get the results of a schema extraction job by polling
GET
/
extract
/
{extraction_id}
Response structure
The response contains extracted data organized hierarchically. Each field in extracted_data
can be either:
- A primitive field (like
invoice_number
) - An object containing nested fields (like
customer
) - An array of objects (like
line_items
)
Each primitive field contains:
name
: Field identifiervalue
: Extracted contentconfidence
: Accuracy score (0-1)field_type
: Data type (string, number, date, phone, email)page_index
: Page from which the value was extractedsource_context
: Extraction context details
See below for an example with primitives, objects and an array:
Headers
Authorization
string
requiredBearer token for authentication
Path Parameters
extraction_id
string
requiredID of the extraction job
Response
200 - application/json
extraction_id
string
requiredID of the extraction job
file_id
string
requiredID of the file being processed
status
enum<string>
requiredStatus of the extraction job
Available options:
PENDING
, PROCESSING
, COMPLETED
, FAILED
created_at
string
requiredWhen the job was created
schema_name
string
requiredName of the schema used for extraction
schema_description
string
requiredDescription of the schema used for extraction
extracted_data
object
requiredObject containing extracted data. See above for more information about the structure of this object.
processing_metadata
object
required