pdf2struct: extract structured JSON from PDFs (text, metadata, tables, OCR, invoice key-value fields).