Convert Large Number of Files to Text-Readable PDFs


$500.00
Fixed price

Will provide a link to a shared folder with a large number (60-70k) of files of various formats (pdf, png, msg, etc.). Three example files are provided. Primary deliverable will be a folder with all original files converted to PDF format with searchable text. Currently some files (~10k) are text-searchable PDFs; these do not have to be altered. The remaining files should be converted to PDF (if necessary) and then OCRed. High accuracy in the text recognition is essential. File names should be identical to those originally provided except for the file extension. If possible, we would also like a second deliverable: a spreadsheet with three columns: the original file name (with the original extension), the current file name (with the .pdf file extension), and the full extracted text from the document. Thanks for considering us; we look forward to your application!

Keyword: Python

Price: $500.0

Python PDF Conversion Data Extraction OCR Software File Conversion

 

Excel Data Extraction Tool

I need an Excel parser to extract numerical data from multiple sheets with headings. Requirements: - Ability to handle multiple sheets - Accurate extraction of numerical data - Output in a specified format (e.g., CSV, JSON) Ideal Skills: - Proficiency in Python or si...

View Job
Desenvolvedor para criar sistema de licença com chave de ativação p...

Tenho um plugin, feito em AutoLISP, que roda no AutoCAD. Agora quero implementar um sistema de licença simples com chave de ativação para proteger o plugin e evitar compartilhamento indevido. Preciso que o sistema atenda aos seguintes pontos: ✅ O cliente compre e receba...

View Job
Intensive Data Analysis & Stats Tutor Needed

N/D

View Job