Web Scrapper to generate clean pdf with data


We are looking for an experienced developer to create a web scraper that extracts data from PDFs and formats it into a clean, structured PDF report. The final output should match the professional, well-organized format of the attached sample. Responsibilities: Develop a web scraper that extracts relevant market data from PDFs Process and structure the extracted data into a formatted report Generate a PDF output that matches the provided example in terms of layout, design, and readability Ensure the scraper can handle multiple PDF formats and variations in data presentation Automate the process to allow seamless updates Requirements: Proficiency in Python (BeautifulSoup, PyPDF2, PDFPlumber, or similar) Experience with data extraction and parsing from PDFs Strong skills in generating structured PDFs using libraries like ReportLab or Pandas Ability to automate the workflow for ongoing data updates Attention to detail to ensure accuracy in data presentation Bonus Skills: Experience with OCR (Tesseract or other) for scanned PDFs Familiarity with cloud-based automation solutions Deliverables: A working script or web-based tool for PDF data extraction A formatted PDF output replicating the provided example Documentation on usage and maintenance We have a somewhat working python script that has many of the required nessecary links and calculations.

Keyword: cloud

 

Part-Time Legal Tech Automation Developer (Remote)

Part-Time Legal Tech Automation Developer (Remote) Our firm is seeking a part-time, hourly developer with expertise in automation, legal tech, and process optimization. We are looking for a highly skilled tech professional to help us streamline our operations by automa...

View Job
ESP32 Programming for Low-Latency Modbus Communication with Solar Inverters via MQTT

We are looking for an experienced embedded systems developer to program an ESP32 microcontroller for real-time communication between solar inverters and a cloud server. The device should support Modbus RTU/TCP for data exchange with inverters and utilize MQTT for cloud ...

View Job
Python Developer for Varied Project Tasks

N/D

View Job