Job Description: We're looking for an experienced Python web scraping developer to build a script or lightweight app that can log into a paid user account on one or more public-facing websites and extract structured data based on specific keywords and conditional logic (including identifying location-related details). You’ll be working with me (the founder) to build the MVP version of a tool that will gather high-value information from a select set of websites behind paywalls. The project is fully legal and compliant with terms for authorized users—we are seeking to automate our own access, not scrape data illegally. What You'll Build: A web scraper that: Logs into websites using valid paid credentials (we’ll provide test accounts) Navigates through paginated or dynamic content (may require JavaScript rendering) Searches for and extracts text containing targeted keywords Applies custom logic rules to identify or infer locations from unstructured text Stores results in a structured format (CSV, JSON, or sends to a database) Reusable and modular codebase, ideally with documentation and ability to scale Key Requirements: Strong experience with Python scraping frameworks: Playwright or Selenium (Playwright preferred) BeautifulSoup, lxml, requests, or similar Familiarity with handling: Login sessions, cookies, tokens Paywalls and browser-like behavior (headless scraping) Ability to write clean conditional logic and use regex/NLP to extract and categorize data (especially geographic/location-based info) Experience with basic NLP libraries like spaCy or nltk is a plus Able to respect rate limits and use proxy/anti-bot tools if needed Fluent English communication and responsiveness Nice-to-Have (Bonus): Prior experience scraping real estate, job boards, B2B directories, or other structured databases API design (Flask, FastAPI) for packaging this as a microservice later Familiarity with geolocation tools (Google Maps API, Geopy, etc.) You must sign a non-disclosure agreement (NDA) before access to project details. Optional: Non-compete agreement, especially if the project scales. Scope: MVP (initial script for 1–2 sites), with opportunity for ongoing work Budget: Open to fixed-price or hourly. Please propose your preferred structure.
Keyword: Data Analysis
Price: $25.0
Data Scraping Data Extraction Python Selenium Scrapy Web Crawling