Seeking an experienced Database Architect & Data Pipeline Developer to design, build, and maintain an ETL pipeline that collects and processes athlete data from multiple sources using MongoDB and Python. The ideal dev will refine open-source scrapers (see below), or develop new ones if they prefer, and wrap them into scalable APIs, ensuring efficient data extraction, transformation, and loading while maintaining data integrity and real-time updates.

Summary of the skills and experience required:
-Database Design & Architecture: Structure a MongoDB database optimized for storing, indexing, and querying sports data.
-ETL Pipeline Development: Establish an automated Extract, Transform, Load (ETL) process to ingest real-time data from multiple sources.
-Web Scraping & API Development: Customize and improve existing open-source scrapers, wrapping them into APIs for real-time data retrieval.
-Data Merging & Normalization: Use a fuzzy-matching algorithm to match and merge athlete data across different sources, ensuring consistency and resolving minor name discrepancies between datasets. We can provide one on request.
-Pipeline Automation: Ensure real-time updates by integrating data pipeline scheduling tools.
-Error Handling & Performance Optimization: Build logging, monitoring, and error-handling mechanisms to maintain a high level of reliability.

Budget is firm. We respect developers' right to set hourly rates as they see fit. Equally, please respect our right to set our budget as we see fit. If our budget is less than your minimum rate, we ask that you do not apply. For devs who prove they can reliably finish projects at a high level, there will be more work at significantly higher rates.

To apply, please submit your interest and availability for a video call (camera on). All LLM-generated responses and replies will be disqualified.
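To illustrate the Data Merging & Normalization requirement above, here is a minimal sketch of fuzzy name matching between two sources. It is not the algorithm mentioned in the posting (that one is available on request); it assumes Python's standard-library `difflib.SequenceMatcher` as the similarity measure, and the function names and the 0.85 threshold are hypothetical choices for illustration only.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize_name(name: str) -> str:
    """Lowercase, strip accents and punctuation, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in no_accents)
    return " ".join(cleaned.lower().split())


def name_similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize_name(a), normalize_name(b)).ratio()


def match_athletes(source_a, source_b, threshold=0.85):
    """Map each name in source_a to its best match in source_b, or None.

    The threshold is an assumed value; it would need tuning on real data.
    """
    matches = {}
    for name in source_a:
        best = max(source_b, key=lambda cand: name_similarity(name, cand), default=None)
        if best is not None and name_similarity(name, best) >= threshold:
            matches[name] = best
        else:
            matches[name] = None
    return matches


if __name__ == "__main__":
    print(match_athletes(["Jon Smith", "Maria Lopez"], ["John Smith", "Wei Chen"]))
```

Comparing every name against every candidate is quadratic, so a real pipeline would likely narrow the candidates first (for example by team or birth year) before scoring names.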
Price: $900.00
MongoDB, Python, Selenium, Database Architecture, ETL Pipeline, API