Data Migration Between Databases with Docker and Python


$50.00

Perform data migration from a PostgreSQL database to MySQL using Docker Compose to set up containers. Develop ETL scripts in Python to manage the extraction, transformation, and loading (ETL) process, including data cleaning to handle missing, duplicate, and inconsistent values. Use a Kaggle database as the source, prioritizing large and complex datasets. Kaggle: https://www.kaggle.com/datasets/olistbr/brazilian-ecommerce Tasks: Setting up Databases in Containers: Configure PostgreSQL and MySQL in separate containers using Docker Compose. Create databases and tables in both databases, ensuring compatibility for migration. ETL Development: Extraction: Import data from the selected Kaggle database into PostgreSQL. Transformation: Perform cleaning and standardization, addressing: Identifying and handling missing values. Removing or adjusting duplicate records. Fixing data inconsistencies. Loading: Migrate the transformed data to MySQL. Data Validation: Conduct robust validations, such as: Quantitative: Compare the number of records between databases to ensure consistency. Qualitative (optional): Review data samples to ensure successful transformation. Modeling and Architecture: Structure the project based on a star schema or snowflake schema diagram, as appropriate for the chosen dataset. Document the overall architecture, including table relationships and ETL processes. Deliverables: Present Python scripts, the project diagram, and a data validation report. Provide well-separated scripts for execution on your machine, along with an installation tutorial.Category: IT & ProgrammingSubcategory: Data ScienceProject size: SmallIs this a project or a position?: ProjectRequired availability: As needed

Keyword: Docker

Price: $50.0

Data Modeling MySQL PostgresSQL Python Docker

 

Instalar bot do Discord no cPanel

Tenho um bot que quero instalar no meu cPanel. Esse bot tem dois métodos de instalação: Docker e Python.Me chama que envio o link.

View Job
Mapeamento de operação atraves de dados demograficos com geolocaliz...

1. Definição dos Requisitos Objetivo do Software: Mapear operações (como vendas, distribuição, logística, etc.) Com base em dados demográficos (idade, gênero, renda, etc.) E geolocalização (latitude, longitude). Funcionalidades Principais: Importação de dados demográfic...

View Job
Implementar docker do github no azure

Preciso de um implementador de um projeto do github em docker e publicar no meu azure https://github.com/simonrob/email-oauth2-proxyCategory: IT & ProgrammingSubcategory: Other

View Job