Perform data migration from a PostgreSQL database to MySQL using Docker Compose to set up containers. Develop ETL scripts in Python to manage the extraction, transformation, and loading (ETL) process, including data cleaning to handle missing, duplicate, and inconsistent values. Use a Kaggle database as the source, prioritizing large and complex datasets.

Kaggle: https://www.kaggle.com/datasets/olistbr/brazilian-ecommerce

Tasks:

Setting up Databases in Containers:
- Configure PostgreSQL and MySQL in separate containers using Docker Compose.
- Create databases and tables in both databases, ensuring compatibility for migration.

ETL Development:
- Extraction: Import data from the selected Kaggle database into PostgreSQL.
- Transformation: Perform cleaning and standardization, addressing:
  - Identifying and handling missing values.
  - Removing or adjusting duplicate records.
  - Fixing data inconsistencies.
- Loading: Migrate the transformed data to MySQL.

Data Validation: Conduct robust validations, such as:
- Quantitative: Compare the number of records between databases to ensure consistency.
- Qualitative (optional): Review data samples to ensure successful transformation.

Modeling and Architecture:
- Structure the project based on a star schema or snowflake schema diagram, as appropriate for the chosen dataset.
- Document the overall architecture, including table relationships and ETL processes.

Deliverables:
- Present Python scripts, the project diagram, and a data validation report.
- Provide well-separated scripts for execution on your machine, along with an installation tutorial.

Category: IT & Programming
Subcategory: Data Science
Project size: Small
Is this a project or a position?: Project
Required availability: As needed
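To make the transformation and quantitative validation tasks concrete, here is a minimal sketch of what those steps could look like. It is not a prescribed implementation: the function names `clean_rows` and `compare_counts`, the `"unknown"` default, and the sample column names are all hypothetical, and rows are plain Python dictionaries so the example runs without any database connection. In a real pipeline the rows would be read from PostgreSQL and the counts would come from a `SELECT COUNT(*)` on each side.

```python
def clean_rows(rows, key, defaults=None, lowercase=()):
    """Return cleaned copies of rows.

    - strips whitespace and treats empty strings as missing
    - fills missing values from `defaults` where a default is given
    - lowercases the columns listed in `lowercase`
    - drops rows with a missing key and keeps only the first row per key
    """
    defaults = defaults or {}
    seen = set()
    cleaned = []
    for row in rows:
        fixed = {}
        for col, value in row.items():
            if isinstance(value, str):
                value = value.strip() or None  # "" becomes missing
            if value is None:
                value = defaults.get(col)      # stays None if no default
            if col in lowercase and isinstance(value, str):
                value = value.lower()
            fixed[col] = value
        if fixed.get(key) is None or fixed[key] in seen:
            continue  # unusable key or duplicate record
        seen.add(fixed[key])
        cleaned.append(fixed)
    return cleaned


def compare_counts(source_counts, target_counts):
    """Return {table: (source, target)} for tables whose row counts differ."""
    tables = sorted(set(source_counts) | set(target_counts))
    return {
        t: (source_counts.get(t), target_counts.get(t))
        for t in tables
        if source_counts.get(t) != target_counts.get(t)
    }


# Hypothetical sample rows; column names are assumptions, not the real schema.
raw = [
    {"order_id": "a1", "order_status": " Delivered ", "customer_city": "sao paulo"},
    {"order_id": "a1", "order_status": "delivered", "customer_city": "sao paulo"},
    {"order_id": "b2", "order_status": "", "customer_city": None},
    {"order_id": None, "order_status": "shipped", "customer_city": "rio"},
]
cleaned = clean_rows(
    raw,
    key="order_id",
    defaults={"order_status": "unknown"},
    lowercase=("order_status",),
)
mismatches = compare_counts(
    {"orders": len(cleaned), "customers": 5},
    {"orders": 2, "customers": 4},
)
```

An empty result from `compare_counts` would mean every table has the same number of records on both sides; any entry it returns is a candidate line for the data validation report.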
Keyword: Docker
Price: $50.0
Data Modeling, MySQL, PostgreSQL, Python, Docker
Summary of Essential Skills: - Front-end: HTML, CSS, JavaScript, React/Angular/Vue.js, Docker for front-end, CI/CD, Figma; - Back-end: .NET (C#), ASP.NET Core, RESTful APIs, Docker for back-end; - Best practices: SOLID, unit and integration testing, security in Co...
I need a developer with experience programming in PHP using the Laravel framework, with knowledge of Docker, Git, GitHub, MySQL, and JavaScript. This person will take over the technical side of the system, which already exists, and will make improvements and ...
I am taking a cybersecurity course and have been asked to run a load test with Gatling; I may need help understanding the task and running the load tests correctly. We have previously installed a Docker container with Juice Shop ...