Job Overview We’re looking for an experienced Data Engineer to own and enhance our existing Python-based data pipeline that scrapes public audio data and performs extensive post-processing, including transcription, diarization, and annotation using Rev API and Gemini Pro API. Responsibilities: Take ownership of our existing audio data pipeline Maintain and improve the reliability and efficiency of the current system Develop new features and functionality for audio processing Integrate and optimize usage of external APIs (Rev, Gemini Pro) Implement robust error handling and monitoring Document the system and processes Requirements: Strong Python programming skills Experience building and maintaining data pipelines Experience integrating and working with external APIs Knowledge of best practices for scalable data engineering Fast execution and iterative development mindset Self-motivated with ability to work independently Nice to Have: Experience with speech-to-text systems Familiarity with audio processing techniques Knowledge of speaker diarization techniques Background in audio signal processing Experience with cloud computing platforms Prior work with large language models or AI APIs Project Details: This is an ongoing role with potential for long-term collaboration. The ideal candidate will be able to understand our current system quickly and start making improvements immediately. We value speed of execution and pragmatic solutions over perfection.
Keyword: Machine Learning
Price: $80.0
Python Amazon Web Services Python Script API Artificial Intelligence
Objetivo do Projeto: Criar uma plataforma contábil digital dedicada a um escritório de contabilidade especializado no setor de saúde, oferecendo soluções inovadoras para clínicas, consultórios e profissionais de saúde. A ferramenta deve superar as funcionalidades de con...
View Job