We’re using GPT via API inside our mobile biofeedback app (Genius Insight) and want to switch to a cost-effective, self-hosted AI solution. We need a developer or DevOps expert to: • Deploy an open-source model (Mistral 7B, OpenChat, or LLaMA 3) • Host it via a cloud GPU instance (Runpod.io, Vast.ai, or similar) • Serve it using Ollama, vLLM, or Text Generation Inference • Provide a clean, simple REST API endpoint we can call from our app (same as OpenAI-style) • Ensure it’s reasonably fast and stable for 50–500 daily users You should have experience with: • Docker and Linux server setup • LLM model deployment • Hosting models on GPUs (A100, 4090, or T4) • API setup and basic security This is a one-time job, but future maintenance work may be available. Deliverables: • Deployed model + inference server • API docs (how we send prompts + receive replies) • Basic walkthrough so our team understands how to monitor it
Keyword: openai
Price: $800.0
Android JavaScript API Java Python
Estou em busca de um desenvolvedor experiente para criar um sistema de atendimento automatizado utilizando a OpenAI (ChatGPT API) integrado ao WooCommerce. O objetivo é personalizar o atendimento ao cliente, permitindo que o assistente virtual identifique o cliente, con...
View JobQuero criar um aplicativo de apoio emocional com escuta ativa e inteligência artificial. O app se chama DDE (Diário do Despertar Essencial). O usuário escolhe um ‘diário’ com base em seu estado emocional: Diário de Emoções, Diário do Essencial, Diário do Empreendedor In...
View Job