We’re using GPT via API inside our mobile biofeedback app (Genius Insight) and want to switch to a cost-effective, self-hosted AI solution. We need a developer or DevOps expert to: • Deploy an open-source model (Mistral 7B, OpenChat, or LLaMA 3) • Host it via a cloud GPU instance (Runpod.io, Vast.ai, or similar) • Serve it using Ollama, vLLM, or Text Generation Inference • Provide a clean, simple REST API endpoint we can call from our app (same as OpenAI-style) • Ensure it’s reasonably fast and stable for 50–500 daily users You should have experience with: • Docker and Linux server setup • LLM model deployment • Hosting models on GPUs (A100, 4090, or T4) • API setup and basic security This is a one-time job, but future maintenance work may be available. Deliverables: • Deployed model + inference server • API docs (how we send prompts + receive replies) • Basic walkthrough so our team understands how to monitor it
Keyword: openai
Price: $800.0
Android JavaScript API Java Python
Analytical Investments LLC is seeking a highly capable AI Developer or agency to build and hand off a full-scale, production-grade AI system that powers our core financial analytics platform. As a fintech and educational insights company, we provide real-time, complianc...
View JobQuero criar um Micro SaaS Integrado ao WhatsApp com n8n e OpenAI. O objetivo é automatizar processos e oferecer respostas inteligentes usando n8n (automação de fluxo de trabalho) + OpenAI (IA generativa).Category: IT & ProgrammingSubcategory: OtherIs this a project ...
View JobNos gustaría poder hacer que el usuario hablara directamente con un Chatbot que tenga acceso a datos de reservas de autocaravanas así como las características. Trabajamos con dos endpoints de la API de la aplicación de reservas de autocaravanas: listado de vehiculos y l...
View Job