I am looking for a response that include a project plan for accomplishing. I need someone to help me build this out that has done this before, I prefer a fixed fee but open to hourly with a good project plan you can deliver to. The right person will assist with the additional phases of this project. This is only phase 1 and it is extremely minimal. I want to have someone start no later than the 31st of March. If I can find the right individual we will start sooner. Objective: Build out a minimal version of an AI RAG service Time Frame: 2 weeks Scope of Project: Phase 1 - build out a minimal version of RAG in two weeks. Success criteria 1) JWT (username password) and OAuth 2.0 for authentication (google and Microsoft) 2) From Web UI query private LLM and get response 3) From Web UI attach a file as part of the query 4) History per chat session stored and leveraged for context as part of chat session 5) Leverage API from make.com and be able to query LLM and get a response to make.com 6) Leverage API from make.com and be able to select a document from the Vector DB as part of the query to the LLM 7) Put a document blob storage and it is ingested and chunked into the DB (sizes will be provided) High Level Architecture Layer Recommended Tool LLM Host Ollama (version will be provided) GPU H100 RAG Framework LlamaIndex Vector Store Weaviate Frontend/API FastAPI & React for UI & Document Ingestion & Chunking Docs Loader / Orchestration LlamaIndex Storage/Auth Blob storage Cloud Azure Container Kubernetes Data Sources Layer: • Document ingestion processes • Storage using Azure Blob Storage • Document loading via LlamaIndex Processing Pipeline: • Handles document chunking and indexing • Uses LlamaIndex as the RAG framework for orchestration Vector Database: • Weaviate as the vector store for embeddings • Provides semantic search capabilities LLM Engine: • Ollama as the LLM host • H100 GPU for acceleration • Handles query processing and response generation Frontend: • React-based user interface • Provides chat interaction capabilities and documentation upload API Gateway: • Built with FastAPI • Manages REST endpoints and routes requests Auth/Storage Layer: • Handles user management and permissions • Connects to Azure Blob Storage for document persistence There will be a phase 2 of this project and is outside the scope of the request. The second phase will be to make this a multitenant version, improve availability and tune for performance. It will include enhancements to the interface. Phase 1 is extremely minimal with no focus on features, performance, scaling or availability.
Keyword: React Native
Price: $60.0
React Kubernetes Python Retrieval Augmented Generation NLP Tokenization FastAPI LLM Prompt Engineering Web Development
Este projeto é um módulo de um aplicativo de hotelaria. Neste módulo o usuário irá reservar a mesa do restaurante que ele vai utilizar durante toda sua hospedagem. O detalhe é que esta escolha deverá ser feita clicando sobre a mesa diretamente em um mapa do ambiente do ...
View JobI’m looking for an experienced developer to build a powerful, conversational AI assistant designed specifically for apartment locators. The AI will leverage a large language model (e.g., GPT-4) to: • Understand and interpret natural language from apartment locators, cap...
View JobO **Projeto Programador** é uma iniciativa que visa preparar pessoas para atuar como programadores ou desenvolvedores de software, independentemente de seu nível atual de conhecimento. Ele pode variar dependendo de quem está promovendo (escolas, empresas, comunidades, o...
View Job