AI Engineer – Local Setup for Private LLM System (South Bend, IN – On-Site)


$45.00
Hourly: $45.00 - $70.00

Seeking an experienced AI/ML engineer for a one-time on-site setup of a high-performance local LLM system. This project involves configuring and optimizing a large open-weight language model (LLaMA 4 – 70B) for use in a secure, offline private research environment. Responsibilities will include: • Installing and configuring LLaMA 4 (Maverick version) locally on a high-performance Ubuntu system with RTX 6000 Ada GPU • Setting up token streaming or prompt-response architecture using vLLM, Ollama, or similar inference stack • Building a lightweight FastAPI (or CLI) interface for model interaction • Implementing logging of inputs/outputs to disk in JSON or plain text • Assisting with setup of a local embedding model (e.g., MiniLM or BGE) for vector search/memory recall Requirements: • Prior experience running large models locally (13B–70B) • Familiarity with GPU inference and memory optimization (without quantization) • Strong Linux skills (Ubuntu CLI) • Security-first mindset; must respect that the system is fully airgapped • Ability to communicate clearly and implement from spec Nice to have (not required): • Familiarity with LangChain, LangGraph, or agent orchestration frameworks • Knowledge of inference schedulers, token streaming, or routing logic Project Details: • Estimated time: 1–1.5 working days total • Compensation: Rate negotiable — please include your typical hourly or day rate when applying • Location: Must be available to work on-site in South Bend, IN • Security: NDA will be required

Keyword: Web Programmer

Price: $45.0

 

Performance Optimization for .NET Core, Entity Framework, SQL Server, and Angular Application

We are looking for an experienced developer to optimize the performance of our web application built with .NET Core, Entity Framework, SQL Server, and Angular. The application is currently experiencing slow API responses, inefficient database queries, and high load time...

View Job
Postgresql/typeorm/express/typescript -- build APIs and manage GCP DB

Our backend stack is hosted on Google Cloud SQL + Postgresql. We are using typeorm/express/typescript. Need someone to help build out APIs, write tests, and manage the database generally. Need work to start immediately depending on how it's going may extend to anywhere ...

View Job
Custom AI Helpbot Development for Non-Profit Training Resource Site

We are seeking an experienced programmer to develop a custom AI helpbot for a non-profit website. The helpbot will assist users in finding relevant training and resources effectively (ex: I am looking for xyz). The ideal candidate will have a strong background in AI pro...

View Job