Train an RL policy to maximize profit in a blackjack environment


Description:

Looking for someone to train an RL policy to maximize profit in a blackjack environment with the a list of game rules. Then create and give me the deterministic policy in a cheatsheet/table format"

Tags: Machine Learning Model

Keyword: Machine Learning

Job Type: Hourly

 

Private project or contest #39292651

N/D

View Job
Fractional CTO/CIO For B2B Platform - ML, Holograms, AR/VR, Geospatial, Global

We are a B2B AI, Digital Twin Platform. We are scaling up the business. This technology supports a global market. We are the first mover in this hundreds of billion market. We have been in market for a 5+ years, proving the technology. Now we are going to scale it. Our ...

View Job
Ia - desarrollador App Llm para colabrar en proyecto

Se busca desarrollador Python con experiencia en Apps LLM (LangChain, OpenAi/Llama3, rag, db vectoriales) para sumarse a equipo en proyecto muy interesante. Es un plus, o alternativa, conocimientos en Deeplearning (para sistema de recomendación en particular). Posibilid...

View Job