Workspace template

Llama 3.1

Heavyweight 8-billion-parameter variant served through Ollama on a single GPU-capable host, fronted by an OpenWebUI chat interface. Requires roughly 16 GB of RAM and is tuned for longer-context reasoning tasks.

Recipes in this template

Primary

Ollama (Llama 3.1 8B)

Open WebUI

Hire on EasyEnv

Hire engineers who actually ship with Llama 3.1

Stop guessing from resumes. Drop candidates into this exact workspace, watch them build and operate it end-to-end, score the result automatically, and replay the session. We also evaluate how they work with AI.

See how it works Request a demo

Related jobs

View all roles

Role

Hire AI Engineer

Hire AI engineers who ship LLM features that hold up in production. Real RAG pipelines, real eval harnesses, real prompt regressions to debug. Live or take-home in a browser workspace.

Role

Hire LangChain Developer

Hire LangChain developers in real Python or TypeScript workspaces with real chains, real tools and real eval sets. Live or take-home.

Role

Hire PostgreSQL Engineer

Hire PostgreSQL engineers in real databases with real plans, real indexes and real locks. Live or take-home in a browser.

Role

Hire Accessibility Engineer

Hire accessibility engineers in real workspaces with real screen-reader runs, real WCAG audits and real fixes. Live or take-home.

Related templates

Template

DeepSeek

Run DeepSeek locally with OpenWebUI for advanced reasoning and coding assistance, fully offline.

Template

Gemma 3

Google's Gemma 3 (1B) language model served locally via Ollama with OpenWebUI - a compact, efficient open-weights model suitable for resource-friendly on-device AI.

Template

Llama 3.2

Run Meta Llama 3.2 locally with OpenWebUI for a private, fully offline AI chat experience.

Template

Qwen

Run Alibaba Qwen locally with OpenWebUI for a private, multilingual AI assistant.

Browse all templates