Cloud-based coding assistants like Claude Code or GitHub Copilot are powerful—but what happens when you try to bring that experience fully on-premise?
In this talk, we’ll explore the practical journey of building and running a local AI coding setup: choosing models, hosting them on consumer hardware, connecting frontends like LM Studio, and evaluating what really works (and what doesn’t). We’ll discuss trade-offs in latency, memory, and tool integration, the role of the KV cache and model routing, and how far open-source models can go in replicating commercial AI dev environments.
Expect a mix of architecture insights, debugging war stories, and honest conclusions about what’s currently feasible—and what’s still wishful thinking—when it comes to local AI coding.
Alessio Soldano
IBM
Open-source software engineer with over 15 years of experience in the field and people manager of a worldwide distributed and diverse team of engineers. Contributor to RESTEasy and many other successful open-source projects (WildFly, Quarkus, Apache CXF, Apache WSS4J, Apache Santuario, ...).