If you’ve ever wanted a single repo that lets you flip a switch between a locally‑hosted LLM, a hosted OpenAI model, and a Python‑based evaluation suite—while keeping every call visible in a unified observability stack—this is it.

Fabian G. Williams
Principal Product Manager, Microsoft
Mastering Llama 3.3 – A Deep Dive into Running Local LLMs
I compared Llama 3.3 with older models like Llama 3.1 70B and Phi-3, tested Microsoft Graph API integrations using OpenAPI specs, and explored function-calling quirks. The results? Not all models are created equal!
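To give a feel for the kind of function calling being tested, here is a minimal sketch of the pattern most OpenAI-compatible runtimes (including local ones) use: the model emits a tool call with JSON-encoded arguments, and your code parses and dispatches it to a real Python function. The tool name, its schema, and the Graph-style lookup below are illustrative assumptions, not code from the video.

```python
import json

# Hypothetical tool -- a stand-in for a Microsoft Graph /users/{id} lookup.
def get_user_profile(user_id: str) -> dict:
    return {"id": user_id, "displayName": "Ada Lovelace"}

# OpenAI-style function definition; this is the shape you would pass
# in the `tools` parameter of a chat-completions request.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_user_profile",
        "description": "Fetch a user profile by id.",
        "parameters": {
            "type": "object",
            "properties": {"user_id": {"type": "string"}},
            "required": ["user_id"],
        },
    },
}]

def dispatch(tool_call: dict) -> dict:
    """Route a model-emitted tool call to the matching local function.

    Note the quirk: `arguments` arrives as a JSON *string*, not a dict,
    and models differ in how reliably they produce valid JSON here.
    """
    name = tool_call["function"]["name"]
    args = json.loads(tool_call["function"]["arguments"])
    registry = {"get_user_profile": get_user_profile}
    return registry[name](**args)

# Simulated model output: what a function-calling model returns
# instead of prose when it decides to invoke a tool.
fake_call = {
    "function": {
        "name": "get_user_profile",
        "arguments": json.dumps({"user_id": "42"}),
    }
}
print(dispatch(fake_call))  # -> {'id': '42', 'displayName': 'Ada Lovelace'}
```

The dispatch step is where model differences show up in practice: some models emit clean JSON arguments every time, while others occasionally wrap them in prose or drop required fields.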