Roy Zhu · Waterloo, ON

Big systems taught me to build small ones.

Seventeen years of commerce, gaming, and platform engineering, from Beijing-scale distributed systems to a single SQLite file. Now consulting from Waterloo, learning every new field the same way: build something real, find where it breaks, write down what holds up.

Read writing About Roy RSS

Featured Projects

codebase-rag

A Python RAG system for chatting with codebases: chunking source code into a vector store, then using LLMs for semantic search and natural-language Q&A over code.

PythonRAGLLMVector Search

View case study GitHub

moziBot

A TypeScript multi-channel AI agent runtime: Telegram and Discord connected to one runtime with session persistence, task scheduling, job resumption, and structured observability.

TypeScriptAI AgentsNode.jsTelegramDiscord

View case study GitHub

View All Projects

Recent Posts

The AI Memory Funnel: Triaging and Crystallizing Agent Context
Updated:Jun 20, 2026 at 06:00 AM
AI memory isn't just dumping logs into a vector DB. It's a lifecycle. We built a three-tier funnel in membox—Trace to Unit to Crystal—that automatically decays noise, consolidates facts, and crystallizes high-frequency context without ever explicitly deleting anything.
Decoupling LLM Extraction: Building an Async Ingestion Queue in SQLite
Updated:Jun 14, 2026 at 07:00 AM
LLM extraction is brutally slow. We solved it without a message broker: a single SQLite table acts as a durable async queue, a transient worker process drains it, and a lease in the meta table guarantees only one worker runs at a time, with built-in crash recovery.
Measuring AI Memory: Fusing BM25 with Graph Traversal
Updated:Jun 14, 2026 at 06:00 AM
Why pure graph traversal fails on long passages and pure vector search loses logical links, and how fusing SQLite FTS5 BM25 with graph BFS plus a dynamic token budget knapsack cutoff hit 92.3% recall in our offline setting.
Concurrency in Local-First AI: Making SQLite Multi-Agent Safe
Updated:Jun 10, 2026 at 08:00 AM
How to fix SQLite database locks when multiple AI agents write to memory simultaneously, using WAL mode, threading.local, and application-layer RLocks.

All Posts

Big systems taught me to build small ones.

Featured Projects

codebase-rag

moziBot

Recent Posts

The AI Memory Funnel: Triaging and Crystallizing Agent Context

Decoupling LLM Extraction: Building an Async Ingestion Queue in SQLite

Measuring AI Memory: Fusing BM25 with Graph Traversal

Concurrency in Local-First AI: Making SQLite Multi-Agent Safe