Work

Projects and public work.

A focused list of work with public proof: repositories, write-ups, live links, or enough context to understand what was built.

AI risk evaluation

AI Risk Evaluation Workbench

A lightweight evaluation workbench for comparing open-source and frontier AI assistants before enterprise deployment.

It surfaces differences between model backends through structured CSV results, one-page PDF reports, and auditable risk findings covering hallucination, bias, jailbreak resistance, and cost.

Python
Chainlit
LLM evaluation
Guardrails
Modal

View repository
Open live app

LLM evaluation

AdversarialBench

A public JavaScript project for testing language models against adversarial prompts and safety-focused evaluation cases.

It turns model-safety questions into repeatable prompt runs instead of one-off screenshots.

JavaScript
LLM evaluation
Adversarial prompts
MIT license

View repository

Automation plugin

GitHub issue resolver plugin

An OpenClaw plugin for fetching and analyzing GitHub issues, built from the hackathon work I wrote about.

It reflects the practical direction of this site: reducing repetitive project work with AI-assisted tooling.

GitHub workflow automation
LLM tooling
OpenClaw
JavaScript

View repository
Read write-up

AI workflow

Memoria Dashboard

A dashboard and CLI workflow for syncing Gemini CLI conversations across devices.

The public README documents cloud sync, per-project sync, copyable commands, and automatic resume behavior.

React
TypeScript
Supabase
Gemini CLI

View repository
Open live app