Work

Projects and public work.

A focused list of work with public proof: repositories, write-ups, live links, or enough context to understand what was built.

01

AI risk evaluation

AI Risk Evaluation Workbench

A lightweight evaluation workbench for comparing open-source and frontier AI assistants before enterprise deployment.

It surfaces measurable differences between model backends through structured CSV results, one-page PDF reports, and auditable risk findings covering hallucination, bias, jailbreak resistance, and cost.

  • Python
  • Chainlit
  • LLM evaluation
  • Guardrails
  • Modal

02

LLM evaluation

AdversarialBench

A public JavaScript project for testing language models against adversarial prompts and safety-focused evaluation cases.

It turns model-safety questions into repeatable prompt runs instead of one-off screenshots.

  • JavaScript
  • LLM evaluation
  • Adversarial prompts
  • MIT license

03

Automation plugin

GitHub issue resolver plugin

An OpenClaw plugin for fetching and analyzing GitHub issues, built from the hackathon work I wrote about.

It shows the practical direction of this site: using AI-assisted tooling to reduce repetitive project work.

  • GitHub workflow automation
  • LLM tooling
  • OpenClaw
  • JavaScript

04

AI workflow

Memoria Dashboard

A dashboard and CLI workflow for syncing Gemini CLI conversations across devices.

The public README documents cloud sync, per-project sync, copyable commands, and automatic resume behavior.

  • React
  • TypeScript
  • Supabase
  • Gemini CLI