build
- deterministic LLM-agent evals
- oracle-as-sole-authority scoring
- TypeScript/Python tooling
- reproducible security datasets
Offensive AI-security engineer — I build deterministic harnesses that prove and disprove agent exploits.
~6 vulnerabilities responsibly disclosed and submitted to MITRE (CNA-LR, IDs pending assignment); a plugin authorization-bypass (CWE-862) submitted to a major bug-bounty program; coordinated disclosures to Chile's national CSIRT / ANCI.
Built deterministic eval harnesses, disclosure tooling, and reproducible security artifacts.
Held operations leadership roles before moving into independent security research.