(ethical) ai · food & agriculture · the legal system · education · open source · data centers and the energy buildout
a few things i'm working on or have shipped recently:
- books — the math inside the machine (how intelligence emerges from eleven simple operations) and this is server country (ai, power, and the remaking of rural america). author page on amazon.
- slopython — a "slop" experiment: iteratively evolving cpython with frontier llms to chase performance gains and see what breaks.
- linux uml redesign — rebuilding the linux user-mode-linux kernel from scratch, with notes.
- glaurung — a new ghidra. modern, scriptable, less crufty.
- nupunkt + charboundary — fast, deterministic sentence boundary detection for legal text.
- opengloss — open english lexical knowledge graph (537k senses, 9m semantic edges, generated end-to-end with llms in under a week for less than $1k).
- binary-30k + binary-bpe tokenizers — first heterogeneous binary-analysis dataset and tokenizers built for executables.
- parallel iliad: brainrot edition — a parallel greek-english reader of homer's iliad with ai-generated brainrot translations. genuinely useful, also genuinely cursed.
- moratorium nation — 113-page survey of 116 moratoria across 30 states targeting data centers, solar, wind, and batteries.
- president — alea institute (non-profit; copyright-clean ai for the legal system)
- ceo — 273 ventures (legal data infrastructure)
- cto — licens.io
- ceo — bommarito consulting
academic affiliations: codex (stanford), msu college of law (adjunct). past: chicago-kent law lab (head of research), umich complex systems (lecturer).
ceo, lexpredict (acquired 2018) · plus the lexnlp / openedgar / "gpt takes the bar exam" / network-of-supreme-court / complexity-of-the-u.s.-code work people still cite.
~50 papers, 3,000+ citations across legal informatics, network science, quantitative finance, and ai. science, quantitative finance, cambridge university press, ssrn, arxiv.
- OpenMPSC Data — blog ·
2026-04-30 - linux-drivers.com — blog ·
2026-04-01 - tokenizing raw executables for malware analysis with bbpe — blog ·
2026-03-19 - building a 150K-word english dictionary with llms: opengloss — blog ·
2026-03-19 - fast zero-dependency sentence splitting in python with nupunkt — blog ·
2026-03-19 - Parallel Iliad: Brainrot Edition — blog ·
2026-03-06
bio, blog, wiki, publications, gallery, bookmarks → michaelbommarito.com
email · linkedin · x · scholar





