It’s worth a look, because there’s quite a lot of it, published openly.
Anthropic released Claude’s constitution under CC0 earlier this year. It covers exactly the distinction you’re drawing between immutable rules and overridable defaults, including the reasoning behind each. OpenAI’s Model Spec does much the same job in a different style. Both are public, both are reusable, both are the result of years of work by people working on it full-time.
Similarly, Google DeepMind publishes a Frontier Safety Framework, now v3, plus the FSF reports for each Gemini release. It’s more focused on capability thresholds and severe-risk mitigation than on day-to-day behaviour, so it complements rather than overlaps the OpenAI/Anthropic specs.
Microsoft’s Responsible AI Standard v2, full PDF linked from there, is the most “engineering process” of the lot; six principles operationalised into impact assessments, gating reviews, and concrete product requirements. Useful if you want to show what governance-side rules look like in practice.
Meta’s Llama Acceptable Use Policy are the closest analogues from the open-weights side. Less philosophical than Anthropic’s constitution, more “here’s what you can’t use it for”, but still a published, reusable artefact.
So the foundational work isn’t missing from the field.
Which makes it more useful, I think, to ask the concrete question: in the actual proposal here, what specifically do you think needs different rules than the ones those documents already lay out?
I feel that’s something the team can engage with, where “start from first principles” tends to stall.
In short, beyond Anthropic and OpenAI, Google DeepMind’s Frontier Safety Framework, Microsoft’s Responsible AI Standard, the NIST AI Risk Management Framework, and ISO/IEC 42001, all cover the same ground from different angles.
Open-source coding agents largely inherit these rules from the model layer rather than redefining them, which is itself an answer to “where is the foundational work?”. It’s upstream.