Update checklists/platform-architecture.md

This commit is contained in:
2025-12-05 16:32:23 +00:00
parent fe31d875a4
commit 5886236c88

View File

@@ -6,7 +6,7 @@ a work-through checklist, from Metal → Network → Virtualization → Self-se
* [ ] **Define platform goals** * [ ] **Define platform goals**
* [ ] Write 35 bullet points for what the platform must enable (e.g. “fast provisioning”, “reliable GPU workloads”). * [ ] Write 3-5 bullet points for what the platform must enable (e.g. “fast provisioning”, “reliable GPU workloads”).
* [ ] Define initial SLOs for the *platform itself* (e.g. “VM provision succeeds within 15 minutes 99% of the time”). * [ ] Define initial SLOs for the *platform itself* (e.g. “VM provision succeeds within 15 minutes 99% of the time”).
* [ ] **Define platform customers** * [ ] **Define platform customers**
@@ -41,7 +41,7 @@ a work-through checklist, from Metal → Network → Virtualization → Self-se
* [ ] **Design MAAS/Ironic architecture** * [ ] **Design MAAS/Ironic architecture**
* [ ] Decide MAAS region(s) and rack controllers per site. * [ ] Decide MAAS region(s) and rack controllers per site.
* [ ] Decide where MAAS database/API lives and how its backed up. * [ ] Decide where MAAS database/API lives and how it's backed up.
* [ ] Define access rules to MAAS (who can log in, via what SSO/LDAP/etc). * [ ] Define access rules to MAAS (who can log in, via what SSO/LDAP/etc).
* [ ] **Standardize provisioning pipeline** * [ ] **Standardize provisioning pipeline**
@@ -64,7 +64,7 @@ a work-through checklist, from Metal → Network → Virtualization → Self-se
* [ ] **Define image catalogue** * [ ] **Define image catalogue**
* [ ] Choose base OS (e.g. Ubuntu LTS / Debian stable). * [ ] Choose base OS (e.g. Ubuntu LTS / Debian stable).
* [ ] Define 35 golden images (max), e.g.: * [ ] Define 3-5 golden images (max), e.g.:
* [ ] `base-os` (minimal hardened image). * [ ] `base-os` (minimal hardened image).
* [ ] `infra-node` (for MAAS/OpenStack/Proxmox controllers). * [ ] `infra-node` (for MAAS/OpenStack/Proxmox controllers).
@@ -222,7 +222,7 @@ a work-through checklist, from Metal → Network → Virtualization → Self-se
* [ ] **Resource & naming policy** * [ ] **Resource & naming policy**
* [ ] Define naming for Proxmox clusters and VMs. * [ ] Define naming for Proxmox clusters and VMs.
* [ ] Decide whether teams get self-service Proxmox or its SRE-only. * [ ] Decide whether teams get self-service Proxmox or it's SRE-only.
### 3.4 Configuration Management ### 3.4 Configuration Management
@@ -253,7 +253,7 @@ a work-through checklist, from Metal → Network → Virtualization → Self-se
* [ ] “Provision compute (VM) for a service.” * [ ] “Provision compute (VM) for a service.”
* [ ] “Request GPU capacity.” * [ ] “Request GPU capacity.”
* [ ] “Onboard a new service to monitoring.” * [ ] “Onboard a new service to monitoring.”
* [ ] “See my projects resource usage.” * [ ] “See my project's resource usage.”
* [ ] **For each flow:** * [ ] **For each flow:**