For business sponsors
Executive summary
Skim this before the full guide. Technical detail follows in the sections below.
- Decision
- How NIST, ISO, and vendor RAI programs map to what you will actually build.
- Primary metric
- Control map completed for one use case with named owners and evidence.
- Stop rule
- Do not claim certification from documentation alone without operating artefacts.
Related worked example
Human-in-the-Loop Approval GatesNeed facilitation on this topic? Start a conversation.
Use frameworks as shared vocabulary
Frameworks do not replace architecture. They give risk, legal, and engineering a shared checklist aligned to NIST AI RMF or ISO 42001.
Map each control to a pattern on reference architecture patterns you can show in a workshop, not principles alone.
Sponsors use framework language in board packs. Engineers use it to prioritise telemetry, evals, and OpenAI safety best practices.
Pick one primary framework for external messaging, then crosswalk vendor responsible AI programs in an appendix.
- One-page control map per pilot scoping use case
- Owner named for each control theme (NIST AI RMF Govern)
- Demo pattern linked on reference architecture patterns, not only policy text
- Evidence artefact in AWS AI compliance
- Review map quarterly as scope grows (production readiness conversation)
NIST AI RMF: Govern
Govern covers accountability, policies, and change control for prompts, indexes, and models.
Translate Govern into a RACI for the AI council (NIST AI RMF Govern), a prompt change process in CI, and production readiness (checklist guide) gates.
Auditors ask who approved go-live per production readiness conversation and how HITL overrides are logged.
Your programme office should maintain govern artefacts per NIST Govern, not scatter them across wikis.
- RACI for pilot and production (NIST AI RMF)
- Prompt change control in Git (production readiness conversation)
- Model register with owners (production readiness conversation)
- Exception process (NIST AI RMF Govern risk acceptance)
- Steering committee minutes archived (program office)
NIST AI RMF: Map
Map covers context, use-case boundaries, and data flows per NIST playbook Map function.
Use choosing cloud stack and Copilot coexistence to show where the system starts and stops.
Document prohibited uses and data classes per Microsoft Copilot data protection.
Update maps when you add tools or connect a new system of record.
- Use-case charter (NIST AI RMF)
- Data-flow diagram with trust boundaries (security controls)
- Channel matrix: Microsoft 365 Copilot vs custom
- Subprocessor table (AWS AI compliance)
- Prohibited data examples agreed with legal (Copilot privacy)
NIST AI RMF: Measure
Measure covers evaluations, monitoring, safety metrics, and operational KPIs.
Run golden set (OpenAI evals) evals before each prompt promotion and track citation rate, unknown answers, and tool safety.
Observability examples show latency via Foundry monitor and OpenAI evals dashboards.
Define thresholds upfront so debates use golden set numbers, not opinions.
- golden set (OpenAI evals) eval in CI with pass thresholds
- Content Safety and Bedrock Guardrails scores
- Production dashboards (telemetry) with alert routes
- Bias checks where required (Azure responsible AI)
- Monthly metric review with sponsors (NIST AI RMF Govern)
NIST AI RMF: Manage
Manage covers incident response, human override, decommissioning, and continuous improvement.
human-in-the-loop (OpenAI safety best practices) approval patterns show how writes are proposed then executed after supervisor sign-off.
Incident examples illustrate disable-tools drills per NIST Manage.
Decommission retired pilots from Teams and Entra to prevent shadow usage.
- HITL queues with SLA and audit export
- Disable-tools drill annually (MITRE ATLAS)
- Incident runbook (AI security controls)
- Post-incident review adds golden set cases
- Decommission checklist for apps and API keys
ISO/IEC 42001: operating model
ISO 42001 (standard overview) is an AI management system standard useful when procurement asks for certified process, not only technical controls.
It aligns with production readiness conversation, NIST AI RMF Govern roles, and management review.
Do not claim certification because you read a guide. Show artefacts ISO 42001 expects.
Map ISO clauses to ITIL processes and NIST AI RMF Playbook control IDs to reduce duplicate paperwork.
- Management review cadence (NIST AI RMF Govern quarterly)
- Documented AI policy (Microsoft responsible AI)
- Competence records for champions and security liaisons
- Internal audit schedule (AWS AI compliance)
- Corrective action log with owners (NIST Manage)
Vendor responsible-AI programs
Each major provider publishes responsible-AI guidance complementing NIST AI RMF and ISO 42001.
Reference vendor programs when aligning with choosing cloud stack anchor.
Use Content Safety and Guardrails as implementation detail, not substitute for evals.
Track transparency when models change (Anthropic scaling policy).
- Microsoft Responsible AI and transparency notes
- AWS Responsible AI and Bedrock Guardrails
- OpenAI safety best practices
- Anthropic responsible scaling
- Crosswalk vendor control to your control ID (NIST AI RMF Playbook)
Workshop exercise
In a 90-minute session, pick one use case from NIST AI RMF and fill a one-page control map.
Each row lists framework control, owner, reference architecture patterns pattern, and eval or log evidence.
Participants leave with assignments aligned to NIST AI RMF Govern intake.
Store output in AWS AI compliance folder the same day.
- Pick HR, ITSM, or RAG policy Q&A use case
- Assign owner per NIST Govern, Map, Measure, Manage row
- Link to example slug on reference architecture patterns
- Evidence due in two weeks (production readiness conversation)
- Follow-up before production promotion
Board and audit language
Translate controls into outcomes: who is accountable, what is logged, how overrides work, and how models change over time.
Avoid jargon without definitions. Say human approval for CRM writes, not only HITL (OpenAI safety best practices).
Provide trend metrics month on month via OpenAI evals, not a launch-day snapshot.
When auditors visit, open AWS AI compliance index first.
- Accountability chart (NIST AI RMF Govern RACI)
- Three KPIs maximum (OpenAI evals)
- Exception register with expiry (AI security controls)
- Third-party assurance letters (ISO 42001) if applicable
- Plain-language glossary: HITL = human approval
Evidence artefacts by control theme
Keep a folder per pilot: architecture, data-flow, eval report, Bedrock logs, HITL screenshot, runbook.
Name files so reviewers find NIST Govern vs Measure artefacts quickly.
Redact customer data per data privacy. InfoSec needs realistic event types.
Version folder when prompts change. Auditors compare point-in-time NIST AI RMF Playbook evidence.
- Govern: RACI, prompt change process
- Map: channel matrix, residency note
- Measure: eval dashboard, safety scores
- Manage: HITL (OpenAI safety best practices) metrics, disable-tools drill
- Index spreadsheet on the cover sheet
What good looks like
Good looks like engineering and risk using the same control IDs in Jira and AWS AI compliance.
Good looks like reference architecture patterns demos matching pilot scoping boundaries.
Good looks like NIST AI RMF accelerating decisions, not duplicate paperwork.
Good looks like retired pilots removed from maps per NIST Manage decommission checklist.
- Control map signed by sponsor and risk (NIST AI RMF Govern)
- Every row has evidence link or dated gap
- Workshop output filed in 48 hours
- Quarterly review on program steering agenda
- reference architecture patterns cited as patterns, not certification
Common mistakes
Teams paste NIST AI RMF text into slides without linking to running systems.
Teams claim ISO 42001 alignment with no NIST AI RMF Govern management review.
Teams ignore Map when adding tools, then fail prompt injection reviews.
Teams measure only launch-week quality, then cannot answer OpenAI evals trend questions.
- Principles without owners (NIST AI RMF Playbook rows empty)
- Different vocabulary in IT and legal docs
- Vendor safety toggles with no golden set eval
- Copilot coexistence channel map missing
- Security evidence pack stale after prompt change