For business sponsors
Executive summary
Skim this before the full guide. Technical detail follows in the sections below.
- Decision
- Which cloud anchor matches residency, IAM, and existing enterprise agreements.
- Primary metric
- One end-to-end workflow demoed on the anchor with logging and safety in place.
- Stop rule
- Do not run two production anchors for the first pilot without explicit program reason.
Related worked example
Enterprise RAG on Azure AI FoundryNeed facilitation on this topic? Start a conversation.
Anchor to existing commitments
The strongest anchor is rarely the newest model headline. It is the cloud where your organisation already holds data processing agreements, identity federation, and operational runbooks. Map commitments against the NIST AI RMF Playbook before comparing vendors.
Start by mapping where customer PII, employee records, and regulated content already live. If that footprint is concentrated in one hyperscaler, your first production pilot should inherit the same logging, backup, and access review patterns. Follow production readiness conversation for go-live gates.
Engineering leads often want a bake-off across three vendors. Sponsors should redirect that energy toward one end-to-end workflow on the anchor that procurement already audits. Use vendor and model selection rubrics, not keynote demos.
A useful workshop question is: who can operate the control plane in production on day one, not who gave the best keynote demo last quarter? Name that team in the program charter.
- Where does customer PII already live, and who owns the DPA?
- Which cloud has your SSO, RBAC, and audit log pipeline today? Check Entra conditional access.
- Who on your team can operate the control plane in production?
- Which vendor already appears in your enterprise agreement spend?
- What incident response playbooks exist for that platform? Link AI security controls.
Microsoft / Azure anchor
Choose Azure when Microsoft 365, Entra ID conditional access, and existing Azure landing zones are the programme centre of gravity.
Azure AI Foundry gives a unified portal for models, prompt flow, evaluation, and content safety. That alignment matters when Copilot governance conversations are already underway in IT.
Azure OpenAI deployments fit organisations that need private networking, regional residency, and content filtering integrated with the same tenant as Copilot.
Technical proof should include Entra app registration, diagnostic logs to your SIEM, and a retrieval pipeline tied to approved corpora, not a public playground key. Demo Azure Foundry RAG.
- Foundry evaluation and prompt flow for golden set testing
- Azure AI Search for enterprise retrieval with ACL filters
- Content Safety for input and output scoring
- Copilot coexistence matrix published alongside custom apps
- Private endpoints where network teams require them
AWS anchor
Choose AWS when Bedrock, in-VPC inference, and existing AWS Organisations guardrails are non-negotiable for your data office.
Knowledge Bases, Agents, and Guardrails map cleanly to accounts that already run data lakes, IAM permission boundaries, and CloudTrail centralisation.
Procurement teams often already have AWS marketplace commitments. Aligning the pilot architecture with those contracts reduces duplicate security reviews. Attach evidence to the AWS AI compliance.
Demonstrate one workflow with VPC endpoints, KMS encryption, and Guardrails policies attached to the model invocation, not only a console chat test. Start from Bedrock KB example.
- Bedrock Knowledge Bases with source attribution
- Guardrails for topic denial and PII handling
- Agents with tool allow lists scoped per account
- CloudTrail logging and Config evidence for security packs
- Organisation SCPs for model access in production
Application-layer anchor (AI SDK + gateway)
Choose an AI SDK + gateway path when shipping product quickly matters and you want one AI SDK across OpenAI, Anthropic, and gateway routing.
This anchor suits product teams that already deploy in production and need streaming UX, edge caching, and predictable CI for prompt changes. See streaming agent example.
Pair the AI SDK with AI Gateway when you need provider failover, team key management, and observability hooks without maintaining three separate SDK integrations. Demo unified model gateway.
Enterprise programmes still add a hyperscaler anchor later for procurement. Treat the application layer as the control plane for UX and routing, not a replacement for data residency decisions. Read production readiness conversation.
- Single SDK for streaming chat and structured output via AI SDK intro
- Gateway routing for primary and fallback models
- Environment-scoped keys in your deployment environment
- Eval hooks in CI before prompt promotion
- Clear subprocessors list for legal review
Multi-cloud is a program choice, not a default
Running production pilots on two clouds without a reason doubles security review, key management, and on-call runbooks. Prefer one anchor from this guide, then add gateway failover if needed.
Use multi-cloud comparisons in workshops to teach patterns, then pick one anchor for the first workflow that touches customer data. Teach RAG vs fine-tuning on one stack first.
Regulated entities sometimes require model diversity via a gateway while keeping one application control plane. That is provider failover, not duplicate data planes. Document in production readiness conversation.
If a team insists on parallel production paths, require a written exception with incremental risk accepted by the NIST AI RMF Govern.
- Workshop compare patterns; production picks one anchor
- Gateway failover is not the same as two retrieval indexes
- Document exception owners and review dates
- Avoid duplicate Copilot plus custom app for the same task
- Measure operational cost of two key rotation cycles
Enterprise agreements and procurement
Anchor choice is rarely greenfield. Map existing enterprise agreement spend, marketplace commitments, and who already holds the DPA with each hyperscaler. Attach schedules to the AWS AI compliance.
Procurement will ask for a single throat to call. Align the pilot architecture narrative with the vendor your organisation already audits annually. Reference Microsoft responsible AI where relevant.
Finance needs a forecast that separates Copilot licences from custom API usage. Surprise invoices after pilot expansion erode sponsor trust. Use unified model gateway templates.
Include professional services and support tiers in the comparison, not only per-token pricing from a calculator. Score with OpenAI evals metrics, not list price alone.
- Microsoft-centric EA: Foundry plus Azure OpenAI narrative
- AWS-centred data lake: Bedrock in-VPC narrative
- Product in production: speed narrative with enterprise anchor at scale
- Attach DPA and subprocessor schedules to the AWS AI compliance
- Name the contract owner before architecture sign-off
Technical proof points per anchor
Engineering should demo one workflow end to end on the anchor: identity, logging, retrieval, safety, and cost caps. Follow production readiness conversation checklist items.
Avoid bake-offs that only compare raw model eloquence on public prompts. Score citation rate, unknown-answer discipline, and tool proposal safety on your golden set.
Capture latency p95 and token cost per successful task. Sponsors care whether the workflow fits SLA and budget at expected volume. Forecast with cost controls.
Record a short screen capture of approval, logging, and disable-tools behaviour for the security review pack.
- Golden set eval report checked into version control
- Sample redacted logs exported for InfoSec. Enable Bedrock logging or Foundry monitor.
- Rate limits and spend caps demonstrated live
- Rollback plan for prompt and index changes
- On-call runbook linked from the architecture diagram
Data residency and sovereignty
Australian and regional programmes often require explicit region selection for inference, search indexes, and log storage. Document flows in the Microsoft Copilot data protection.
Document which components may call US-based model APIs even when compute stays in-region. Legal will ask about subprocessors and training use. Review AWS AI compliance and Azure terms.
If data must not leave a VPC, show the network diagram with private endpoints and egress controls, not a verbal assurance. Azure OpenAI deploy docs cover private networking.
Revisit residency when you add a fallback provider through AI Gateway. Failover paths are still data flows.
- Region list per service: model, search, logs, backups
- Subprocessor register updated before production readiness conversation
- Customer notification if regions change
- Test failover in the same residency zone where possible. See production readiness conversation fallback rules.
- Archive data-flow diagram with each release
Workshop questions for sponsors
Sponsors do not need to memorise SKUs. They need crisp answers that connect architecture to risk and spend. Use NIST AI RMF workshop format.
Ask which system of record the workflow writes to, and whether Copilot already covers the summarise-and-draft parts. Publish a coexistence matrix.
Ask what evidence you will show when an answer is challenged in audit or court. Point to RAG citations and security controls.
Close the session with a single anchor decision, one pilot workflow, and a date for AWS AI compliance submission.
- If policy changes tomorrow, how fast can answers update?
- Who approves writes to CRM, billing, or external email? See OpenAI safety best practices.
- What is the monthly API forecast at 2x pilot volume? Use cost controls.
- Which channel uses Copilot versus custom RAG?
- What is the fallback if the primary model is degraded? Test unified model gateway.
What good looks like at decision time
Good looks like a one-page decision record: anchor, regions, identity model, logging destination, and first workflow scope. Store it with the NIST AI RMF Govern.
Good looks like engineering and procurement referencing the same diagram in the security review, not parallel slide decks.
Good looks like reference architecture patterns in vendor documentation mapped to your anchor for workshop teaching, while production config is reviewed on its own merits.
Good looks like a planned second phase for gateway or multi-model diversity only after the first workflow is stable in production.
- Signed anchor choice with exception log
- Security pack draft attached to the pilot charter
- Copilot coexistence matrix published to IT
- Golden set eval baseline frozen before go-live
- Steering committee date for production promotion
Common mistakes
Teams pick a cloud because a vendor gave credits, then discover identity and logging do not fit the enterprise standard. Re-anchor using Well-Architected or AWS ML lens reviews.
Teams run a model beauty contest while retrieval and citations remain unbuilt, which invalidates the bake-off for policy Q&A. Fix RAG foundations first.
Teams promise multi-cloud production on day one to appease every stakeholder, then stall in duplicated security reviews. Escalate exceptions to the NIST AI RMF Govern.
Teams omit cost caps and receive a finance freeze after the first successful demo goes viral internally.
- Choosing anchor after building the wrong integration
- Skipping unknown-answer behaviour in eval rubrics
- Treating Copilot as a replacement for governed RAG
- No fallback model or queue in production design. See unified model gateway.
- Security pack started after code is already in prod