For business sponsors
Executive summary
Skim this before the full guide. Technical detail follows in the sections below.
- Decision
- What employees use Copilot for vs a governed custom application.
- Primary metric
- Incremental value in systems of record Copilot cannot write to safely.
- Stop rule
- Stop duplicating M365 scenarios unless you can show measurable CRM/ITSM lift.
Related worked example
Enterprise RAG on Azure AI FoundryNeed facilitation on this topic? Start a conversation.
What Copilot already owns
Microsoft 365 Copilot covers email, calendar, Office documents, Teams meetings, and broad Microsoft Graph search.
Employees often already have licences funded by workplace programmes. Rebuilding generic summarise-my-inbox features rarely returns incremental value. See Copilot adoption resources.
Copilot strength is personal productivity inside M365, not deep integration with CRM, ITSM, or custom policy corpora that require audit-grade citations. Custom RAG fills that gap.
Programme leads should document what Copilot is chartered to do before approving another internal chatbot. Publish alongside the program charter.
- Draft and refine email, documents, and presentations
- Meeting recap and action suggestions in Teams
- Graph-wide search across M365 content user can access
- Personal productivity scenarios with Copilot privacy commitments
- Not a substitute for cite-only policy corpora. See Azure RAG solution guide.
What custom builds should own
Custom applications win when workflows tie to your systems of record with strict policy. Model them with agent vs workflow patterns where approvals matter.
Examples include CRM writes behind approval, citations to internal policy sets, escalation queues, custom evals, and IAM mapped to your roles.
Position custom work as systems-of-record workflows, not another chatbot in the intranet. Demo Azure Foundry RAG for cite-only Q&A.
Measure incremental value: tasks Copilot cannot complete because Graph scope or connectors do not reach your data. Track in OpenAI evals dashboards.
- Regulated Q&A with cite-only answers from approved document sets. See RAG concepts.
- Agent proposals that require supervisor approval before execution
- ServiceNow (ITSM integration patterns), Salesforce, or internal API actions Copilot does not reach
- Golden set evals and prompt versioning in your CI pipeline
- Custom retention and logging aligned to InfoSec standards
Copilot Studio vs custom Foundry apps
Copilot Studio fits conversational bots on Teams channels with connector-based actions and lighter governance.
It suits FAQ, simple handoffs, and departmental assistants that do not need full RAG pipelines or complex orchestration.
Custom Foundry and Azure OpenAI applications fit when you need retrieval pipelines, Content Safety scoring, sequential orchestration, and enterprise IAM to your app registration.
Choose Studio when time to channel matters and risk is low. Choose custom when citations, approvals, and eval gates are mandatory.
- Studio: faster Teams deployment, extensibility catalogue
- Custom: full RAG, ACL filters, eval in CI
- Studio: lighter change control acceptable for FAQ
- Custom: HITL queues and tool separation
- Document choice in the channel matrix
Publish a channel matrix
Employees confuse overlapping tools when every team launches a chat surface. A matrix prevents duplicate OWASP LLM Top 10.
Publish a simple matrix: Copilot for draft and summarise, custom app for policy answers, human queue for exceptions. Link Copilot overview and custom app URLs.
Include examples of allowed and disallowed prompts per channel, especially for HR and legal topics. Align restricted topics with data privacy classes.
Review the matrix quarterly as Copilot features and your custom pilots evolve. Present updates to the NIST AI RMF Govern.
- Channel name and URL or Teams entry point
- Primary user persona and use cases
- Citation required yes or no. Link Azure RAG solution guide for cite-only channels.
- Write actions allowed yes or no
- Escalation path when confidence is low
- Owner team and support contact
Governance alignment
Use the same content safety, retention, and logging rules across Copilot and custom apps where possible.
Document which channels may use general chat versus citation-required RAG. Reference responsible AI overview.
Align with Copilot data protection commitments while adding controls custom apps need beyond Graph defaults.
Risk teams prefer one narrative, not parallel policies that contradict each other. Consolidate in the AWS AI compliance.
- Shared definitions for confidential and restricted data
- Consistent retention schedule across channels
- Single incident response owner for AI issues. See security controls.
- Champion network trained on matrix rules
- Audit samples from both Copilot logs and custom apps
Licence and spend clarity
Copilot M365 licences are often already in flight as an enterprise programme. Track separately from custom API spend in unified model gateway.
Custom apps add API usage for tokens, search, safety, and storage on top of those licences.
Finance needs a line-item forecast for custom usage, not a surprise invoice after pilot expansion. Forecast with OpenAI evals volume assumptions.
Avoid duplicating Copilot scenarios unless you show incremental value in a system of record Copilot cannot write to. See Azure Well-Architected for spend narrative.
- Separate budget lines for M365 Copilot and custom APIs
- Chargeback model per business unit for custom usage
- Monthly review of token and search spend
- Kill duplicate pilots that only repeat Graph search
- Sponsor sign-off before expanding custom seats
Identity and access boundaries
Copilot respects Graph permissions the user already has. Custom apps must implement the same discipline via search filters and app roles. Use Entra conditional access.
Do not expose corpora in custom RAG that employees would not see in SharePoint or the source system.
Map Entra ID groups to application roles and retrieval filters. Test with a standard user, not only admins. Demo content safety example.
Document service accounts versus delegated user identity for automation workflows. Align with OpenAI safety best practices for writes.
- ACL filters on every retrieval query
- App registration with least-privilege scopes
- Regular access review for bot service accounts
- Break-glass procedure for admin testing
- Sample JWT claims in redacted logs for audits
Joint narrative for IT and the business
IT wants fewer bespoke bots to secure and operate. The business wants governed answers inside CRM and ITSM. Frame complementarity in the NIST AI RMF Govern deck.
Frame the programme as complementary: Copilot lifts individual productivity, custom apps automate governed workflows with agent patterns.
Use shared metrics: ticket deflection, time to answer policy questions, approval queue SLA.
Celebrate wins in both channels without forcing a single tool for every task. Retire redundant bots per OWASP LLM Top 10 guidance.
- Executive one-pager on complement, not competition
- Shared steering committee with IT and business sponsors
- Champion network trained on the channel matrix
- Quarterly demo of custom workflow plus Copilot tip
- Clear criteria to retire redundant custom bots
Workshop scenarios
Run table-top scenarios with HR, legal, sales, and IT. Ask which channel should answer each question. Use NIST AI RMF facilitation timing.
Scenario: employee asks for parental leave policy with mandatory citation. Expected channel: custom RAG. Demo Foundry RAG.
Scenario: manager drafts performance review language. Expected channel: Copilot in Word.
Scenario: agent proposes updating Salesforce opportunity stage. Expected channel: custom app with approval. See meeting-to-actions example.
- Policy Q&A with citations. Score with golden evals.
- Draft customer email tone
- ITSM ticket summarisation for agent
- Procurement clause comparison
- Meeting actions pushed to task system
What good looks like
Good looks like employees knowing which door to knock on without a survey of fifteen chat links. The channel matrix lives on the intranet next to trust materials.
Good looks like risk reviewing one coherent control set with channel-specific attachments in the AWS AI compliance.
Good looks like custom apps passing the same eval gates you would require for any production AI.
Good looks like Copilot licence spend visible alongside custom API forecast in the same program dashboard.
- Published channel matrix on the intranet
- No duplicate policy bots without exception
- Custom app in security pack with diagrams
- Champions cite matrix in onboarding
- Retired pilots removed from Teams app bar
Common mistakes
Teams rebuild M365 summarise features and annoy users who already have Copilot. Focus custom work on systems of record.
Teams deploy custom policy bots without citations, creating a second fluent wrong answer surface. Require RAG cite-only behaviour.
Teams skip the matrix and wonder why adoption fragments across shadow tools. Log duplicates in OWASP LLM Top 10.
Teams fight IT instead of aligning on systems-of-record scope for custom work. Escalate through the NIST AI RMF Govern.
- Another general chatbot on the homepage
- RAG index broader than user entitlements
- No approval on write tools
- Competing messages from IT and business sponsors
- Licence cost surprise for custom APIs