AI Labs by The Ops Toolbox

Business sponsorsTechnical leaders

What Copilot already owns

Microsoft 365 Copilot covers email, calendar, Office documents, Teams meetings, and broad Microsoft Graph search.

Employees often already have licences funded by workplace programmes. Rebuilding generic summarise-my-inbox features rarely returns incremental value. See Copilot adoption resources.

Copilot strength is personal productivity inside M365, not deep integration with CRM, ITSM, or custom policy corpora that require audit-grade citations. Custom RAG fills that gap.

Programme leads should document what Copilot is chartered to do before approving another internal chatbot. Publish alongside the program charter.

Draft and refine email, documents, and presentations
Meeting recap and action suggestions in Teams
Graph-wide search across M365 content user can access
Personal productivity scenarios with Copilot privacy commitments
Not a substitute for cite-only policy corpora. See Azure RAG solution guide.

What custom builds should own

Custom applications win when workflows tie to your systems of record with strict policy. Model them with agent vs workflow patterns where approvals matter.

Examples include CRM writes behind approval, citations to internal policy sets, escalation queues, custom evals, and IAM mapped to your roles.

Position custom work as systems-of-record workflows, not another chatbot in the intranet. Demo Azure Foundry RAG for cite-only Q&A.

Measure incremental value: tasks Copilot cannot complete because Graph scope or connectors do not reach your data. Track in OpenAI evals dashboards.

Regulated Q&A with cite-only answers from approved document sets. See RAG concepts.
Agent proposals that require supervisor approval before execution
ServiceNow (ITSM integration patterns), Salesforce, or internal API actions Copilot does not reach
Golden set evals and prompt versioning in your CI pipeline
Custom retention and logging aligned to InfoSec standards

Copilot Studio vs custom Foundry apps

Copilot Studio fits conversational bots on Teams channels with connector-based actions and lighter governance.

It suits FAQ, simple handoffs, and departmental assistants that do not need full RAG pipelines or complex orchestration.

Custom Foundry and Azure OpenAI applications fit when you need retrieval pipelines, Content Safety scoring, sequential orchestration, and enterprise IAM to your app registration.

Choose Studio when time to channel matters and risk is low. Choose custom when citations, approvals, and eval gates are mandatory.

Studio: faster Teams deployment, extensibility catalogue
Custom: full RAG, ACL filters, eval in CI
Studio: lighter change control acceptable for FAQ
Custom: HITL queues and tool separation
Document choice in the channel matrix

Publish a channel matrix

Employees confuse overlapping tools when every team launches a chat surface. A matrix prevents duplicate OWASP LLM Top 10.

Publish a simple matrix: Copilot for draft and summarise, custom app for policy answers, human queue for exceptions. Link Copilot overview and custom app URLs.

Include examples of allowed and disallowed prompts per channel, especially for HR and legal topics. Align restricted topics with data privacy classes.

Review the matrix quarterly as Copilot features and your custom pilots evolve. Present updates to the NIST AI RMF Govern.

Channel name and URL or Teams entry point
Primary user persona and use cases
Citation required yes or no. Link Azure RAG solution guide for cite-only channels.
Write actions allowed yes or no
Escalation path when confidence is low
Owner team and support contact

Governance alignment

Use the same content safety, retention, and logging rules across Copilot and custom apps where possible.

Document which channels may use general chat versus citation-required RAG. Reference responsible AI overview.

Align with Copilot data protection commitments while adding controls custom apps need beyond Graph defaults.

Risk teams prefer one narrative, not parallel policies that contradict each other. Consolidate in the AWS AI compliance.

Shared definitions for confidential and restricted data
Consistent retention schedule across channels
Single incident response owner for AI issues. See security controls.
Champion network trained on matrix rules
Audit samples from both Copilot logs and custom apps

Licence and spend clarity

Copilot M365 licences are often already in flight as an enterprise programme. Track separately from custom API spend in unified model gateway.

Custom apps add API usage for tokens, search, safety, and storage on top of those licences.

Finance needs a line-item forecast for custom usage, not a surprise invoice after pilot expansion. Forecast with OpenAI evals volume assumptions.

Avoid duplicating Copilot scenarios unless you show incremental value in a system of record Copilot cannot write to. See Azure Well-Architected for spend narrative.

Separate budget lines for M365 Copilot and custom APIs
Chargeback model per business unit for custom usage
Monthly review of token and search spend
Kill duplicate pilots that only repeat Graph search
Sponsor sign-off before expanding custom seats

Identity and access boundaries

Copilot respects Graph permissions the user already has. Custom apps must implement the same discipline via search filters and app roles. Use Entra conditional access.

Do not expose corpora in custom RAG that employees would not see in SharePoint or the source system.

Map Entra ID groups to application roles and retrieval filters. Test with a standard user, not only admins. Demo content safety example.

Document service accounts versus delegated user identity for automation workflows. Align with OpenAI safety best practices for writes.

ACL filters on every retrieval query
App registration with least-privilege scopes
Regular access review for bot service accounts
Break-glass procedure for admin testing
Sample JWT claims in redacted logs for audits

Joint narrative for IT and the business

IT wants fewer bespoke bots to secure and operate. The business wants governed answers inside CRM and ITSM. Frame complementarity in the NIST AI RMF Govern deck.

Frame the programme as complementary: Copilot lifts individual productivity, custom apps automate governed workflows with agent patterns.

Use shared metrics: ticket deflection, time to answer policy questions, approval queue SLA.

Celebrate wins in both channels without forcing a single tool for every task. Retire redundant bots per OWASP LLM Top 10 guidance.

Executive one-pager on complement, not competition
Shared steering committee with IT and business sponsors
Champion network trained on the channel matrix
Quarterly demo of custom workflow plus Copilot tip
Clear criteria to retire redundant custom bots

Workshop scenarios

Run table-top scenarios with HR, legal, sales, and IT. Ask which channel should answer each question. Use NIST AI RMF facilitation timing.

Scenario: employee asks for parental leave policy with mandatory citation. Expected channel: custom RAG. Demo Foundry RAG.

Scenario: manager drafts performance review language. Expected channel: Copilot in Word.

Scenario: agent proposes updating Salesforce opportunity stage. Expected channel: custom app with approval. See meeting-to-actions example.

Policy Q&A with citations. Score with golden evals.
Draft customer email tone
ITSM ticket summarisation for agent
Procurement clause comparison
Meeting actions pushed to task system

What good looks like

Good looks like employees knowing which door to knock on without a survey of fifteen chat links. The channel matrix lives on the intranet next to trust materials.

Good looks like risk reviewing one coherent control set with channel-specific attachments in the AWS AI compliance.

Good looks like custom apps passing the same eval gates you would require for any production AI.

Good looks like Copilot licence spend visible alongside custom API forecast in the same program dashboard.

Published channel matrix on the intranet
No duplicate policy bots without exception
Custom app in security pack with diagrams
Champions cite matrix in onboarding
Retired pilots removed from Teams app bar

Common mistakes

Teams rebuild M365 summarise features and annoy users who already have Copilot. Focus custom work on systems of record.

Teams deploy custom policy bots without citations, creating a second fluent wrong answer surface. Require RAG cite-only behaviour.

Teams skip the matrix and wonder why adoption fragments across shadow tools. Log duplicates in OWASP LLM Top 10.

Teams fight IT instead of aligning on systems-of-record scope for custom work. Escalate through the NIST AI RMF Govern.

Another general chatbot on the homepage
RAG index broader than user entitlements
No approval on write tools
Competing messages from IT and business sponsors
Licence cost surprise for custom APIs

Coexisting with Microsoft Copilot

What Copilot already owns

What custom builds should own

Copilot Studio vs custom Foundry apps

Publish a channel matrix

Governance alignment

Licence and spend clarity

Identity and access boundaries

Joint narrative for IT and the business

Workshop scenarios

What good looks like

Common mistakes

Plan your next pilot

Coexisting with Microsoft Copilot

Executive summary

What Copilot already owns

What custom builds should own

Copilot Studio vs custom Foundry apps

Publish a channel matrix

Governance alignment

Licence and spend clarity

Identity and access boundaries

Joint narrative for IT and the business

Workshop scenarios

What good looks like

Common mistakes

Plan your next pilot