Discover shadow AI and LLM applications across the organization with Defender for Cloud Apps. Assess AI app risk scores, configure session policies for AI tools, and monitor sensitive data shared with AI services.
This lab covers AI App Governance using Microsoft Defender for Cloud Apps: the practice of discovering, assessing, and controlling AI and large language model (LLM) applications across your organization. With the rapid adoption of generative AI, employees often use tools like ChatGPT, Google Gemini, Claude, and dozens of other AI services without IT approval, creating shadow AI risks. You will learn how to use Cloud Discovery to identify AI app usage, assess risk scores based on data handling and compliance posture, enforce session policies with Conditional Access App Control to block sensitive data uploads to AI services in real time, and build governance policies that balance productivity with data protection.
A technology consulting firm with 8,000 employees discovers that over 400 employees are using unauthorized AI tools including ChatGPT, Claude, and various AI coding assistants. Employees are pasting client code, financial projections, and legal documents into these tools. The security team needs to discover all AI app usage, assess risks, block unauthorized data sharing, and establish an approved AI app catalogue with proper governance controls.
In Defender for Cloud Apps, navigate to Cloud Discovery > Discovered apps. Filter by the Generative AI category to see all AI applications detected in network traffic. Review the risk score, user count, data volume, and compliance certifications for each AI app.
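Alongside the portal view, the same discovery signal can be cross-checked in advanced hunting. A minimal sketch, assuming `CloudAppEvents` is populated for your tenant; the app-name list is illustrative and should be extended to match the apps discovered in your environment:

```kusto
// Sketch: enumerate AI app activity from CloudAppEvents as a cross-check
// on the Cloud Discovery portal view. App names below are examples only.
CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Copilot", "Perplexity")
| summarize Users=dcount(AccountId), Events=count(),
    FirstSeen=min(TimeGenerated), LastSeen=max(TimeGenerated) by Application
| order by Users desc
```

Note this only covers apps that generate `CloudAppEvents` telemetry; the portal's Cloud Discovery view, which is fed by network traffic logs, remains the authoritative inventory.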
Review each AI application's data handling policies, training data usage terms, and compliance certifications. Tag apps as Sanctioned (approved for use), Unsanctioned (blocked), or Under Review. Create app policies to automatically block unsanctioned AI applications at the network level.
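The sanctioned/unsanctioned tagging can also be verified from the hunting side. A sketch, assuming a hand-maintained sanctioned list; the app names in the `datatable` are hypothetical placeholders and should mirror the tags you actually applied in Defender for Cloud Apps:

```kusto
// Sketch: flag recent activity in AI apps that are NOT on the sanctioned
// list. The sanctioned list below is a hypothetical example.
let SanctionedAIApps = datatable(Application: string)
[
    "Microsoft Copilot", "GitHub Copilot"   // example sanctioned apps
];
CloudAppEvents
| where TimeGenerated > ago(7d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Midjourney", "Copilot")
| join kind=leftanti SanctionedAIApps on Application   // keep only unsanctioned apps
| summarize Events=count(), Users=dcount(AccountId) by Application
| order by Users desc
```

The `leftanti` join keeps only rows whose `Application` has no match in the sanctioned list, giving you the residual unsanctioned usage to remediate.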
Deploy Conditional Access App Control to proxy AI application sessions and enforce real-time policies that monitor and block sensitive data uploads.
Name the session policy `CA-AI-Session-Control`. Use the session monitoring dashboard to observe live user sessions with AI applications and identify sensitive data being shared in real time.
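The live dashboard can be backed by a hunting query over the same signal. A sketch, assuming `SessionId` is populated in `CloudAppEvents` for proxied sessions; the exact action types emitted for blocked activity vary, so verify them in your tenant with `CloudAppEvents | distinct ActionType` first:

```kusto
// Sketch: recent AI app sessions per user, for review alongside the
// live session monitoring dashboard.
CloudAppEvents
| where TimeGenerated > ago(24h)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Copilot")
| summarize Sessions=dcount(SessionId), Actions=count(),
    ActionTypes=make_set(ActionType, 10) by AccountId, Application
| order by Actions desc
```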
// ---------------------------------------------------------------
// PURPOSE: Find all file uploads to AI and LLM services in the last 7 days.
// WHY: Employees may upload sensitive documents, source code, or
// customer data to AI tools without realising the data-loss risk.
// This query surfaces who is uploading, how much, and to which AI app.
// HOW TO USE: Run in Defender XDR > Hunting > Advanced hunting,
// or in Microsoft Sentinel if CloudAppEvents is ingested.
// OUTPUT COLUMNS:
// Application - the AI service name (ChatGPT, Claude, Gemini, etc.)
// UploadCount - total file uploads in the 7-day window
// UniqueUsers - distinct users uploading to that app
// TotalBytes - total data volume uploaded (in bytes)
// INTERPRETATION:
// High UniqueUsers with low UploadCount per user = normal exploration.
// One user with very high UploadCount = potential data exfiltration.
// Large TotalBytes to consumer AI apps = high-priority investigation.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(7d)
| where ActionType == "FileUploaded"
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Midjourney", "Copilot")
| summarize UploadCount=count(), UniqueUsers=dcount(AccountId),
TotalBytes=sum(FileSize) by Application
| order by UploadCount desc

Configure session policies that detect and block the paste of sensitive content into AI chat interfaces in real time.
Extend Microsoft Purview DLP policies to cover data shared with AI applications through Defender for Cloud Apps integration. Name the policy `DLP-AI-Data-Protection`.

Create anomaly detection policies that alert when users transfer unusually large volumes of data to AI platforms. Name the policy `Bulk-Data-Transfer-AI`.

// ---------------------------------------------------------------
// PURPOSE: Detect users who are uploading unusually large volumes
// of data to AI services in short time windows (30 min).
// WHY: Bulk uploads to AI platforms are a strong indicator of either
// data exfiltration (intentional) or risky copy-paste habits
// (unintentional). Normal use is a few queries; 20+ uploads
// in 30 minutes suggests automated extraction or batch pasting.
// THRESHOLDS:
// UploadCount > 20 - more than 20 file uploads in a 30-min window
// TotalMB > 50 - more than 50 MB transferred in a 30-min window
// Adjust thresholds based on your organisation's baseline.
// OUTPUT COLUMNS:
// TimeGenerated - the 30-minute window when the activity occurred
// AccountId - the user performing the uploads (UPN or object ID)
// Application - which AI service received the data
// UploadCount - number of uploads in that window
// TotalMB - total data volume in megabytes
// ACTION: Investigate high-volume users; check if sensitive data
// (source code, financial docs, PII) was uploaded.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(24h)
| where ActionType == "FileUploaded"
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Copilot")
| summarize UploadCount=count(), TotalMB=round(sum(FileSize)/1048576.0, 2)
by AccountId, Application, bin(TimeGenerated, 30m)
| where UploadCount > 20 or TotalMB > 50
| project TimeGenerated, AccountId, Application, UploadCount, TotalMB
| order by TotalMB desc

Connect AI app usage signals to Insider Risk Management to detect employees who may be using AI tools to exfiltrate intellectual property or prepare for departure.
Create a dashboard in Microsoft Sentinel that visualises AI app usage trends, policy violations, and risk metrics across the organisation.
// ===============================================================
// AI APP USAGE DASHBOARD - 4 tiles for a Sentinel workbook or
// Defender XDR custom detection dashboard.
// Run each query as a separate tile/visualisation.
// ===============================================================
// ---------------------------------------------------------------
// TILE 1: AI app usage summary by application (last 30 days).
// Shows which AI apps are most popular by session count and user count.
// Sessions = total interactions; Users = distinct people using the app.
// Use a bar chart: X=Application, Y=Users.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Midjourney",
"Copilot", "Perplexity", "GitHub Copilot")
| summarize Sessions=dcount(SessionId), Users=dcount(AccountId) by Application
| order by Users desc
// ---------------------------------------------------------------
// TILE 2: Daily AI usage trend over 30 days.
// Shows adoption growth or decline. A sharp spike may indicate
// a viral AI tool spreading through a department.
// Use a line chart: X=TimeGenerated (daily), Y=DailyUsers.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Midjourney")
| summarize DailyUsers=dcount(AccountId) by bin(TimeGenerated, 1d)
| order by TimeGenerated asc
// ---------------------------------------------------------------
// TILE 3: Policy violations by policy name and application.
// Shows which DLP/session policies are triggering most often.
// High violation counts may indicate overly broad policies (tune)
// or genuine data protection issues (investigate).
// Use a stacked bar chart: X=PolicyName, Y=Violations, stack by Application.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(30d)
| where ActionType == "PolicyViolation"
| where Application has_any ("ChatGPT", "Claude", "Gemini")
| summarize Violations=count() by PolicyName, Application
| order by Violations desc
// ---------------------------------------------------------------
// TILE 4: Top 10 departments using AI apps.
// Joins CloudAppEvents with IdentityInfo to resolve department names.
// Helps target AI training and governance to the heaviest-using teams.
// Use a horizontal bar chart: X=Users, Y=Department.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ChatGPT", "Claude", "Gemini")
// Join on AccountObjectId: CloudAppEvents' AccountId is not always a UPN.
// distinct avoids row duplication from IdentityInfo's periodic snapshots.
| join kind=inner (IdentityInfo | distinct AccountObjectId, Department) on AccountObjectId
| summarize Users=dcount(AccountId) by Department
| top 10 by Users

Create governance policies that define acceptable use for sanctioned AI applications, including data classification, session duration, and acceptable use cases.
Detect when developers and users connect AI services via OAuth or API integrations that grant access to organisational data.
// ---------------------------------------------------------------
// PURPOSE: Monitor OAuth consent events for AI applications to
// detect when users or admins grant AI apps access to
// organisational data (email, files, calendar).
// WHY: A single OAuth consent to a malicious AI app can expose
// your entire tenant's data. This query surfaces all consent
// events for AI-related apps so you can review what permissions
// were granted and by whom.
// OUTPUT COLUMNS:
// TimeGenerated - when the consent was granted
// AccountId - which user clicked "Allow"
// Application - the AI app name
// ConsentType - "AllPrincipals" (admin consent) or "Principal" (user)
// Permissions - the scopes granted (e.g. Mail.Read, Files.ReadWrite.All)
// RED FLAGS: Admin consent to unrecognised AI apps; user consent
// granting Mail.ReadWrite or Files.ReadWrite.All to any AI app.
// ---------------------------------------------------------------
CloudAppEvents
| where TimeGenerated > ago(30d)
| where ActionType == "Consent to application"
| where Application has_any ("ChatGPT", "Claude", "Gemini", "OpenAI",
"Anthropic", "ai", "copilot")
| project TimeGenerated, AccountId, Application,
ConsentType=tostring(RawEventData["ConsentType"]),
Permissions=tostring(RawEventData["Permissions"])
| order by TimeGenerated desc

Generate compliance reports documenting AI app usage, policy violations, and remediation actions for audit and regulatory purposes.
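A per-month rollup for the compliance report can be drafted in hunting. A sketch, assuming policy violations surface as `ActionType == "PolicyViolation"` as in the earlier queries; adjust the action type and time window to your tenant and reporting cycle:

```kusto
// Sketch: monthly AI governance rollup for compliance reporting.
CloudAppEvents
| where TimeGenerated > ago(90d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Copilot")
| summarize Events=count(), Users=dcount(AccountId),
    Violations=countif(ActionType == "PolicyViolation")
    by Month=startofmonth(TimeGenerated)
| order by Month asc
```

Export the result per reporting period and pair it with the remediation actions recorded in the governance log for each violation.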
Establish a curated catalogue of approved AI tools with usage guidelines, data handling requirements, and acceptable use policies.
Establish a recurring governance process to review AI app risks, update policies, and adapt to new AI tools entering the market.
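The recurring review can be fed by a "newly seen AI app" check. A sketch using a deliberately broad term match that will need tuning against your discovered-apps list to avoid false positives:

```kusto
// Sketch: AI-related apps whose first recorded activity falls in the
// last 7 days - candidates for the next governance review.
// The "ai"/"gpt"/"copilot" term match is intentionally broad; expect
// noise and refine it for your environment.
CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ai", "gpt", "copilot", "gemini", "claude")
| summarize FirstSeen=min(TimeGenerated), Users=dcount(AccountId) by Application
| where FirstSeen > ago(7d)
| order by FirstSeen desc
```

Note the 30-day window bounds what "first seen" means here: an app used 60 days ago and again yesterday would appear as new. For a true first-seen baseline, widen the window or keep a watchlist of previously reviewed apps.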
Synthesise all the controls implemented in this lab into a comprehensive AI governance framework that aligns with regulatory requirements and industry best practices.
// ---------------------------------------------------------------
// PURPOSE: Generate an executive-level AI governance summary showing
// total sessions, unique users, apps used, violations, and
// the overall violation rate across all AI platforms.
// WHY: Leadership needs a single-number view of AI risk. The violation
// rate (violations / total sessions * 100) tells you what percentage
// of AI interactions triggered a DLP or session policy - a key
// metric for tracking governance effectiveness over time.
// HOW IT WORKS:
// 1. ai_apps: calculates total sessions, unique users, unique apps
// 2. violations: calculates total policy violations and violating users
// 3. Join on a dummy key (d=1) to combine into one summary row
// OUTPUT COLUMNS:
// TotalSessions - total AI app interactions in 30 days
// UniqueUsers - distinct employees using AI apps
// UniqueApps - number of different AI apps in use
// TotalViolations - DLP/session policy violations triggered
// UsersViolating - distinct users who triggered at least one violation
// ViolationRate - percentage of sessions that violated policy
// INTERPRETATION: ViolationRate < 1% = policies working well.
// ViolationRate > 5% = policies may be too broad (tune) or users
// need more training on acceptable AI use.
// ---------------------------------------------------------------
let ai_apps = CloudAppEvents
| where TimeGenerated > ago(30d)
| where Application has_any ("ChatGPT", "Claude", "Gemini", "Midjourney", "Copilot")
| summarize TotalSessions=count(), UniqueUsers=dcount(AccountId),
UniqueApps=dcount(Application);
let violations = CloudAppEvents
| where TimeGenerated > ago(30d)
| where ActionType == "PolicyViolation"
| where Application has_any ("ChatGPT", "Claude", "Gemini")
| summarize TotalViolations=count(), UsersViolating=dcount(AccountId);
ai_apps | extend d=1
| join kind=inner (violations | extend d=1) on d
| project TotalSessions, UniqueUsers, UniqueApps, TotalViolations, UsersViolating,
    ViolationRate=round(TotalViolations*100.0/TotalSessions, 2)

| Resource | Description |
|---|---|
| Manage OAuth apps in Defender for Cloud Apps | Review and control third-party app permissions |
| Session policies | Configure real-time session controls for cloud apps |
| Working with discovered apps | Review and manage apps found by Cloud Discovery |
| Learn about data loss prevention | DLP policies for protecting sensitive data across locations |
| Configure Insider Risk Management | Set up insider risk policies and indicators |