Microsoft 365 DSAR Series
DSAR Redaction for Microsoft 365 Exports Office 365 DSAR Response Guide Teams Chat & Transcript Redaction Exchange Email DSAR Redaction SharePoint & OneDrive DSAR Redaction Purview eDiscovery Exports DSAR Redaction Overview DSAR Redaction CostWhy Teams Data Is the Hardest to Redact
Teams conversations present a unique redaction challenge that most generic tools cannot handle. Unlike a PDF or Word document where personal data appears in running text, Teams exports are structured HTML files where participant names, email addresses, and timestamps are embedded in the markup itself. A single Teams channel thread might contain contributions from 20 different people, each identified by name and email in every message they sent.
The volume compounds the complexity. Active Teams channels generate thousands of messages per month. A DSAR for an employee who participated in multiple channels and group chats over two years can easily produce hundreds of HTML transcript files, each containing dozens or hundreds of third-party references that must be identified and removed.
How Teams Data Exports Work
When you export Teams data through Purview eDiscovery, conversations arrive as HTML files structured with metadata tables at the top (ConversationId, participants, dates) followed by the message content in chronological order. Each message includes the sender's display name, their email address, a timestamp, and the message body. Attachments are referenced by filename and may be included as separate files in the export.
Group chats and channel conversations are exported separately. One-to-one chats appear as individual HTML files per conversation pair. Channel messages are grouped by channel. Meeting chat messages — the sidebar conversation during a Teams meeting — are associated with the meeting event and may appear in both the chat export and the calendar export.
Common pitfall: Teams exports include system-generated messages like meeting join notifications, file sharing alerts, and bot responses. These often contain the names and email addresses of participants and must not be overlooked during redaction.
What Must Be Redacted in Teams Transcripts
For a DSAR response, you must preserve the data subject's messages and any references to them, while redacting personal data belonging to every other participant. In practice, this means removing other participants' display names from message headers, their email addresses from metadata tables, phone numbers or addresses shared in conversation, and any personal content they contributed that does not relate to the data subject.
The nuance is important: if a colleague wrote a message directly about the data subject — for example, discussing a project assignment or forwarding relevant information — the content of that message may need to be included because it constitutes part of the subject's personal data. But the colleague's name, email, and other identifying details should still be redacted. This subject-aware redaction requires contextual understanding that simple find-and-replace cannot provide.
SafeRedact's Teams Transcript Parser
SafeRedact includes a dedicated parser for Teams HTML exports that understands the document structure used by both Purview eDiscovery and Purview compliance exports. The parser identifies the metadata table, extracts participant lists, maps each message to its sender, and applies redaction rules based on the configured data subject identity.
The parser handles the common export formats including the standard Purview eDiscovery HTML format and the Microsoft compliance export format (which uses a different HTML structure with ConversationId, FamilyId, and custodian metadata). Both formats are recognized automatically — no manual configuration is needed.
After processing, SafeRedact produces a redacted plaintext version of the transcript with third-party identifiers replaced by category markers like [NAME] and [EMAIL]. The conversation flow and the data subject's contributions are preserved in the output, creating a readable record that satisfies the data subject's right of access without compromising other individuals' privacy.
Handling Meeting Recordings and Transcripts
Teams meeting recordings and their auto-generated transcripts present additional considerations. Meeting recordings are stored in OneDrive or SharePoint and may contain audio and video of multiple participants. Auto-generated transcripts identify speakers by name throughout the document. Both formats require careful handling for DSAR compliance.
For text-based meeting transcripts, SafeRedact processes speaker attribution labels and redacts non-subject participants while preserving the subject's own contributions. For audio and video recordings, redaction requires specialized multimedia tools beyond document-level processing — organizations should assess whether the recording contains personal data that the subject is entitled to access, and whether disclosure would adversely affect other participants' rights.
Ready to automate your DSAR redaction?
Process thousands of files in minutes instead of weeks.
Enterprise Solutions Try FreeMicrosoft 365 DSAR Series
DSAR Redaction for Microsoft 365 Exports Office 365 DSAR Response Guide Teams Chat & Transcript Redaction Exchange Email DSAR Redaction SharePoint & OneDrive DSAR Redaction Purview eDiscovery Exports DSAR Redaction Overview DSAR Redaction CostMicrosoft, Microsoft 365, Office 365, Teams, SharePoint, Exchange Online, OneDrive, Outlook, and Purview are trademarks of Microsoft Corporation. SafeRedact is not affiliated with or endorsed by Microsoft.