Document Analysis
FDA Guidance Diff
Semantic document comparison that classifies regulatory changes. Validated against law firm analyses: 90-100% accuracy on modern FDA guidance documents.
What do these change types mean?
II. Scope
The scope of applicable devices and submission types has been significantly broadened to include more device characteristics, additional submission types, and devices that do not require a premarket submission.
Before (p. 4)
After (pp. 6-7)
1. Implementation of Security Controls
The new version significantly expands on the requirement for manufacturers to integrate cybersecurity into the design process, explicitly referencing specific CFR sections and detailing the need for design inputs, outputs, and acceptance criteria for security features.
Before (pp. 5-6)
After (pp. 25-26)
B. Designing for Security
The new version expands on the cybersecurity functions by defining specific security objectives (Authenticity, Authorization, Availability, Confidentiality, and Secure and timely updatability and patchability) that manufacturers must address in their device design, moving beyond the general 'Identify, Protect, Detect, Respond, and Recover' framework.
Before (pp. 6-8)
After (p. 11)
I. Introduction
New content: I. Introduction
After (pp. 5-6)
III. Background
New content: III. Background
After (pp. 7-9)
IV. General Principles
New content: IV. General Principles
After (p. 9)
1. A Secure Product Development Framework (SPDF) may be one way to satisfy the
New content: 1. A Secure Product Development Framework (SPDF) may be one way to satisfy the
After (p. 3)
1. A Secure Product Development Framework (SPDF) may be
New content: 1. A Secure Product Development Framework (SPDF) may be
After (pp. 10-11)
C. Transparency
New content: C. Transparency
After (pp. 11-12)
D. Submission Documentation
New content: D. Submission Documentation
After (pp. 12-13)
V. Using an SPDF to Manage Cybersecurity Risks
New content: V. Using an SPDF to Manage Cybersecurity Risks
After (pp. 13-14)
A. Security Risk Management
New content: A. Security Risk Management
After (pp. 14-16)
1. Threat Modeling
New content: 1. Threat Modeling
After (pp. 16-17)
2. Cybersecurity Risk Assessment
New content: 2. Cybersecurity Risk Assessment
After (pp. 17-18)
3. Interoperability Considerations
New content: 3. Interoperability Considerations
After (pp. 18-19)
4. Third-Party Software Components
New content: 4. Third-Party Software Components
After (pp. 19-22)
5. Security Assessment of Unresolved Anomalies
New content: 5. Security Assessment of Unresolved Anomalies
After (p. 22)
6. TPLC Security Risk Management
New content: 6. TPLC Security Risk Management
After (pp. 22-23)
B. Security Architecture
New content: B. Security Architecture
After (pp. 23-25)
2. Security Architecture Views
New content: 2. Security Architecture Views
After (pp. 26-29)
C. Cybersecurity Testing
New content: C. Cybersecurity Testing
After (pp. 29-31)
VI. Cybersecurity Transparency
New content: VI. Cybersecurity Transparency
After (p. 31)
B. Cybersecurity Management Plans
New content: B. Cybersecurity Management Plans
After (p. 34)
VII. Cyber Devices
New content: VII. Cyber Devices
After (pp. 34-35)
A. Who is Required to Comply with Section 524B of the
New content: A. Who is Required to Comply with Section 524B of the
After (p. 35)
2. Design, Develop, and Maintain Processes and Procedures to Provide a Reasonable
New content: 2. Design, Develop, and Maintain Processes and Procedures to Provide a Reasonable
After (pp. 3-5)
C. Documentation Recommendations to Comply with
New content: C. Documentation Recommendations to Comply with
After (p. 36)
1. Plans and Procedures (Section 524B(b)(1))
New content: 1. Plans and Procedures (Section 524B(b)(1))
After (pp. 36-38)
2. Design, Develop, and Maintain Processes and
New content: 2. Design, Develop, and Maintain Processes and
After (pp. 38-39)
3. Software Bill of Materials (SBOM) (Section 524B(b)(3))
New content: 3. Software Bill of Materials (SBOM) (Section 524B(b)(3))
After (p. 39)
D. Modifications
New content: D. Modifications
After (p. 39)
1. Changes That May Impact Cybersecurity
New content: 1. Changes That May Impact Cybersecurity
After (p. 39)
2. Changes Unlikely to Impact Cybersecurity
New content: 2. Changes Unlikely to Impact Cybersecurity
After (pp. 39-40)
E. Reasonable Assurance of Cybersecurity of Cyber
New content: E. Reasonable Assurance of Cybersecurity of Cyber
After (pp. 40-51)
A. Diagrams
New content: A. Diagrams
After (p. 51)
B. Information Details for an Architecture View
New content: B. Information Details for an Architecture View
After (pp. 51-64)
1. Introduction
Removed: 1. Introduction
Before (pp. 3-4)
2. A traceability matrix that links your actual cybersecurity controls to the
Removed: 2. A traceability matrix that links your actual cybersecurity controls to the
Before (p. 8)
3. A summary describing the plan for providing validated software updates and
Removed: 3. A summary describing the plan for providing validated software updates and
Before (p. 8)
5. Device instructions for use and product specifications related to recommended
Removed: 5. Device instructions for use and product specifications related to recommended
Before (p. 8)
7. Recognized Standards
Removed: 7. Recognized Standards
Before (pp. 8-9)
1. CLSI, AUTO11-A - IT Security of In Vitro Diagnostic Instruments and
Removed: 1. CLSI, AUTO11-A - IT Security of In Vitro Diagnostic Instruments and
Before (p. 9)
B. Devices Subject to Section 524B of the FD&C Act
The new version defines 'cyber device' based on the FD&C Act, replacing the previous general definitions of cybersecurity terms.
Before (pp. 4-5)
After (pp. 35-36)
A. Cybersecurity is Part of Device Safety and the Quality
The new version recontextualizes cybersecurity documentation within the broader Quality System Regulation and its relevance at different stages, rather than directly listing specific documentation requirements as in the old version.
Before (p. 8)
After (pp. 9-10)
A. Labeling Recommendations for Devices with
The old version listed specific technical standards related to risk management and network security, while the new version references different standards in a footnote related to cybersecurity risks and labeling recommendations, indicating a shift in focus and organization rather than a direct update of the same list.
Before (p. 9)
After (pp. 31-34)
What do these change types mean?
A. Software Functions
The new version expands on the definition of software functions by introducing and defining 'Artificial Intelligence' and 'Machine Learning' as foundational concepts, and renames 'Machine Learning-Enabled Device Software Function' to 'Artificial Intelligence-Enabled Device Software Function' to reflect this broader scope.
Before (p. 12)
After (p. 12)
I. Introduction
New content: I. Introduction
V. Policy for Predetermined Change Control Plans
New content: V. Policy for Predetermined Change Control Plans
F. Version Control and Maintenance of a PCCP for a Device
New content: F. Version Control and Maintenance of a PCCP for a Device
After (pp. 23-24)
A. Goals of the Description of Modifications Section
New content: A. Goals of the Description of Modifications Section
After (pp. 24-25)
I. Introduction
Removed: I. Introduction
V. Policy for Predetermined Change Control Plans
Removed: V. Policy for Predetermined Change Control Plans
A. Goals of the Description of Modifications Section
Removed: A. Goals of the Description of Modifications Section
Before (p. 20)
II. Background
The new version significantly expands on the definitions provided in the footnotes, introducing new terms like 'AI-enabled device' and 'AI-DSF' and referencing additional guidance documents.
Before (pp. 6-9)
After (pp. 6-9)
III. Scope
The new version expands on the scope by explicitly including a combination of automatic and manual modifications, and clarifies that while the guidance is broadly applicable to AI-DSFs, many recommendations are tailored to ML-DSFs due to FDA's review experience.
Before (pp. 9-12)
After (pp. 9-12)
B. Data Sets
The new version adds a new section for 'Test Data' and expands on the representativeness of training data to include intended environments, while also clarifying the FDA's stance on the term 'validation' in the context of tuning data.
Before (pp. 12-13)
After (pp. 12-13)
V. Policy for Predetermined Change Control Plans
The new version significantly expands on the structure and required content of an authorized Predetermined Change Control Plan (PCCP) by outlining three specific sections: Description of Modifications, Modification Protocol, and Impact Assessment.
Before (pp. 13-14)
After (pp. 14-15)
A. Components of a PCCP
The new version significantly expands on the considerations for developing a PCCP, including specific details about intended use populations and environments, and more explicit requirements for the Description of Modifications and Impact Assessment.
Before (pp. 14-15)
After (p. 15)
B. Establishing a PCCP
The new version significantly expands on the types of marketing submissions appropriate for establishing a PCCP, particularly for AI-DSFs, and clarifies that submission types without an affirmative FDA decision are not suitable.
Before (pp. 15-16)
After (pp. 15-17)
E. Modifying a Previously Authorized PCCP
The new version provides more specific recommendations for manufacturers on how to submit modifications to a PCCP, including providing a summary of changes and considering specific submission types like a special 510(k).
Before (pp. 19-20)
After (pp. 22-23)
VI. Description of Modifications
The new version expands on the description of modifications by adding a reference to appendices that provide examples and scenarios for implementing recommendations.
Before (p. 20)
After (p. 24)
C. Types of Modifications
The new version expands on the types of modifications related to device inputs, providing more specific examples and clarifying the scope of acceptable changes.
Before (pp. 21-22)
After (pp. 26-27)
VII. Modification Protocol
The new version expands on the content of the Modification Protocol by explicitly referencing Appendix A for detailed questions and considerations, and clarifies the QSR requirements for documenting changes.
Before (p. 22)
After (p. 27)
A. Goals of the Modification Protocol Section
The new version expands on the goals of the Modification Protocol by adding requirements for identifying the update process, communication/training plans, and ensuring anticipated risks and mitigation strategies are included in the Impact Assessment.
Before (pp. 22-24)
After (pp. 27-29)
B. Content of the Modification Protocol Section
The new version expands on data management practices by explicitly including 'tuning' data, clarifying data sequestration, and suggesting additional methods for bias mitigation.
Before (pp. 24-27)
After (pp. 29-33)
C. Traceability Between the Description of Modifications
The new version significantly expands on the example traceability table by providing a detailed, multi-column table with specific examples of methods and sections, whereas the old version only provided a textual description of the table's purpose.
Before (pp. 27-28)
After (pp. 33-34)
VIII. Impact Assessment
The new version expands on the requirements for an Impact Assessment by explicitly including 'unintended bias' as a risk to discuss and specifying 'verification and validation activities' within the Modification Protocol, and changing 'collective impact' to 'cumulative impact'.
Before (pp. 28-43)
After (pp. 34-49)
I. Introduction
The new version clarifies the scope of the guidance by stating it's for 'AI-enabled devices' rather than specifically 'machine learning-enabled device software functions (ML-DSFs)', and removes the parenthetical definition of ML-DSFs.
Before (pp. 5-6)
After (pp. 5-6)
C. Predetermined Change Control Plan
The new version clarifies that PCCPs are specifically for AI-DSFs and updates the types of premarket submissions that would otherwise be required.
Before (p. 13)
After (pp. 13-14)
C. Identifying a PCCP in a Marketing Submission
The new version clarifies the required title and version number for the PCCP section and refines the language regarding references to the PCCP section and the assessment criteria.
Before (pp. 16-17)
After (pp. 17-19)
D. Utilizing an Authorized PCCP to Implement Device
The new version clarifies the expectation for manufacturers to follow their authorized PCCP and applicable legal requirements, and explicitly states that FDA considers a modification consistent with the PCCP under specific conditions.
Before (pp. 17-19)
After (pp. 19-22)
B. Content of the Description of Modifications Section
The new version clarifies the reference to labeling changes by specifying 'labeling sections' and providing a more precise section number for the Modification Protocol.
Before (pp. 20-21)
After (pp. 25-26)
IV. Definitions
The line numbers '217' and '218' were removed from the new version.
Before (p. 12)
After (p. 12)
What do these change types mean?
III. Scope
The new version significantly expands the definition of 'software devices' by introducing the concept of 'device software functions' and providing a detailed explanation of what constitutes a 'function' within a product, including examples.
Before (pp. 6-7)
After (pp. 7-9)
3. A cloud-based device algorithm for analyzing previously captured medical images
The new version significantly expands on the concept of architecture design documentation, moving from a brief mention of 'detailed depiction' to providing extensive examples, figures, and guidance on the content and presentation of system and software architecture diagrams.
Before (p. 13)
After (pp. 42-45)
I. Introduction
New content: I. Introduction
After (pp. 4-5)
IV. Definitions
New content: IV. Definitions
After (pp. 9-11)
V. Documentation Level
New content: V. Documentation Level
VI. Recommended Documentation
New content: VI. Recommended Documentation
After (pp. 12-31)
VII. Additional Information Regulatory Considerations for
New content: VII. Additional Information Regulatory Considerations for
After (pp. 31-32)
20. An infusion pump intended for use in a health care facility to pump fluids and
New content: 20. An infusion pump intended for use in a health care facility to pump fluids and
After (pp. 38-42)
1. A hand-held diagnostic device
New content: 1. A hand-held diagnostic device
After (p. 42)
Introduction
Removed: Introduction
Before (p. 5)
The Least Burdensome Approach
Removed: The Least Burdensome Approach
Before (pp. 5-6)
Software-Related Consensus Standards
Removed: Software-Related Consensus Standards
Before (p. 7)
Terminology Verification and Validation
Removed: Terminology Verification and Validation
Before (pp. 7-8)
Minor and Serious Injuries
Removed: Minor and Serious Injuries
Before (p. 8)
Major
Removed: Major
Before (p. 9)
Minor
Removed: Minor
Before (pp. 9-10)
Level of Concern
Removed: Level of Concern
Before (p. 14)
Software Description
Removed: Software Description
Before (pp. 14-15)
Device Hazard Analysis
Removed: Device Hazard Analysis
Before (p. 15)
Software Requirements
Removed: Software Requirements
Before (p. 13)
Summary of
Removed: Summary of
Before (p. 13)
Software Design
Removed: Software Design
Before (p. 13)
No
Removed: No
Before (p. 13)
Traceability Analysis
Removed: Traceability Analysis
Before (pp. 17-18)
Software Development Environment Description No
Removed: Software Development Environment Description No
Before (p. 13)
Annotated list of
Removed: Annotated list of
Before (pp. 13-14)
Documentation
Removed: Documentation
Before (p. 14)
Revision Level History
Removed: Revision Level History
Before (pp. 19-21)
Unresolved Anomalies No
Removed: Unresolved Anomalies No
Before (p. 14)
Software Requirements Specification
Removed: Software Requirements Specification
Before (pp. 15-16)
Hardware Requirements
Removed: Hardware Requirements
Before (p. 16)
Programming Language Requirements
Removed: Programming Language Requirements
Before (p. 16)
Interface Requirements
Removed: Interface Requirements
Before (p. 16)
Software Performance and Functional Requirements
Removed: Software Performance and Functional Requirements
Before (pp. 16-17)
Architecture Design Chart
Removed: Architecture Design Chart
Before (p. 17)
Software Design Specification
Removed: Software Design Specification
Before (p. 17)
Software Development Environment Description
Removed: Software Development Environment Description
Before (p. 18)
Verification and Validation Documentation
Removed: Verification and Validation Documentation
Before (p. 18)
Minor Level of Concern Devices
Removed: Minor Level of Concern Devices
Before (p. 18)
Moderate Level of Concern Devices
Removed: Moderate Level of Concern Devices
Before (pp. 18-19)
Risk Assessment and Management Background
Removed: Risk Assessment and Management Background
Before (p. 21)
Risk Assessment and Level of Concern
Removed: Risk Assessment and Level of Concern
Before (p. 21)
Risk Management
Removed: Risk Management
Before (p. 21)
Software Change Management
Removed: Software Change Management
Before (pp. 21-22)
Virus Protection Software
Removed: Virus Protection Software
Before (pp. 22-23)
2. An implantable therapeutic device with patient- and provider-facing applications
The old version defines 'Moderate' level of concern based on potential for minor injury, while the new version provides an example of a device type. These passages do not cover the same regulatory topic.
Before (p. 9)
After (p. 42)
V. Documentation Level
The concept of 'Level of Concern' has been replaced with 'Documentation Level' (Basic or Enhanced), and the criteria for Enhanced Documentation are more explicitly defined based on probable risk of death or serious injury, assessed prior to risk control measures, making the requirements more stringent.
Before (pp. 8-9)
After (pp. 11-12)
2. A non-contact infrared thermometer intended for intermittent measurement of body
The old version presented a series of questions to determine the level of concern for software devices, while the new version provides specific examples of devices with their descriptions, rationales, and outcomes.
Before (pp. 10-13)
After (pp. 32-38)
II. Background
The 'References' section from the old version has been replaced with a 'Background' section in the new version, which now includes a narrative about the purpose of the guidance and its relation to other documents and legislative changes, rather than just a list of references.
Before (pp. 23-24)
After (pp. 5-7)
How It Works
The pipeline extracts text from FDA guidance PDFs, chunks them by section hierarchy, aligns chunks across document versions using BM25 retrieval, then classifies each change with an LLM. The result is a structured diff that tells you not just what changed, but how: stricter requirements, new content, clarifications, or removals.
Key Technical Decisions
| Component | Choice | Why |
|---|---|---|
| Alignment | BM25 (sparse) | 0.915 MRR on 116 labeled queries. Regulatory terminology is stable across years—keyword matching outperformed embeddings. |
| Chunking | Parent-child | Best MRR (0.862) while preserving document hierarchy. 53 chunks vs 545 for fixed-size baseline. |
| Classification | Gemini 2.5 Flash | Consistent taxonomy output. 100% valid JSON on classification calls. |
| Thresholds | MATCH=15.0, NO_MATCH=5.0 | Tuned on ground truth—scores above 15 reliably indicate same content. |
Validation Results
Validated against law firm analyses (King & Spalding cybersecurity and software premarket reviews).
| Document Pair | Gap | Detection | Type Accuracy |
|---|---|---|---|
| Cybersecurity 2014→2023 | 9 years | 6/6 (100%) | 6/6 (100%) |
| PCCP AI 2023→2024 | 1.5 years | 9/10 (90%) | 9/9 (100%) |
| Software Premarket 2005→2023 | 18 years | 3/4 (75%) | 2/3 (67%) |
Modern FDA documents (post-2010) with numbered section headers achieve 90-100% detection. Older documents with inconsistent formatting are harder—75% detection on the 2005 software guidance.
Change Taxonomy
The system classifies changes into eight types, separating what the LLM determines (matched content) from what alignment determines (unmatched content):
LLM-Classified (Matched Pairs)
- STRICTER — Requirements more stringent
- MORE_LENIENT — Requirements relaxed
- EXPANDED — Same topic, more detail
- CLARIFICATION — Same meaning, clearer
- RESTRUCTURED — Content moved
- EDITORIAL — Cosmetic only
Alignment-Determined (No Match)
- NEW — Content only in newer document
- REMOVED — Content only in older document
Architecture
PDF → FDAChunker (parent_child) → Chunks
↓
Old chunks + New chunks → BM25Index.align_chunks() → ChunkMatches
↓
ChunkMatches → LLM classification → ClassifiedChanges → JSON Interested in learning more?
Check out my other projects or get in touch.