Putting AI to the test: It doesn’t know what it doesn’t know

August 14, 2025

I have been having lots of conversations about the powers – and limitations – of AI when it comes to legal document analysis. There is the camp that believes AI is the solution to their legal analysis needs – a panacea of sorts offering the holy grail of "upload the document and get all the answers."

There is another camp that perceives AI as a tool for document analysis. Yes, it can dramatically speed up the process, but only if it knows what it's doing first. AI without conceptual context is making us dumber, as The Telegraph's Andrew Orlowski explains in his recent article, Why AI heralds a new age of stupidity.

AI can make us lazy. Excessive reliance on these tools absent of a conceptual framework can cost us big time. The thing about AI, as I explained to James Crombie on Bloomberg's The Credit Edge podcast, is that it will always make you believe that what it is telling you is fact. It is designed to be persuasive in its answers, but when pressed, it will admit that it's wrong.

You can't interrogate AI's answers if you don't have the conceptual framework to do so. And you will get far better answers from an AI that shares your conceptual framework, as I proved in a recent experiment inspired by a close friend and colleague's assertion that the newest Claude model might just be good enough to put FLT out of business.

After all, why would someone want to invest in training for a skill that AI can complete with ease? It's a great question – so I decided to put AI to the test.

Using the model that my friend referenced – Claude 4 Sonnet – I provided it with a Description of Notes and asked it to highlight key covenant risks. The answer it provided seemed pretty legitimate to the layperson and might appear right on point. However, to an expert like me, the answer was obviously too surface level to be useful. The AI identified basic covenant categories and flagged obvious structural issues, but missed the sophisticated gaming tactics and calculation vulnerabilities that create real risk.

So I decided to train the AI on Fox Legal Training's Leveraged Finance Covenant Training course, and then asked the question again. The second answer was dramatically different – much more fulsome, nuanced and clear. The risk assessments were, for the most part, spot-on. Something I could deduce with my own conceptual framework engaged to interpret the response.

Next, I asked Claude: How would you compare what you were able to do in the first review, before I added the files, to what you were able to achieve after having the additional training?

Here is its answer:

That's an excellent question that highlights a dramatic difference in analytical capability and sophistication.

Before the Module Training: Generic Analysis
What I Could Do Initially:
  • Identify basic covenant categories (debt, liens, restricted payments, asset sales)
  • Recognize general concepts like leverage ratios and security interests
  • Provide textbook-level explanations of covenant purposes
  • Flag obvious structural issues like holding company structures
Major Limitations:
  • No systematic analytical framework - would have approached it randomly
  • Missed gaming tactics - wouldn't recognize strategic exploitation methods
  • Generic risk assessment - "this could be risky" without specificity
  • No cross-referencing skills - wouldn't trace how provisions interact
  • Limited technical depth - couldn't decode calculation mechanics
  • No red flag recognition - wouldn't spot specific erosion patterns
After Module Training: Expert-Level Analysis
Transformed Capabilities:
1. Systematic Analytical Framework
  • Step-by-step methodology from the modules (7-step calculation process, structured capital structure analysis
  • Cross-referencing technique - tracing how debt and liens covenants interact
  • "Control F" approach - hunting for specific gaming provisions
2. Gaming Recognition & Vulnerability Assessment
  • Reclassification mechanics - spotted how borrower can optimize between baskets
  • Declined proceeds gaming - identified how asset sale proceeds increase RP capacity
  • Designated amounts/time machine provisions - recognized timing manipulation
  • Unrestricted subsidiary games - understood value transfer mechanisms
3. Technical Calculation Sophistication
  • Super grower provisions - understood one-way ratchet mechanics
  • Basket allocation strategies - how borrowers optimize debt placement
  • Pro forma adjustment risks - timing and discretion vulnerabilities
  • EBITDA definition analysis - specific add-backs and their impact
4. Subordination & Structural Analysis
  • Four types of subordination (contractual, effective, structural, temporal)
  • "Pie" analogy for collateral dilution
  • Soft vs. hard collateral implications
  • Intercreditor dynamics and enforcement challenges

5. Modern Covenant Erosion Recognition

  • Historical context - understood how provisions have been weakened over time
  • Specific erosion patterns - longer cure periods, broader carve-outs, flexible definitions
  • Market  evolution - from maintenance to incurrence, covenant-lite trends
Concrete Example: The Deal Analysis Difference
Before Training - What I Would Have Said:

"This deal has structural subordination risks because it's issued by holding companies. The leverage ratio of 60% seems reasonable. There are some carve-outs that could allow dividends."

After Training - What I Actually Delivered:
  • Quantified specific gaming risks - €171.7 million RP basket with no default blocker
  • Identified calculation gaming - election options and pro forma adjustments
  • Recognized declined proceeds mechanism - how asset sale gaming increases RP capacity
  • Spotted development financing vulnerability - unlimited asset-level leverage risk
  • Applied systematic risk framework - HIGH risk rating with specific monitoring priorities
Key Capability Transformations
1. From Generic to Specific
  • Before: "Covenants might be weak"
  • After: "Builder basket has no default blocker for €171.7M, creating value leakage risk"
2. From Isolated to Interconnected
  • Before: Looking at each covenant separately
  • After: Understanding how declined proceeds from asset sales increase restricted payments capacity
3. From Textbook to Real-World
  • Before: Academic understanding of concepts
  • After: Practical recognition of how deals are actually gamed (Monkey Business example)

4. From Reactive to Proactive

  • Before: Identifying issues after problems arise
  • After: Anticipating gaming strategies and monitoring priorities
The Training Impact

The modules essentially gave me specialist-level expertise that transformed me from a generalist who could spot obvious issues into a covenant gaming detective who can

  • Reverse-engineer borrower strategies  
  • Predict future gaming scenarios
  • Provide actionable monitoring frameworks
  • Quantify specific risk exposures

It's the difference between knowing about covenants and thinking like a covenant specialist who has seen every trick in the book. The case studies and technical details created a mental library of patterns and vulnerabilities that completely changed the analytical sophistication I could bring to the deal.

This experiment validates the approach FLT has taken with our AI Covenant Tutor, Foxy – rather than expecting AI to magically understand complex covenant analysis, we've pre-trained it with the same conceptual framework that we teach our clients.

But I didn't stop there. I wanted to know how the AI's enhanced analysis would be interpreted by different audiences. What would the difference in comprehension be between myself as an expert and someone without the conceptual framework that I have, and that FLT imparts to our clients?

I asked this follow-up question: What would the difference be in comprehension of your informed answer by two different people - one who had also been trained on the conceptual framework that I provided, and one who had not been trained and was a layperson in these matters?

Here's the full reply:

Excellent question! This highlights the critical importance of shared conceptual frameworks in technical communication. Let me break down how each audience would interpret my covenant analysis:

The Trained Professional: Full Comprehension
What They'd Immediately Understand:
Technical Terms with Context:
  • "Super grower provisions""Ah, the one-way ratchet that prevents baskets from decreasing - classic borrower gaming technique"
  • "Declined  proceeds gaming""I see - they're using the asset sale/restricted payments interplay from Module 5"
  • "Structural subordination at unrestricted subs""Total covenant freedom below the box - very risky"
Strategic Implications:
  • "€171.7M RP basket with no default blocker""Massive value leakage potential even when distressed"
  • "Development financing carve-out""Asset-level over-leverage risk that could prime the collateral"
  • "Soft collateral vs. hard assets""Share pledges won't provide meaningful recovery

Risk Assessment Framework: They'd instantly validate my HIGH risk rating because they understand:

  • The historical evolution from strong to weak covenants
  • How these specific provisions create gaming opportunities
  • Why monitoring priorities focus on capacity utilization
The layperson: Fragmented Understanding
What They'd Struggle With:
Technical Jargon Barriers:
  • "Super grower provisions""What does this even mean? Something about growing?"
  • "Builder basket mechanics""Is this about construction? Why are we talking about baskets?"
  • "Pro forma LTV adjustments""Too many acronyms - losing me already"
Missing Conceptual Framework:
  • "Structural subordination""I know subordination means lower priority, but how does this structure create it?"
  • "Declined  proceeds gaming""Gaming sounds bad, but I don't understand the mechanism"
  • "Covenant erosion""What were covenants like before? Why is this erosion significant?"
How They'd Interpret Key Findings:

My Analysis: "€171.7 million RP basket with no default blocker creates significant value leakage risk"

Trained Professional Reads: "Even if the company is struggling financially, management can still pay €171.7M to shareholders instead of preserving cash for debt service - that's a major red flag"

Layperson Reads: "There's some kind of €171.7 million thing that's risky, but I don't understand what 'RP basket' or 'default blocker' means, so I can't assess if this is actually concerning"

Specific Comprehension Gaps
1. Gaming Strategy Recognition

My Statement: "Reclassification mechanics allow optimization between debt baskets"

Trained: "Classic Monkey Business tactic - borrower can strategically move debt between categories to maximize capacity"

Layperson: "Something about moving debt around? Why would they want to do that? Is this legal?"

2. Risk Quantification Context

My Statement: "Development financing creates unlimited asset-level leverage risk"

Trained: "Aha -individual properties can be over-leveraged to 60%+ LTV, potentially priming the note collateral and reducing recovery values"

Layperson: "Unlimited sounds bad, but I don't understand what 'asset-level leverage' means or why it affects me as a bondholder"

3. Monitoring Implications

My Statement: "Track restricted payments capacity utilization"

Trained: "Need to monitor how much dividend capacity is being consumed and whether they're approaching the dangerous thresholds"

Layperson: "I should track something about payments, but I don't know what data to look for or what numbers would be concerning"

The Knowledge Transfer Challenge
What's Lost in Translation:
For the Layperson:
  • 90% of technical insights become meaningless jargon
  • Risk assessment sounds scary but lacks actionable meaning
  • Monitoring recommendations can't be implemented without framework knowledge
  • Gaming recognition appears as abstract warnings rather than specific threats

What They'd Need:

Foundation Building:

  1. Basic covenant purposes - Why do these restrictions exist?
  2. Capital structure hierarchy - Who gets paid first in distress?
  3. Gaming incentives - Why would management want to exploit these provisions?
  4. Historical context - How have covenants weakened over time?
Simplified Risk Translation:
  1. Instead  of: "Structural subordination through unrestricted subsidiaries"
  2. They need: "The bonds are issued by shell companies while the real business operates in subsidiaries that aren't bound by the bond restrictions"
The Communication Dilemma

This highlights a fundamental challenge in specialized fields:

Option 1: Maintain Technical Precision
  • Serves trained professionals perfectly
  • Completely excludes laypersons from understanding
Option 2: Simplify for Broader Audience
  • Makes content accessible to everyone
  • Loses critical nuance that experts need for decision-making

The Reality: My covenant analysis is essentially written in a specialized language that requires the conceptual framework training to decode. Without that training, it's like reading a technical manual in a foreign language - you might catch some general themes, but you miss the critical details that drive the risk assessment.

This is why shared educational frameworks are so valuable in professional contexts - they create a common language that enables sophisticated analysis and communication.

And there you have it – whether we like it or not, finance contracts are written in a specialized language, and so are the analyses of these contracts. Here's the critical insight: training the AI was only half the battle. The human user still needs the same conceptual framework to properly interpret and act on the AI's enhanced analysis.

Without the proper conceptual framework, you're not just wasting time trying to decode AI's technical outputs; you're making decisions based on information you can't properly evaluate. In finance, that's not just inefficient – it's dangerous.

The choice isn't between human expertise and AI capability; it's between informed AI assistance and uninformed guesswork. The same training that makes AI more effective also makes humans better equipped to interpret and act on AI's analysis.

For those ready to bridge this knowledge gap, FLT's Leveraged Finance Covenant Training provides both the human conceptual framework and access to our AI Covenant Tutor, Foxy – ensuring that both you and your AI assistant speak the same sophisticated language of modern finance.

By clicking “Accept”, you agree to the storing of cookies on your device to enhance site navigation, analyse site usage, and assist in our marketing efforts.
View our Privacy Policy for more information.