Progressive Context Loading

The Problem: Loading Everything Upfront

Now that you understand context windows and context rot, let's address the most common beginner mistake.

A Common Mistake

New Developer's Approach:

# Session start
"Read all files in my project:
- All 50 files in src/
- All 30 test files
- All documentation
- Configuration files
- Everything!"

What happens?

Context window fills to 80%+ immediately
By interaction 3-5, context rot starts
AI slows down drastically
Performance degrades before you've even started building
Most of that context isn't even relevant yet!

The Result: You've wasted your context budget before doing any actual work.

Why This Doesn't Work

Think about learning a new subject:

Bad Learning Approach: "Let me read the entire 500-page textbook cover-to-cover before starting the first exercise."

Result: Information overload, can't remember most of it, gets confused.

Good Learning Approach: "Let me read the chapter introduction, then the relevant section, then try the exercise."

Result: Focused learning, better retention, can reference back as needed.

Progressive context loading is the "good learning approach" for AI agents.

What is Progressive Context Loading?

WHAT: The Strategy

Progressive Context Loading is the practice of loading information into your AI's context as you need it, not all at once.

Core Principle:

Load the minimum context necessary for the current task, then expand as needed.

The Three-Phase Approach

Think of it like exploring a new building:

Phase 1: Overview (Understand the Layout)

Get the "lay of the land"
High-level structure
Don't dive into details yet

Phase 2: Module Focus (Enter the Relevant Room)

Load only the specific area you're working in
Understand patterns and conventions
Still don't load unrelated sections

Phase 3: Deep Dive (Work on Your Specific Task)

Now you're ready to implement
Context is focused and relevant
No wasted space on irrelevant information

Visual Comparison

All-at-Once Loading:

Context Window: [████████████████░░░░] 80% full immediately
Available Space: 20%
Time Until Rot: 5-10 interactions

Progressive Loading:

Context Window: [████░░░░░░░░░░░░░░░░] 20% full after Phase 1
Available Space: 80%
Time Until Rot: 25-30+ interactions

The difference: Progressive loading gives you 3-5x more working room!

WHY: Why Progressive Loading Works

Three Key Benefits

1. Keeps Context Focused

Only relevant information is loaded
AI doesn't get "distracted" by irrelevant code
Faster, more accurate responses

2. Prevents Early Context Rot

Context fills gradually, not immediately
You get much longer productive sessions
More work done before needing to refresh

3. Matches Your Workflow

You don't need to know everything at once anyway
Load context just-in-time as tasks evolve
Natural progression mirrors how humans work

Real-World Analogy

All-at-Once: Like bringing every tool from your toolbox to a job site. Most tools sit unused, get in your way, and you waste energy carrying them.

Progressive: Like bringing only the tools you need for each phase. Hammer and nails for framing, then paintbrush and paint for finishing. Efficient and focused.

HOW: Implementing Progressive Loading

Let's walk through a complete real-world example.

Scenario: Joining a New Project

You've just joined a Python FastAPI project with 50+ files. You need to add a new feature: "User Profile Management."

🚫 The Wrong Way (All-at-Once)

claude "Read all files in src/, tests/, and docs/ so you 
understand the entire project"

Result: Context immediately 75% full, most information not relevant to your task.

✅ The Right Way (Progressive Loading)

Phase 1: High-Level Overview

Goal: Understand project structure without reading file contents.

claude "Analyze the directory structure of this project without 
reading file contents yet. Tell me:

1. What are the main directories?
2. What appears to be the general architecture?
3. Where would user-related features likely be located?
4. What testing structure is being used?

Just analyze the structure—don't read files yet."

What the AI Does:

Lists directories (sees src/api/, src/services/, src/models/, tests/)
Infers architecture (layered: API → Service → Model)
Identifies user-related files are probably in src/services/user_service.py
Notes testing structure (pytest, tests/ mirrors src/)

Context Used: ~1,000 tokens (just directory listings and your conversation)

Why This Works: You now understand the landscape without filling context with actual code.

Phase 2: Module Focus

Goal: Understand the specific area relevant to your task.

claude "Now I need to understand how user-related features work.

Discover and analyze:
1. Find the User model - what fields and relationships exist?
2. Find user service patterns - how do we structure business logic?
3. Find user API endpoints - how do we handle routes?

For each, tell me:
- What file you found
- What patterns you observed
- What conventions we follow

Then summarize: How should I structure new services in this project?"

What the AI Does:

Reads ONLY the 3 files needed
Understands your service pattern (constructor pattern, async methods, error handling)
Notes your code style (type hints, docstrings, custom exceptions)
Sees how API routes connect to services

Context Used: ~5,000 tokens (3 files + conversation)

Total Context So Far: ~6,000 tokens (about 3% of Claude's 200K window!)

Why This Works: AI now knows your patterns without loading 50 files.

Phase 3: Task-Specific Implementation

Goal: Build the new feature with full context of relevant patterns.

claude "Following the exact patterns I saw in user_service.py, 
create a new profile management feature:

1. Add profile fields to User model (bio, avatar_url, social_links)
2. Create ProfileService following the UserService pattern
3. Add API endpoints for profile CRUD operations
4. Follow the same error handling and async patterns

Use ONLY our existing patterns—don't introduce new dependencies."

What the AI Does:

Implements ProfileService matching your UserService structure exactly
Uses your established patterns for models, services, routes
Follows your code style (type hints, docstrings, error handling)
Stays within existing dependency set

Context Used: ~10,000 tokens (previous context + new conversation)

Total Context: ~16,000 tokens (about 8% of context window!)

Why This Works: You've built a complete feature using only 8% of available context, leaving 92% for iteration, debugging, and expansion.

The Progression Visualized

Phase 1: Overview
├─ Context: 1% full
└─ Knowledge: Project structure, architecture pattern

Phase 2: Module Focus  
├─ Context: 3% full
└─ Knowledge: Specific patterns, code style, conventions

Phase 3: Implementation
├─ Context: 8% full
└─ Knowledge: Everything needed to build feature perfectly

Remaining capacity: 92% for refinement, testing, documentation!

Practical Guidelines

When to Use Each Phase

Phase 1 - Overview (Always start here)

✓ Starting a new project
✓ Joining an existing codebase
✓ Haven't worked on project in days/weeks
✓ Exploring unfamiliar architecture

Phase 2 - Module Focus (After you know the landscape)

✓ About to add a feature
✓ Need to understand a specific subsystem
✓ Fixing a bug in particular module
✓ Learning project patterns

Phase 3 - Implementation (When you're ready to build)

✓ Clear task in mind
✓ Understand relevant patterns
✓ Ready to write/modify code
✓ Have context for current work

How Much to Load in Each Phase

Phase 1 Guidelines:

Just directory structures
Key documentation (README.md)
Architecture overview docs
Don't: Read actual code files yet

Phase 2 Guidelines:

2-5 example files showing patterns
Relevant service/model files
Related configuration
Don't: Load unrelated modules

Phase 3 Guidelines:

Specific files you're modifying
Direct dependencies
Related tests
Don't: Load "just in case" files

Practice Exercise: Create Your Loading Plan

Let's practice creating a progressive loading plan.

Scenario

You're joining a new project: An e-commerce API built with FastAPI and PostgreSQL.

Project structure:

project/
├── src/
│   ├── api/routes/
│   │   ├── products.py
│   │   ├── orders.py
│   │   ├── users.py
│   │   └── payments.py
│   ├── services/
│   │   ├── product_service.py
│   │   ├── order_service.py
│   │   ├── user_service.py
│   │   └── payment_service.py
│   ├── models/
│   │   ├── product.py
│   │   ├── order.py
│   │   └── user.py
│   └── database/
│       ├── connection.py
│       └── migrations/
├── tests/
└── docs/
    ├── API.md
    └── ARCHITECTURE.md

Your task: Add a "shopping cart" feature.

Your Turn: Plan the Three Phases

Before looking at the answer below, try planning:

Phase 1: What should you analyze without reading code?
Phase 2: Which 3-5 files should you read to understand patterns?
Phase 3: What specific context do you need for implementing cart?

Click to reveal suggested answer

Phase 1: Overview

"Analyze project structure without reading files:
- What's the architecture pattern?
- Where should cart feature live?
- How do existing features connect (API → Service → Model)?"

Phase 2: Module Focus (Cart is similar to Products and Orders)

"Read these files to understand patterns:
src/models/product.py - see model pattern
src/services/product_service.py - see service pattern
src/services/order_service.py - see how orders work with products
src/api/routes/orders.py - see API route structure
docs/ARCHITECTURE.md - understand database approach"

Phase 3: Implementation

"Create cart feature following patterns:
1. Cart model (similar to Order)
2. CartService (following ProductService pattern)
3. Cart API routes (matching existing route structure)
Follow all established patterns—don't introduce new ones"

Validation Checkpoints

How do you know if your progressive loading strategy is working?

✅ Signs You're Doing It Right

After Phase 1:

You understand project structure
You know where relevant code lives
You haven't read any actual implementation yet
Context is under 5%

After Phase 2:

You understand the coding patterns used
You know how similar features are implemented
You've only loaded 3-7 files
Context is under 15%

After Phase 3:

You've implemented the feature
Generated code matches existing patterns
Context is under 40%
Still have room for iteration and refinement

❌ Signs You Need to Adjust

Red Flags:

❌ Context over 50% after Phase 2
❌ AI asks "Which pattern should I follow?" (means too many patterns loaded)
❌ Loaded files you haven't referenced yet
❌ AI seems confused about which approach to use

What to do: Start a new session, reload only what's actually needed.

Try With AI

Tool: Claude Code

Now let's practice progressive loading thinking.

Prompt 1: Understanding the Concept

claude "I'm starting work on a Python project with 30 files. Should I ask my AI to read all files first, or load them progressively as needed? Why? Explain using the three-phase progressive loading approach."

Expected Outcome:

The AI recommends progressive loading
Explanation of why all-at-once causes problems
Description of the 3-phase approach

Check: Can you explain why progressive loading is better?

Prompt 2: Creating a Loading Plan

claude "I need to add user authentication to a FastAPI project I just joined. The project has 40+ files. I want to use progressive context loading.

Create a 3-phase loading plan for me:
- Phase 1 (Overview): What should I analyze first?
- Phase 2 (Module Focus): Which files should I read?
- Phase 3 (Implementation): What context do I need for building auth?

Assume the project structure is:
src/api/, src/services/, src/models/, tests/"

Expected Outcome:

Phase 1: Structure analysis, no files read yet
Phase 2: Similar existing features (if any), service patterns, model patterns
Phase 3: Specific files to modify, security patterns, testing patterns

Reflection: Does this plan make sense? Would you follow it?

Prompt 3: Real-World Application

claude "I loaded 20 files into my AI context at the start of my session. Now I'm only using 3 of them for my current task. The other 17 are just sitting there taking up space.

What problem does this cause?
What should I have done instead?
What should I do now to fix this?"

Expected Outcome:

Problem: Wasted context, faster rot, slower AI
Should have: Used progressive loading, loaded only needed 3
Fix now: Start fresh session with just the 3 needed files

Action: This is a common situation—now you know how to fix it!

The Problem: Loading Everything Upfront​

A Common Mistake​

Why This Doesn't Work​

What is Progressive Context Loading?​

WHAT: The Strategy​

The Three-Phase Approach​

Visual Comparison​

WHY: Why Progressive Loading Works​

Three Key Benefits​

Real-World Analogy​

HOW: Implementing Progressive Loading​

Scenario: Joining a New Project​

🚫 The Wrong Way (All-at-Once)​

✅ The Right Way (Progressive Loading)​

Phase 1: High-Level Overview​

Phase 2: Module Focus​

Phase 3: Task-Specific Implementation​

The Progression Visualized​

Practical Guidelines​

When to Use Each Phase​

How Much to Load in Each Phase​

Practice Exercise: Create Your Loading Plan​

Scenario​

Your Turn: Plan the Three Phases​

Validation Checkpoints​

✅ Signs You're Doing It Right​

❌ Signs You Need to Adjust​

Try With AI​

Prompt 1: Understanding the Concept​

Prompt 2: Creating a Loading Plan​

Prompt 3: Real-World Application​

The Problem: Loading Everything Upfront

A Common Mistake

Why This Doesn't Work

What is Progressive Context Loading?

WHAT: The Strategy

The Three-Phase Approach

Visual Comparison

WHY: Why Progressive Loading Works

Three Key Benefits

Real-World Analogy

HOW: Implementing Progressive Loading

Scenario: Joining a New Project

🚫 The Wrong Way (All-at-Once)

✅ The Right Way (Progressive Loading)

Phase 1: High-Level Overview

Phase 2: Module Focus

Phase 3: Task-Specific Implementation

The Progression Visualized

Practical Guidelines

When to Use Each Phase

How Much to Load in Each Phase

Practice Exercise: Create Your Loading Plan

Scenario

Your Turn: Plan the Three Phases

Validation Checkpoints

✅ Signs You're Doing It Right

❌ Signs You Need to Adjust

Try With AI

Prompt 1: Understanding the Concept

Prompt 2: Creating a Loading Plan

Prompt 3: Real-World Application