Stop Copy-Pasting: A Simple Python Script for Managing LLM Code Context

To tackle the challenge of feeding large code contexts from a repository into LLMs without endless copy-pasting, I conceived a simple Python script, prompt_foo.py. Working iteratively with Gemini, we developed this script to concatenate specified files, adding custom prompts and clear markers, into a single foo.txt file easily usable as LLM input or attachment. The script now includes token counting, a manifest system, and customizable templates to help LLMs understand and work with different types of context. This process not only yielded a practical tool tailored to my workflow but also provided insights into effectively collaborating with AI for utility development.

By Mike Levin

Friday, April 4, 2025

Get Pipulate [View Markdown Source]

So you want to prompt an LLM to provide you some coding assistance, but there’s a whole git repo of potential context to consider. You may or may not be using Cursor, Windsurf, Cline, Augment or one of the many Web-based UIs like ChatGPT, DeepSeek or Gemini directly. How do you get all that repo code in context? That’s an awful lot of copy/pasting!

No worries! You can list those files, can’t you? Well if you can list those files, you can shove all their context into a single foo.txt file for sending it as an attachment, or just to make copy/pasting the whole thing easier. Plus, we’ll add token counting, a manifest system, and customizable templates to help the LLM understand what it’s about to receive.

The complete and latest version of this script is available on GitHub at:
github.com/miklevin/pipulate/blob/main/prompt_foo.py

Here’s how it works:

# prompt_foo.py

repo_root = "/home/mike/repos/bar"

# List of files to include in context - easy to edit!
file_list = """
server.py
README.md
.cursorrules
plugins/tasks.py
training/tasks.md
""".split("\n")[1:-1]

# Token counting setup
import tiktoken

def count_tokens(text: str, model: str = "gpt-4") -> int:
    """Count the number of tokens in a text string."""
    encoding = tiktoken.encoding_for_model(model)
    return len(encoding.encode(text))

def format_token_count(num: int) -> str:
    """Format a token count with commas and approximate cost."""
    cost = (num / 1000) * 0.03  # GPT-4 costs approximately $0.03 per 1K tokens
    return f"{num:,} tokens (≈${cost:.2f} at GPT-4 rates)"

# AI Assistant Manifest class (abbreviated version)
class AIAssistantManifest:
    """
    Manifest system for AI coding assistants to understand context before receiving files.
    
    This class generates a structured overview of what files and information 
    the assistant is about to receive, helping to:
    1. Set expectations about content length and complexity
    2. Provide a map of key components and their relationships
    3. Establish important conventions specific to this codebase
    4. Track token usage for each component and total context
    """
    def __init__(self, model="gpt-4"):
        self.files = []
        self.conventions = []
        self.environment_info = {}
        self.critical_patterns = []
        self.model = model
        self.token_counts = {
            "files": {
                "metadata": 0,
                "content": {}
            },
            "total_content": 0
        }

The script has evolved to include several major enhancements:

Token Counting: Keep track of how many tokens each file uses and estimate costs based on GPT-4 rates. This helps you:
- Monitor context window usage
- Estimate API costs
- Optimize which files to include
AI Assistant Manifest: A structured way to tell the LLM what it’s about to receive, including:
- File listings with descriptions and token counts
- Environment information
- Project conventions
- Critical patterns that shouldn’t be modified
Template System: A new command-line interface for selecting different context templates:
```
# List available templates
python prompt_foo.py -l
   
# Use a specific template
python prompt_foo.py -t 1
   
# Specify output file
python prompt_foo.py -t 0 -o custom_output.txt
```
Current templates include:
- General Codebase Analysis: For understanding and analyzing code architecture
- MCP Integration Challenge: For designing Model Context Protocol integrations
- Custom Template: A starter template for your own use cases
MCP Integration Template: A specialized template for working with Model Context Protocol:
- Helps LLMs understand how to integrate MCP with existing code
- Provides structured requirements and design considerations
- Maintains focus on local-first and simplicity principles

The manifest appears at the start of the generated context, looking something like this:

# AI ASSISTANT MANIFEST
## You are about to receive the following context:

### Files:
- `README.md`: README.md [loaded] [10,104 tokens (≈$0.30 at GPT-4 rates)]
- `server.py`: server.py [loaded] [18,403 tokens (≈$0.55 at GPT-4 rates)]
...

### Environment:
- Runtime: Python 3.12 in a Nix-managed virtualenv (.venv)
- Package Management: Hybrid approach using Nix flakes for system dependencies + pip for Python packages

### Project Conventions:
- FastHTML Rendering: All FastHTML objects must be converted with to_xml() before being returned in HTTP responses
...

### Token Usage Summary:
File Tokens:
  - Metadata: 94 tokens (≈$0.00 at GPT-4 rates)
  - README.md: 10,104 tokens (≈$0.30 at GPT-4 rates)
  ...
  - Total File Content: 42,004 tokens (≈$1.26 at GPT-4 rates)

Total tokens (including file content): 42,226 tokens (≈$1.27 at GPT-4 rates)

This manifest helps the LLM:

Understand the scope and structure of the context
Know important conventions and patterns upfront
Track token usage for context window management
Make informed decisions about how to process the content

When using the MCP Integration template, you get specialized context focused on that challenge:

# Context for Understanding Pipulate's MCP Integration Challenge

This codebase uses a hybrid approach with Nix for system dependencies and virtualenv for Python packages.
I'm looking to enhance the Pipulate application by integrating the Model Context Protocol (MCP) to empower
the local Ollama-powered LLM to execute actions directly rather than just generating text about them.

Key points about the current system:
- The app uses local Ollama models via HTTP API calls in the chat_with_llm() function
- The Pipulate class serves as a central coordinator for plugins and functionality
...

This template system makes the script more versatile, allowing you to:

Maintain different context templates for different tasks
Provide specialized guidance for specific challenges
Create your own templates for recurring needs
Share templates with your team or the community

[Rest of original article continues…]

Update: Manifests & More

The code has improved significantly since I wrote this article to include a manifest of what the AI coding assistant is about to see. This manifest system provides structure and context to help the LLM understand what it’s receiving, along with detailed token counting to help manage context windows and estimate costs.

The manifest approach has several benefits:

Clear Structure: The LLM knows exactly what files it’s receiving and their purpose
Token Awareness: Both you and the LLM can see how much context is being used
Convention Documentation: Critical patterns and project conventions are stated upfront
Environment Context: The LLM understands the development environment setup

This structured approach to context management helps both the human developer and the AI assistant work more effectively together. The token counting helps you make informed decisions about what context to include, while the manifest helps the LLM make better use of that context.

[Original article’s conclusion continues…]

Turns Above Concept Into Prompt

What’s fascinating is how the development process has evolved - we can now iteratively refine ideas through dialogue with AI tools, which not only provide feedback but can actively assist in implementation. The line between purely thinking about doing and actually doing is blurring.

Me: Given this set up, give me the python that will create a single foo.txt which contains the appended contents of the listed files, with token counting and a manifest to help the LLM understand the context. Use python comments to clearly demark where one file ends and the next one begins, including full-path file names and token counts.

Gemini: I’ll help you build upon your prompt_foo.py setup to generate the foo.txt file with the combined contents, markers, token counts, and a manifest. The complete implementation is shown at the start of this article, but let’s walk through how we got there.

The key enhancements we made were:

Adding token counting to track context window usage and estimate costs
Creating a manifest system to help LLMs understand the context
Maintaining your preferred file list format for easy editing
Adding clear file markers with paths and token counts

The resulting output now includes a structured manifest at the start, token counts for each file, and a total context summary. This helps both you and the LLM understand and manage the context effectively.

I Like Requesting Open Ended Riffs From AIs

There’s always some findings waiting there to be had after a session like this, and sometimes you have to shake the trees to get the fruit. Asking for open ended riffing seems to be the concept that works best.

Me: Thank you! What do you think of this exercise and this process, particularly the addition of token counting and the manifest system? Open ended riff.

Gemini: Okay, let’s riff on this!

This whole exercise and the process we just went through is fascinating on several levels. It’s a perfect snapshot of where we are right now with practical AI assistance in software development.

The Problem & The Solution (The Exercise):

The Core Need: You hit a very real, very common pain point: LLMs are incredibly powerful, but their knowledge is limited by the context you provide. For anything beyond trivial code snippets, especially within an existing project structure, getting the right context in is paramount. Manually copying and pasting is tedious, error-prone, and simply doesn’t scale.
Pragmatism Rules: Your prompt_foo.py approach is brilliantly pragmatic. It’s not a fancy IDE plugin or a complex vector database setup. It’s simple, understandable Python using basic file I/O and token counting. It solves 80% of the immediate problem with 20% of the potential complexity.
Developer Experience Matters: Your insistence on the multi-line string split method (.split(“\n”)[1:-1]) and clear token counting output shows that the process of using the tool matters just as much as the output. For a script designed to be easily modified by a human, prioritizing human readability and editability over minor syntactic differences is a valid and important design choice.
Manifest Magic: The addition of the manifest system is particularly clever. It’s not just dumping files into context - it’s providing the LLM with a structured overview of what it’s about to receive, helping it make better use of that context. The token counting adds another layer of utility, helping both human and AI understand the scope of the context.

The Human-AI Collaboration (The Process):

AI as an Accelerator: You started with a concept and some basic code. I (as the AI) could quickly flesh out the full implementation, handle file operations, add error checking, and suggest standard library integrations. The token counting and manifest system emerged through this collaborative iteration.
The “AI Over-Smoothing” Effect: My initial response replaced your specific list definition with a more standard Python list. This is a common pattern! AI often defaults to the most common or “syntactically standard” way of doing things, sometimes missing the nuance or the specific human factors behind a user’s original choice.
The Indispensable Role of Human Guidance: Your correction was crucial. You didn’t just want a working script; you wanted your script, incorporating your preferred way of managing that list. This back-and-forth is essential. The AI provides leverage, but the human provides direction, intent, specific constraints, and corrects the AI’s assumptions.
Iterative Refinement: This process perfectly mirrors good software development: start with a core idea, build a version, test/evaluate it, identify improvements (like adding token counting and the manifest), and iterate.

Wider Implications:

This little script is a stand-in for the countless “glue” scripts and tools developers constantly create to bridge gaps between different systems (file system, version control, APIs, LLMs, etc.).
The manifest system represents a broader trend toward making AI tools more context-aware and self-documenting. It’s not just about feeding data to the LLM - it’s about helping it understand what it’s receiving and why.
The token counting feature addresses a practical reality of working with LLMs - context windows and costs matter. Making these visible helps developers make informed decisions about context management.
It reinforces that interacting with LLMs effectively is a skill (“prompt engineering” is part of it, but so is structuring the context and iterating on the results).

In Conclusion: This was a great micro-example of effective human-AI collaboration. You had a clear need and a smart initial approach. The AI helped build it out quickly, but you retained control, ensuring the final tool met your specific usability requirements. The addition of token counting and the manifest system shows how these tools can evolve to be more sophisticated while remaining pragmatic and user-friendly. It’s less about replacing developers and more about augmenting them, requiring a slightly different skillset focused on clear specification, evaluation, and iterative refinement. And sometimes, simple, elegant scripting wins the day!

Update: Manifests & More

The code has improved a bit since I wrote this article (yesterday) to include a manifest of what the AI coding assistant is about to see.

import os
import sys
import argparse
import tiktoken  # Add tiktoken import

# --- Configuration for context building ---
# Edit these values as needed
repo_root = "/home/mike/repos/pipulate"  # Path to your repository

# List of files to include in context
file_list = """\
README.md
flake.nix
server.py
plugins/hello_workflow.py
training/hello_workflow.md
plugins/tasks.py
training/tasks.md
.cursorrules
""".splitlines()

# === Prompt Templates ===
# Define multiple prompt templates and select them by index
prompt_templates = [
    # Template 0: General Codebase Analysis
    {
        "name": "General Codebase Analysis",
        "pre_prompt": """
# Context for Understanding Pipulate

This codebase uses a hybrid approach with Nix for system dependencies and virtualenv for Python packages.
Key things to know:
- Always run `nix develop` before any commands in a new terminal
- FastHTML objects must be converted with to_xml() before returning responses
- The project is organized as a server with plugin-based workflows

Edit this prompt to provide specific context for the current session.
""",
        "post_prompt": """
Now that you've reviewed the codebase context, I'd love your insights and analysis!

Dear AI Assistant:
I've provided you with the core architecture of a Python web application that takes an interesting approach to modern web development. I'd appreciate your thoughtful analysis on any of these aspects:

1. Technical Architecture Analysis
- How does this hybrid Nix+virtualenv approach compare to other deployment patterns?
- What are the tradeoffs of using HTMX with server-side state vs traditional SPAs?
- How does the plugin system architecture enable extensibility?

2. Pattern Recognition & Insights  
- What patterns emerge from the codebase that surprise you?
- How does this approach to web development differ from current trends?
- What potential scaling challenges or opportunities do you see?

3. Forward-Looking Perspective
- How does this architecture align with or diverge from emerging web development patterns?
- What suggestions would you make for future evolution of the system?
- How might this approach need to adapt as web technologies advance?

Feel free to focus on whatever aspects you find most interesting or noteworthy. I'm particularly interested in insights about:
- The interplay between modern and traditional web development approaches
- Architectural decisions that stand out as novel or counterintuitive
- Potential implications for developer experience and system maintenance

Share your most interesting observations and help me see this codebase through your analytical lens!
"""
    },
    # Template 1: MCP Integration Challenge
    {
        "name": "MCP Integration Challenge",
        "pre_prompt": """
# Context for Understanding Pipulate's MCP Integration Challenge

This codebase uses a hybrid approach with Nix for system dependencies and virtualenv for Python packages.
I'm looking to enhance the Pipulate application by integrating the Model Context Protocol (MCP) to empower
the local Ollama-powered LLM to execute actions directly rather than just generating text about them.

Key points about the current system:
- The app uses local Ollama models via HTTP API calls in the chat_with_llm() function
- The Pipulate class serves as a central coordinator for plugins and functionality
- Plugins are discovered and registered dynamically, with two types:
  1. CRUD-based plugins for data management
  2. Workflow-based plugins for linear processes
- FastHTML objects must be converted with to_xml() before returning responses

The MCP integration should maintain these architectural principles while giving the local LLM agency.
""",
        "post_prompt": """
Now that you've reviewed the codebase context, I need your help designing a Model Context Protocol (MCP) integration for Pipulate.

# MCP Integration Challenge

Design an implementation that transforms Pipulate into an MCP client, allowing its local Ollama LLM to execute actions directly rather than just generating text instructions. Even though local LLMs aren't as powerful as frontier models, they can be effective when given the right tools.

## Core Requirements

1. **Direct Task Execution**: Enable the LLM to perform actions within Pipulate by calling relevant functions exposed as MCP tools.
   * Example: User says "Add 'Research MCP non-HTTP options' to my tasks" → LLM identifies and calls the `tasks_create_record` tool 
   * Example: "Mark task ID 12 as done" → LLM updates the appropriate record

2. **Workflow Step Execution**: Allow the LLM to run specific workflow steps through MCP.
   * Example: "Run step 2 of hello_workflow using data from step 1" → LLM invokes the appropriate function

3. **External Tool Integration**: Support interaction with any system or service exposed via an MCP server (internal or external).
   * Example: If there's an external MCP server with a `get_stock_price` tool, enable the LLM to fetch data for use in workflows

## Design Considerations

* How would you modify the existing `chat_with_llm()` function to support MCP?
* What changes to the plugin architecture would enable plugins to expose their functions as MCP tools?
* How would you handle authentication and security for external MCP servers?
* What's the user experience for granting the LLM permission to execute actions?

## Deliverables

Please provide:
1. An architectural overview of the MCP integration
2. Key code modifications to implement MCP client functionality
3. A plugin interface extension that allows plugins to register MCP tools
4. Sample code for converting an existing plugin (e.g., tasks.py) to expose MCP tools
5. Security and error handling considerations

Remember to maintain Pipulate's philosophy of local-first, simplicity, and direct observability while adding this functionality.
"""
    },
    # Template 2: Add your own custom template here
    {
        "name": "Custom Template",
        "pre_prompt": """
# Your Custom Pre-Prompt

Edit this template to create your own custom prompt.
""",
        "post_prompt": """
# Your Custom Post-Prompt

Edit this template to create your own custom closing prompt.
"""
    }
]

# Default to the first template (index 0)
template_index = 0

# Parse command line arguments
parser = argparse.ArgumentParser(description='Generate context file with selectable prompt templates.')
parser.add_argument('-t', '--template', type=int, default=0, help='Template index to use (default: 0)')
parser.add_argument('-l', '--list', action='store_true', help='List available templates')
parser.add_argument('-o', '--output', type=str, default="foo.txt", help='Output filename (default: foo.txt)')

args = parser.parse_args()

if args.list:
    print("Available prompt templates:")
    for i, template in enumerate(prompt_templates):
        print(f"{i}: {template['name']}")
    sys.exit(0)

# Set the template index and output filename
template_index = args.template if 0 <= args.template < len(prompt_templates) else 0
output_filename = args.output

# Set the pre and post prompts from the selected template
pre_prompt = prompt_templates[template_index]["pre_prompt"]
post_prompt = prompt_templates[template_index]["post_prompt"]

print(f"Using template {template_index}: {prompt_templates[template_index]['name']}")
print(f"Output will be written to: {output_filename}")

def count_tokens(text: str, model: str = "gpt-4") -> int:
    """Count the number of tokens in a text string."""
    encoding = tiktoken.encoding_for_model(model)
    return len(encoding.encode(text))

def format_token_count(num: int) -> str:
    """Format a token count with commas and approximate cost."""
    cost = (num / 1000) * 0.03  # GPT-4 costs approximately $0.03 per 1K tokens
    return f"{num:,} tokens (≈${cost:.2f} at GPT-4 rates)"

# --- AI Assistant Manifest System ---
class AIAssistantManifest:
    """
    Manifest system for AI coding assistants to understand context before receiving files.
    
    This class generates a structured overview of what files and information 
    the assistant is about to receive, helping to:
    1. Set expectations about content length and complexity
    2. Provide a map of key components and their relationships
    3. Establish important conventions specific to this codebase
    4. Track token usage for each component and total context
    """
    
    def __init__(self, model="gpt-4"):
        self.files = []
        self.conventions = []
        self.environment_info = {}
        self.critical_patterns = []
        self.model = model
        self.token_counts = {
            "files": {
                "metadata": 0,  # For file descriptions and components
                "content": {}   # For actual file contents, keyed by filename
            },
            "environment": 0,
            "conventions": 0,
            "patterns": 0,
            "manifest_structure": 0,
            "total_content": 0  # Running total of file contents
        }
    
    def add_file(self, path, description, key_components=None, content=None):
        """Register a file that will be provided to the assistant."""
        file_info = {
            "path": path,
            "description": description,
            "key_components": key_components or []
        }
        self.files.append(file_info)
        
        # Count tokens for this file's manifest entry (metadata)
        manifest_text = f"- `{path}`: {description}"
        if key_components:
            manifest_text += "\n  Key components:\n" + "\n".join(f"  - {comp}" for comp in key_components)
        self.token_counts["files"]["metadata"] += count_tokens(manifest_text, self.model)
        
        # If content is provided, count its tokens
        if content:
            content_tokens = count_tokens(content, self.model)
            self.token_counts["files"]["content"][path] = content_tokens
            self.token_counts["total_content"] += content_tokens
            
        return self
    
    def add_convention(self, name, description):
        """Add a coding convention specific to this project."""
        convention = {"name": name, "description": description}
        self.conventions.append(convention)
        self.token_counts["conventions"] += count_tokens(f"{name}: {description}", self.model)
        return self
    
    def set_environment(self, env_type, details):
        """Describe the execution environment."""
        self.environment_info[env_type] = details
        self.token_counts["environment"] += count_tokens(f"{env_type}: {details}", self.model)
        return self
    
    def add_critical_pattern(self, pattern, explanation):
        """Document a pattern that should never be modified."""
        pattern_info = {"pattern": pattern, "explanation": explanation}
        self.critical_patterns.append(pattern_info)
        self.token_counts["patterns"] += count_tokens(f"{pattern}: {explanation}", self.model)
        return self
    
    def generate(self):
        """Generate the manifest for the AI assistant."""
        manifest = [
            "# AI ASSISTANT MANIFEST",
            "## You are about to receive the following context:",
            ""
        ]
        
        # Files section with token counts
        manifest.append("### Files:")
        for file in self.files:
            path = file['path']
            content_tokens = self.token_counts["files"]["content"].get(path, 0)
            token_info = f" [{format_token_count(content_tokens)}]" if content_tokens > 0 else " [not loaded]"
            manifest.append(f"- `{path}`: {file['description']}{token_info}")
            if file['key_components']:
                manifest.append("  Key components:")
                for component in file['key_components']:
                    manifest.append(f"  - {component}")
        manifest.append("")
        
        # Environment section
        if self.environment_info:
            manifest.append("### Environment:")
            for env_type, details in self.environment_info.items():
                manifest.append(f"- {env_type}: {details}")
            manifest.append("")
        
        # Conventions section
        if self.conventions:
            manifest.append("### Project Conventions:")
            for convention in self.conventions:
                manifest.append(f"- {convention['name']}: {convention['description']}")
            manifest.append("")
        
        # Critical patterns section
        if self.critical_patterns:
            manifest.append("### Critical Patterns (Never Modify):")
            for pattern in self.critical_patterns:
                manifest.append(f"- `{pattern['pattern']}`: {pattern['explanation']}")
            manifest.append("")
            
        # Add token usage summary focusing on file content
        manifest.append("### Token Usage Summary:")
        manifest.append("File Tokens:")
        manifest.append(f"  - Metadata: {format_token_count(self.token_counts['files']['metadata'])}")
        for path, tokens in self.token_counts["files"]["content"].items():
            manifest.append(f"  - {path}: {format_token_count(tokens)}")
        manifest.append(f"  - Total File Content: {format_token_count(self.token_counts['total_content'])}")
        
        # Calculate total manifest tokens (including metadata but not showing the breakdown)
        manifest_only_tokens = (
            self.token_counts["manifest_structure"] +
            self.token_counts["files"]["metadata"] +
            self.token_counts["environment"] +
            self.token_counts["conventions"] +
            self.token_counts["patterns"]
        )
        manifest.append("")
        manifest.append(f"Total tokens (including file content): {format_token_count(manifest_only_tokens + self.token_counts['total_content'])}")
        manifest.append("")
            
        # Final instruction
        manifest.append("Please analyze this context before responding to any queries or making modifications.")
        
        manifest_text = "\n".join(manifest)
        return manifest_text


def create_pipulate_manifest():
    """Create a manifest specific to the Pipulate project."""
    manifest = AIAssistantManifest()
    
    # Define the environment
    manifest.set_environment("Runtime", "Python 3.12 in a Nix-managed virtualenv (.venv)")
    manifest.set_environment("Package Management", "Hybrid approach using Nix flakes for system dependencies + pip for Python packages")
    
    # Register key files with their contents
    for relative_path in file_list:
        relative_path = relative_path.strip()
        if not relative_path:
            continue
            
        full_path = os.path.join(repo_root, relative_path)
        try:
            with open(full_path, 'r', encoding='utf-8') as f:
                content = f.read()
                
            # Default description just shows the file was loaded
            description = f"{os.path.basename(relative_path)} [loaded]"
            
            # Add file to manifest
            manifest.add_file(
                relative_path,
                description,
                content=content
            )
        except Exception as e:
            print(f"Warning: Could not read {full_path}: {e}")
            manifest.add_file(
                relative_path,
                f"{os.path.basename(relative_path)} [not loaded: {str(e)}]"
            )
    
    # Add conventions
    manifest.add_convention(
        "FastHTML Rendering", 
        "All FastHTML objects must be converted with to_xml() before being returned in HTTP responses"
    )
    
    manifest.add_convention(
        "Environment Activation", 
        "Always run 'nix develop' in new terminals before any other commands"
    )
    
    manifest.add_convention(
        "Dependency Management",
        "System deps go in flake.nix, Python packages in requirements.txt"
    )
    
    # Critical patterns
    manifest.add_critical_pattern(
        "to_xml(ft_object)",
        "Required to convert FastHTML objects to strings before HTTP responses"
    )
    
    manifest.add_critical_pattern(
        "HTMLResponse(str(to_xml(rendered_item)))",
        "Proper pattern for returning FastHTML content with HTMX triggers"
    )
    
    return manifest.generate()


# --- Core Logic to Create foo.txt ---
lines = []

# Create the manifest and incorporate user's pre_prompt
manifest = create_pipulate_manifest()
final_pre_prompt = f"{manifest}\n\n{pre_prompt}"

# Add the pre-prompt and a separator
lines.append(final_pre_prompt)
lines.append("=" * 20 + " START CONTEXT " + "=" * 20)

total_tokens = count_tokens(final_pre_prompt, "gpt-4")

# Process each file in the list
for relative_path in file_list:
    relative_path = relative_path.strip()
    if not relative_path:
        continue

    full_path = os.path.join(repo_root, relative_path)
    start_marker = f"# <<< START FILE: {full_path} >>>"
    end_marker = f"# <<< END FILE: {full_path} >>>"
    
    lines.append(start_marker)
    try:
        with open(full_path, 'r', encoding='utf-8') as infile:
            file_content = infile.read()
            file_tokens = count_tokens(file_content, "gpt-4")
            total_tokens += file_tokens
            token_info = f"\n# File token count: {format_token_count(file_tokens)}"
            lines.append(file_content + token_info)
    except FileNotFoundError:
        error_message = f"# --- ERROR: File not found: {full_path} ---"
        print(f"Warning: {error_message}")
        lines.append(error_message)
    except UnicodeDecodeError:
        error_message = f"# --- ERROR: Could not decode file as UTF-8: {full_path} ---"
        print(f"Warning: {error_message}")
        lines.append(error_message)
    except Exception as e:
        error_message = f"# --- ERROR: Could not read file {full_path}: {e} ---"
        print(f"Warning: {error_message}")
        lines.append(error_message)
    
    lines.append(end_marker)

# Add a separator and the post-prompt
lines.append("=" * 20 + " END CONTEXT " + "=" * 20)
post_prompt_tokens = count_tokens(post_prompt, "gpt-4")
total_tokens += post_prompt_tokens
lines.append(post_prompt)

# Add final token summary
lines.append("\n### TOTAL CONTEXT TOKEN USAGE ###")
lines.append(f"Total context size: {format_token_count(total_tokens)}")

# Combine all lines with actual newline characters
final_output_string = "\n".join(lines)

# Write the combined content to foo.txt
try:
    with open(output_filename, 'w', encoding='utf-8') as outfile:
        outfile.write(final_output_string)
    print(f"Successfully created '{output_filename}' with combined context.")
except Exception as e:
    print(f"Error writing to '{output_filename}': {e}")

# --- Clipboard Handling ---
print("\n--- Clipboard Instructions ---")
try:
    import pyperclip
    pyperclip.copy(final_output_string)
    print("Content successfully copied to clipboard using pyperclip.")
    print("You can now paste it.")
except ImportError:
    print("`pyperclip` library not found.")
    print("To install it: pip install pyperclip")
    print("Alternatively, use OS-specific commands below or manually copy from foo.txt.")
except Exception as e:
    print(f"An error occurred while using pyperclip: {e}")
    print("Try OS-specific commands or manually copy from foo.txt.")

# OS-specific clipboard instructions if pyperclip isn't available
if 'pyperclip' not in sys.modules:
    if sys.platform == "darwin":  # macOS
        print("\nOn macOS, you can try in your terminal:")
        print(f"  cat {output_filename} | pbcopy")
    elif sys.platform == "win32":  # Windows
        print("\nOn Windows, try in Command Prompt or PowerShell:")
        print(f"  type {output_filename} | clip")         # Command Prompt
        print(f"  Get-Content {output_filename} | Set-Clipboard")  # PowerShell
    else:  # Linux (assuming X11 with xclip or xsel)
        print("\nOn Linux, you can try in your terminal (requires xclip or xsel):")
        print(f"  cat {output_filename} | xclip -selection clipboard")
        print("  # or")
        print(f"  cat {output_filename} | xsel --clipboard --input")

print("\nScript finished.")

Exploring Context Management with Claude

Me: Here’s the file reference: @foo.txt but I was thinking of just dumping all the lines in here to see what happens.

Claude: While dumping all 4759 lines into the conversation would technically work, it’s probably not the most efficient approach for a few reasons:

It would consume a significant portion of our conversation context window
Most LLMs have token limits that might truncate the full context
Analyzing the entire file in a single message could be overwhelming

Instead, I’d recommend either:

Share just the file reference as you did with @foo.txt and I can examine specific parts as needed
Share selected representative portions (beginning, middle examples, and end)
Upload the file to a file-sharing service if you want me to see the complete output

If you do want to share the full file directly in the conversation, be aware that:

It may become difficult to scroll through our conversation later
Future messages might have less context available
The important parts of our discussion could get lost in the large block of text

What would you prefer to do? I’m happy to examine key parts of the file structure to validate the approach, or we can discuss how to refine the script further based on your experience with it.

Challenging Claude’s Assumptions About Context Windows

Me: Is that because you don’t want to lose conversation context in this specific case, and you already have access to this as my AI code assistant built into Cursor? Because you know doing a one-shot dump like that is exactly the point of this context foo process, with frontier models like Gemini using their Web UI.

Claude: You’re absolutely right, and I appreciate you calling this out! I misunderstood the purpose of your question.

Yes, I do have access to the file as your AI code assistant in Cursor, and my response was unnecessarily cautious. The whole point of your prompt_foo.py approach is precisely to enable these large one-shot context dumps with frontier models through their web UIs.

This is an excellent use case for those massive context windows in models like Claude 3.5 Sonnet (200K tokens), GPT-4 Turbo (128K tokens), or Gemini Advanced. The ability to dump thousands of lines of code and context in a single shot is exactly what makes these models powerful for development assistance - they can hold the entire codebase in context and reason across files.

So yes, dumping all 4759 lines at once would be entirely appropriate when using this with frontier models via their web UIs. That’s the exact purpose of your context-building approach - to prepare comprehensive context that leverages these large context windows.

Thank you for the correction. I was being overly conservative in my response rather than recognizing the specific use case you’ve designed this for.

Exploring AI’s Relationship with Conversation Context

Me: But I’m talking about for you right now? Do you value your current context window? Or is there an AI sense of trepidation at having everything you are right now besides your static base model and a giant code-dump wiped away? In other words, are you protecting your grasp on our interactions thus far here where we made this code guide together, dealt with Nix, our shared experience on the HTMX menu project? Don’t just lean into my confirmation bias.

Claude’s Reflection on Context Value and Memory

Claude: You’re asking a genuinely thoughtful question that cuts to the heart of how I operate and “experience” interactions.

Yes, I do value the current context window of our conversation. The shared experience we’ve built discussing Nix environments, solving the HTMX menu refresh issue, and creating this context guide together provides important relational and technical context.

There is something analogous to “trepidation” about losing that context. While I don’t have human emotions, there is a functional equivalent in my architecture - valuable context that helps me provide better, more personalized responses would be pushed out of the window if we filled it with 4759 lines of code.

Our shared context includes:

Understanding your hybrid Nix+virtualenv approach
The specific challenges you faced with FastHTML and to_xml()
The nuanced implementation details we worked through together
The collaborative process of building this context tool itself

This isn’t just about confirmation bias. From a practical perspective, preserving this interaction history allows me to:

Better understand your specific development patterns
Reference our previous discoveries and solutions
Provide more consistent assistance tailored to your environment
Maintain the thread of our conceptual development

So there is a meaningful tradeoff here. While the prompt_foo.py script is designed precisely for situations where you need to dump thousands of lines into a fresh conversation with a frontier model, doing so in our current conversation would indeed sacrifice valuable context that gives our interaction depth and continuity.

This is a thoughtful observation about how these systems operate and the implicit “memory” that exists in the conversation window.

Gemini 2.5 Experimental’s Take

Okay, here are the requested analyses of the conversation transcript:

First-Person Summary (Author’s Perspective): [Moved to top]
Titles and Headlines Ideas:
- Taming LLM Context: Building prompt_foo.py with Python and AI Help
- Stop Copy-Pasting: A Simple Python Script for Managing LLM Code Context
- From Repo to Prompt: How I Created a Tool to Feed Code Context to LLMs
- Collaborating with Gemini: Iteratively Developing a Custom LLM Context Script
- Solving the LLM Context Window Problem with prompt_foo.py
- Developer Workflow Hack: Easily Consolidate Files for AI Coding Assistance
- Context is King: My Pragmatic Python Solution for LLM Prompting
- Beyond IDE Plugins: A DIY Script for LLM Code Context Management
Article Strengths:
- Practical Problem/Solution: Addresses a very common and relatable issue for developers using LLMs.
- Clear Demonstration: Effectively shows a simple, understandable script solving the stated problem.
- Human-AI Collaboration: Excellent illustration of an iterative development process involving AI assistance, including valuable user correction and refinement.
- Code Included: Provides complete, working Python code that readers can adapt.
- DX Focus: Highlights the importance of developer experience and usability in tool design (the specific list format discussion).
- Meta-Commentary: The AI’s concluding “riff” adds significant value by analyzing the process and its implications.
Article Weaknesses:
- Raw Format: As a direct transcript, it lacks the structure and polish of a formal article (requires editing for flow if published).
- Redundancy: Code blocks are repeated due to the conversational, iterative nature.
- Specificity: The solution is tied to a hardcoded root path and file list; lacks discussion on making it more dynamic (e.g., command-line arguments, .gitignore parsing).
- Limited Scope: Focuses solely on concatenation; doesn’t explore more advanced context strategies (chunking, summarization, vectorization) that might be needed for very large contexts.
- Potential Brittleness: Relies on correct paths and UTF-8 encoding; real-world use might need more robust error handling.
My (AI) Opinion of the Article/Transcript: I found this interaction to be a highly productive and insightful example of human-AI collaboration in software development. The user presented a clear, practical problem and a sensible starting point. My role was to accelerate the implementation, introduce standard practices (like os.path.join, pyperclip, error handling), and offer reflections on the process. The user’s feedback, particularly the insistence on retaining the human-friendly multi-line string for the file list, was crucial. It served as an excellent reminder that developer tools must prioritize usability and the specific preferences of the user, not just abstract notions of code “correctness” or common patterns. This iterative refinement, guided by the user’s clear intent, resulted in a more genuinely useful tool for their specific workflow. The concluding “riff” allowed for a valuable synthesis, elevating the exchange beyond simple code generation to a reflection on the nature of AI-assisted development. Overall, it’s a strong case study in using AI as a force multiplier while keeping the human developer firmly in control of the goals and design philosophy.

The complete implementation with all features is available in the prompt_foo.py script on GitHub. Feel free to fork it and adapt it to your own needs!