Quickstart - Delve

Get up and running with Delve in just a few minutes. Choose your preferred interface below.

Installation

pip install delve-taxonomy

Set API Key

Delve uses Claude for taxonomy generation. Set your Anthropic API key:

export ANTHROPIC_API_KEY="your-api-key-here"

Get your API key from the Anthropic Console

Run Your First Taxonomy Generation

Process a CSV file with text data:

delve run data.csv --text-column conversation

Results will be saved to the ./results/ directory with taxonomy, labeled documents, and reports.

View Results

Check your output directory for these files:

taxonomy.json - Machine-readable taxonomy
labeled_documents.json - Documents with categories
labeled_data.csv - Spreadsheet format
report.md - Human-readable summary

Next Steps

Try different data sources
Customize with CLI options
See more examples

Installation

pip install delve-taxonomy

Set API Key

Set your Anthropic API key as an environment variable:

export ANTHROPIC_API_KEY="your-api-key-here"

Get your API key from the Anthropic Console

Basic Usage

Create a Python script with this code:

from delve import Delve

# Initialize Delve client
delve = Delve(sample_size=100)

# Run taxonomy generation
result = delve.run_sync("data.csv", text_column="conversation")

# Access results
print(f"Generated {len(result.taxonomy)} categories")
for category in result.taxonomy:
    print(f"- {category.name}: {category.description}")

Results are automatically saved to ./results/ and returned as a DelveResult object.

Access Results

The DelveResult object provides easy access to all outputs:

# Access taxonomy
for category in result.taxonomy:
    print(f"{category.name}: {category.description}")

# Access labeled documents
for doc in result.labeled_documents:
    print(f"{doc.id} → {doc.category}")

# Access metadata
print(f"Processed {result.metadata['total_documents']} documents")

# Get file paths
print(result.export_paths['taxonomy'])  # Path to taxonomy.json

Next Steps

Common Use Cases

CSV Files

Process customer feedback, support tickets, or survey responses from CSV files.

JSON Data

Handle API responses, logs, or structured data with JSONPath support.

DataFrames

Work directly with pandas DataFrames for in-memory processing.

LangSmith

Analyze LangSmith project runs to categorize LLM interactions.

Quick Alternative: Binary Detection

If you already know the single category you’re looking for, use find_matches() for faster results:

from delve import Delve

# Find all refund-related traces in seconds, not minutes
result = Delve.find_matches(
    "data.csv",
    category={
        "name": "Refund Request",
        "description": "User asking for refund or money back",
        "keywords": ["refund", "money back", "cancel"],
    },
    text_column="content",
)

print(f"Found {result.stats['matches']} matches")

Binary detection is ~10x faster and ~5x cheaper than full taxonomy generation. Use it when you know what you’re looking for. See Binary Detection for details.

What Happens During Processing?

Sampling - Delve samples your dataset (default: 100 documents)
Summarization - Each document is summarized using Claude Haiku
Clustering - Documents are grouped into minibatches and analyzed iteratively
Taxonomy Generation - Categories are discovered and refined across batches
Validation - The final taxonomy is reviewed for quality
Labeling - All documents are categorized with explanations
Export - Results are saved in multiple formats

For large datasets, Delve automatically samples documents to ensure efficient processing while maintaining representative coverage.

Need Help?

Check the CLI Reference for all command options
See the SDK Reference for API details
Browse Examples for common patterns
Read the full Installation Guide for advanced setup

​Installation

​Set API Key

​Run Your First Taxonomy Generation

​View Results

​Next Steps

​Installation

​Set API Key

​Basic Usage

​Access Results

​Next Steps

​Common Use Cases

CSV Files

JSON Data

DataFrames

LangSmith

​Quick Alternative: Binary Detection

​What Happens During Processing?

​Need Help?

Installation

Set API Key

Run Your First Taxonomy Generation

View Results

Next Steps

Installation

Set API Key

Basic Usage

Access Results

Next Steps

Common Use Cases

Quick Alternative: Binary Detection

What Happens During Processing?

Need Help?