Skip to content

GEPA in Action

Discover how organizations and researchers are using GEPA to optimize AI systems across diverse domains. These examples showcase the versatility and impact of reflective prompt evolution.

Living Document

This page is continuously updated with new use cases from the community. Have a GEPA success story? Share it on Discord or Twitter/X!

Quick Navigation:


Enterprise & Production

  • DataBricks: 90x Cost Reduction


    DataBricks Enterprise Agents

    DataBricks achieved 90x cheaper inference while maintaining or improving performance by optimizing enterprise agents with GEPA.

    Key Results:

    • Open-source models optimized with GEPA outperform Claude Opus 4.1, Claude Sonnet 4, and GPT-5
    • Consistent 3-7% performance gains across all model types
    • At 100,000 requests, serving costs represent 95%+ of AI expenditure—GEPA makes this sustainable

    Read the full blog

  • OpenAI Cookbook: Self-Evolving Agents


    OpenAI Cookbook

    The official OpenAI Cookbook (Nov 2025) features GEPA for building autonomous self-healing workflows.

    What You'll Learn:

    • Diagnose why agents fall short of production readiness
    • Build automated LLMOps retraining loops
    • Combine human review, LLM-as-judge evaluations, and GEPA optimization

    View cookbook

  • HuggingFace Cookbook


    HuggingFace Cookbook

    Comprehensive guide on prompt optimization with DSPy and GEPA.

    What's Inside:

    • Setting up DSPy with language models
    • Processing mathematical problem datasets
    • Building Chain-of-Thought reasoning programs
    • Error-driven feedback optimization

    View cookbook

  • Google ADK Agents Optimization


    Google ADK Training

    Tutorial on optimizing Google Agent Development Kit (ADK) agents using GEPA for improved performance.

    Key Topics:

    • Optimizing agent SOPs (Standard Operating Procedures)
    • Integrating GEPA with ADK workflows
    • Production deployment patterns

    View tutorial

  • Comet-ml Opik Integration


    GEPA is integrated into Comet's Opik Agent Optimizer platform as a core optimization algorithm.

    Capabilities:

    • Optimize prompts, agents, and multimodal systems
    • Works alongside MetaPrompt, HRPO, Few-Shot Bayesian optimizers
    • Automates prompt editing, testing, and tool refinement

    View documentation


AI Coding Agents & Research Tools

  • Production Incident Diagnosis


    ATLAS Incident Diagnosis

    Arc.computer's ATLAS system uses GEPA-optimized agents to teach LLMs to diagnose production incidents.

    Application:

    • Automated root cause analysis (RCA)
    • Dynamic collection of logs, metrics, and databases
    • Reduces manual burden on on-call engineers

    Learn more

  • ATLAS Augmented: +142% Student Performance


    ATLAS Augmented

    GEPA can augment even RL-tuned models. The Intelligence Arc team uses GEPA in their ATLAS framework to improve an already powerful and RL-tuned teacher model.

    Key Result:

    • +142% student performance improvement when guided by the GEPA-improved teacher
    • Demonstrates that GEPA works alongside RL, not just as an alternative
    • Shows GEPA's value even for already-optimized models

    Read the technical blog

  • Data Analysis Coding Agents


    FireBird Auto-Analyst

    FireBird Technologies optimized their Auto-Analyst platform using GEPA for improved code execution.

    Architecture:

    • 4 specialized agents: Pre-processing, Statistical Analytics, Machine Learning, Visualization
    • Optimized 4 primary signatures covering 90% of all code runs
    • Tested across multiple model providers to avoid overfitting

    Read the article

  • Backdoor Detection in AI Code


    LessWrong Backdoor Detection

    GEPA enables AI control research by optimizing classifiers to detect backdoors in AI-generated code.

    Approach:

    • Trusted monitoring using weaker models
    • Classification based on suspicion scores
    • Safety measured by true positive rate at given false positive rate

    Read on LessWrong

  • AI Code Safety Monitoring


    Code Safety Monitoring

    GEPA enables monitoring safety of AI-generated code through optimized classifiers.

    Capabilities:

    • Detect potentially unsafe code patterns
    • Monitor code generation in real-time
    • Improve detection accuracy with reflective optimization

    Try the example

  • DeepResearch Agent


    A production-grade agentic research system combining LangGraph + DSPy + GEPA.

    Pipeline:

    • Query planning with diverse search queries
    • Parallel web search via Exa API
    • Summarization, gap analysis, and iterative research rounds
    • Module-specific GEPA optimization for each agent role

    View tutorial


Domain-Specific Applications

  • Healthcare Multi-Agent RAG


    Building multi-agent RAG systems for diabetes and COPD using DSPy and GEPA.

    System Design:

    • Two specialized subagents (disease experts)
    • Vector database search for medical documents
    • ReAct subagents individually optimized with GEPA
    • Lead agent for orchestration

    Read the guide

  • OCR Accuracy: Up to 38% Error Reduction


    OCR Intrinsic Labs

    Intrinsic Labs achieved significant OCR error rate reductions across Gemini model classes.

    Models Improved:

    • Gemini 2.5 Pro
    • Gemini 2.5 Flash
    • Gemini 2.0 Flash

    A grounded benchmark for document-understanding agents under operational constraints.

    Read the research

    Resources page

  • Market Research AI Personas


    Market Research Focus Groups

    Simulating realistic focus groups with GEPA-optimized AI personas for market research.

    Benefits:

    • Eliminates geographic constraints and facility costs
    • No moderator bias
    • Tests across different personality types
    • Research timelines: weeks → hours

    Learn more

  • Fiction Writing with Small Models


    Creative Writing

    Teaching Gemma3-1B to write engaging fiction through GEPA optimization.

    Demonstrates that small models can handle creative tasks with the right prompts.

    Read on Substack


Advanced Capabilities

  • Multimodal/VLM Performance (OCR)


    Multimodal OCR

    GEPA improves Multimodal/VLM Performance for OCR tasks through optimized prompting strategies.

    Try the example

  • Agent Architecture Discovery


    Architecture Discovery

    GEPA for automated agent architecture discovery - finding optimal agent designs through evolutionary search.

    View ARC-AGI tutorial

  • Adversarial Prompt Search


    Adversarial Prompt Search

    GEPA for adversarial prompt search - discovering edge cases and failure modes in AI systems.

    Advanced application for AI safety research

  • Unverifiable Tasks (Evaluator-Optimizer)


    Unverifiable Evaluator

    GEPA for unverifiable tasks using evaluator-optimizer patterns where ground truth is unavailable.

    View example


Research & Academic

  • Berkeley AI Summit: GEPA Deep Dive


    @matei_zaharia presents GEPA at Berkeley AI Summit, explaining how reflective prompt evolution works even with few rollouts.

    Key Insight:

    "FLOPs are getting cheaper, but rollouts for complex agentic tasks are not. The next frontier of AI will be limited by rollouts budget!"

    Watch the presentation

  • NeurIPS 2025 Workshop: 12.5% → 62.5% Gains


    NeurIPS Poster

    Veris.AI used GEPA in their RAISE framework to achieve 12.5% → 62.5% gains in task correctness accuracy, demonstrating GEPA's immediate practical impact for training reliable domain-specific AI agents through simulated environments.

    Key Results:

    • RAISE: Simulation-first experiential learning framework
    • GEPA prompt optimization for 4 epochs
    • Poster session at NeurIPS 2025 San Diego

    View the announcement

  • 100% on Clock Hands Problem


    Achieving perfect accuracy on the challenging clock hands mathematical reasoning problem using GEPA optimization.

    Application:

    • Complex spatial reasoning
    • Mathematical problem-solving
    • Demonstrates GEPA on hard reasoning tasks

    Try the notebook


Media & Press Coverage

  • VentureBeat: GEPA Optimizes LLMs Without Costly RL


    VentureBeat coverage of GEPA's approach to optimizing LLMs without expensive reinforcement learning.

    Highlights:

    • Explains reflective prompt evolution to a broader audience
    • Discusses cost and efficiency benefits
    • Industry perspective on GEPA's impact

    Read the article

  • DAIR.AI: Top AI Papers of the Week


    GEPA featured in DAIR.AI's "Top AI Papers of The Week" roundup, alongside other breakthrough research.

    Recognition:

    • Listed among Graph-R1, AlphaEarth, Self-Evolving Agents
    • Highlighted natural language reflection approach

    View DAIR.AI newsletter

  • DSPy Weekly Newsletter


    GEPA regularly featured in the DSPy Weekly newsletter, tracking adoption and new use cases.

    Coverage:

    • Issue #4: "GEPA is 🌶️🔥 and on a hype 🚄 as people discover GEPA"
    • Regular updates on community applications

    Read DSPy Weekly

  • LinkedIn AI Talk: Automatic Prompt Optimization


    Vaibhav Gupta (CEO @ Boundary / BAML) provides a detailed GEPA tutorial, first explaining the algorithm and then walking through a code example.

    What's Covered:

    • GEPA algorithm explanation
    • Step-by-step code walkthrough
    • Practical implementation guidance

    Watch the event


Emerging Applications

  • The State of AI Coding 2025


    Greptile State of AI Coding

    GEPA was highlighted in Greptile's comprehensive State of AI Coding 2025 report as a key advancement in AI coding capabilities.

    Key Insight:

    GEPA evolves prompts via trace analysis, matching RL performance with far fewer rollouts—making it ideal for coding agent optimization.

    Read the report

  • Model Migration Workflows


    GEPA is proving valuable for migrating existing LLM-based workflows to new models across model families.

    Pattern:

    • Keep your DSPy program structure
    • Change only the LM initialization
    • Re-run GEPA optimization for the new model
    • Much faster than manually re-tuning prompts

    This is especially useful as new models are released and organizations need to migrate quickly.

  • Evaluator-Optimizer Pattern


    Evaluator Optimizer

    @hammer_mt shares the powerful Evaluator-Optimizer pattern for fuzzy generative tasks where evals are informal and subjective.

    Use Case:

    • Creative writing tasks
    • Persona generation
    • Tasks without ground-truth labels

    Watch the talk

  • Program Synthesis & Kernel Optimization


    GEPA shows promise for program synthesis tasks:

    Applications:

    • CUDA kernel optimization
    • AMD NPU kernel generation
    • Outperforms RAG and iterative refinement (Section 6 of paper)

    Especially valuable for tasks with expensive rollouts (simulation, long runtime).

  • GPU Parallelization (OpenACC)


    GPU Optimization

    Jhaveri & @cristalopes applied GEPA to GPU optimization, targeting OpenACC parallelization.

    Results:

    • Boosted GPT-5 Nano to generate pragmas improving compilation success from 87% → 100%
    • Models saw up to 50% increase in # functional GPU speedups

    Demonstrates GEPA's applicability to code synthesis beyond prompts.

  • Material Science Applications


    GEPA being explored for material science workflows where simulations are costly.

    Why GEPA:

    • High sample efficiency
    • Works with expensive evaluation functions
    • Can optimize simulation parameters

    Exploratory use case from the research community

  • Continuous Learning & Self-Improvement


    GEPA enables continual learning patterns:

    Emerging Pattern:

    1. Deploy optimized agent
    2. Collect feedback from production
    3. Batch feedback and re-optimize
    4. Redeploy improved agent

    Works alongside RL (see BetterTogether paper) for even better results.

  • Letta: Continual Learning in Token Space


    Letta's blog post explores continual learning in token space, discussing how GEPA and similar approaches enable agents to learn and improve over time.

    Concepts:

    • Memory-augmented agents
    • Long-term learning patterns
    • Token-space optimization

    Read the blog


Community Integrations

  • Weaviate Podcast #127: Deep Dive on GEPA


    Comprehensive podcast episode covering GEPA in depth with Lakshya A. Agrawal.

    Topics Covered:

    • Natural Language Rewards
    • Reflective prompt evolution principles
    • Production deployment patterns

    Listen to podcast

  • Weaviate GEPA Hands-On Notebook


    Interactive notebook demonstrating GEPA for reranking optimization in RAG pipelines.

    What's Inside:

    • End-to-end GEPA optimization
    • Integration with Weaviate vector store
    • Practical reranking examples

    View notebook

  • LangStruct GEPA Examples


    Strong examples demonstrating GEPA's effectiveness with Gemini Flash and other models.

    Explore examples

  • GEPA in Go


    Full Go implementation of DSPy concepts including GEPA optimization.

    Features:

    • Native Go implementation
    • MIT licensed
    • Includes CLI tools and examples

    View on GitHub

  • Observable JavaScript


    Observable JavaScript

    Interactive JavaScript notebooks exploring GEPA for web-based optimization.

    By Tom Larkworthy (Tech Lead, formerly Firebase/Google)

    Explore reflective prompt evolution directly in your browser.

    Try it on Observable

  • Context Compression


    Experiments using GEPA for context compression to reduce token usage while maintaining quality.

    Explore novel approaches to efficient prompt engineering.

    View experiments

  • bandit_dspy


    DSPy library for security-aware LLM development using Bandit principles.

    Part of the EvalOps ecosystem for AI evaluation and development tools.

    Explore on GitHub

  • SuperOptiX-AI


    SuperOptiX uses GEPA as its framework-agnostic optimizer across multiple agent frameworks including DSPy, OpenAI SDK, CrewAI, Google ADK, and more.

    Explore SuperOptiX

    Read the blog post


Infrastructure & DevOps

  • Multi-Cloud Data Transfer Cost Optimization


    Multi-Cloud Data Transfer

    The ADRS team used GEPA to minimize multi-cloud data transfer costs.

    Results:

    • GEPA autonomously evolved a naive replication strategy into a sophisticated "shared-tree" topology
    • 31% cost reduction with just $5 of optimization spend
    • Demonstrates GEPA's ability to optimize complex infrastructure configurations

    View research

  • Sales Support Multi-Agent Routing


    Databricks Sales Support

    Databricks used GEPA to optimize a sales-support multi-agent system's routing component.

    Key Results:

    • 75% relative gains in routing accuracy
    • Demonstrates multi-agent orchestration optimization
    • Production-ready deployment patterns

    Read the blog

  • Self-Improving Agent Systems (GEPA + TRM)


    Building self-improving AI agents that combine GEPA with TRM (Test-time Reasoning Modification) for both orchestration optimization and reasoning enhancement.

    Architecture:

    • GEPA for orchestration/prompt optimization
    • TRM for reasoning enhancement
    • Continuous monitoring and feedback loops
    • Automated retraining without human intervention

    Read the guide


Creative & Generative Applications

  • AI Voice/Persona Discovery


    GEPA's multi-objective guided optimization can find an authentic "AI voice" using an 8-dimensional score representing different voice characteristics.

    Dimensions Optimized:

    • Point of view
    • Authority level
    • Cadence and rhythm
    • And 5 more characteristics

    Read the guide

  • Human-Like Response Generation


    GEPA+DSPy can optimize AI to generate human-like responses, passing sophisticated detection systems.

    Application:

    • More natural conversational AI
    • Better user engagement
    • Authentic persona maintenance

    Community-reported application

  • Non-Obvious GEPA Insights


    Non-Obvious GEPA Insights

    Deep dive into non-obvious lessons learned from practical GEPA usage, covering edge cases, unexpected behaviors, and advanced patterns.

    Read the blog


Data Processing & Synthesis

  • Synthetic Data Generation


    Use GEPA to optimize query generation pipelines for creating high-quality synthetic datasets.

    Example: Sanskrit NLP

    • GEPA+DSPy optimizes a query generation pipeline
    • Differentiates between document pairs
    • Generated 50k samples for Gemma embedding fine-tuning

    View project

  • Text2SQL Optimization


    GEPA has been successfully used for Text2SQL tasks with a system prompt/user prompt breakdown.

    Pattern:

    • System prompt specifies the task (evolved by GEPA)
    • User prompt contains dynamic content
    • Alternatively: use DSPy signature for text2sql

    See SQL Generator tutorial

  • Enterprise Agents Blog


    Enterprise Agents

    Building enterprise agents for real-world workflows with GEPA: tackling unstructured data, task decomposition, and context blowup.

    Key Topics:

    • Modular agent design
    • Low-data optimization strategies
    • Cost-effective deployment

    Read the blog


Community Tutorials & Guides

  • DSPy 3 + GEPA: Advanced RAG Framework


    Comprehensive guide on building powerful AI agents with DSPy 3 and GEPA.

    What's Covered:

    • Auto reasoning and prompting
    • Step-by-step agent building
    • Professional-level RAG optimization

    Read the guide

  • Teaching AI to Spot Fake XKCD Comics


    Teaching AI to Spot Fake XKCD

    Fun, accessible explanation of GEPA concepts with XKCD-inspired visualizations.

    Read the blog

  • 20% Improvement in Structured Extraction with DSPy + GEPA


    DSPy Optimization

    Achieving 20+ percentage-point improvement in exact match accuracy for structured extraction tasks using DSPy and GEPA.

    Key Insight:

    The benefit is not only improved performance, but that optimization allows transferring capability to cheaper models while retaining acceptable accuracy, improving the cost profile of applications.

    Read the guide

  • GEPA Impact Analysis: 81% → 90% Accuracy


    Practical analysis achieving 81% → 90% accuracy on sales call transcript analysis in just 3 hours and ~$0.50.

    Key Insight:

    "I stopped thinking of prompts as things I write and started thinking of them as things I evolve. My job shifted from 'craft the perfect prompt' to 'define what good looks like and let the system find it.'"

    GEPA's genetic mutations work best with precise feedback—targeted feedback like "you're conflating greetings with rapport" produces targeted fixes.

    Read the analysis

  • Lakshya's GEPA Blog


    Personal blog post explaining GEPA concepts and applications.

    Read the blog

  • GEPA for De-identification


    Tutorial on using GEPA for PII de-identification tasks with DSPy.

    Read the tutorial

  • SQL Generator with GEPA


    Building optimized Text2SQL systems with DSPy and GEPA.

    Read the tutorial


International Coverage

GEPA has gained significant attention in the global AI community, with tutorials, blogs, and discussions in multiple languages.


Get Started

Ready to optimize your own AI systems with GEPA?

  • Quick Start Guide


    Get up and running with GEPA in minutes.

    Start here

  • Create Custom Adapters


    Integrate GEPA with your specific system.

    Learn adapters

  • API Reference


    Complete documentation of all GEPA components.

    View API

  • Join the Community


    Connect with other GEPA users and contributors.

    Discord Slack