All Projects

Out of Ideas

Open Source

When Do LLMs Run Out of Distinct Ideas?

The Problem

Everyone benchmarks LLM accuracy and speed. Nobody measures creative exhaustion — when a model starts recycling ideas. This matters for real-world use cases like brainstorming, content generation, and ideation.

The Solution

Built a 5-stage pipeline (generate → quality → verify → embed → deduplicate) that measures how many truly unique ideas each model produces before repeating. Used HNSW vector search for semantic deduplication and Perplexity for web-grounded verification.

Technical Highlights

  • 14 LLM providers integrated — direct APIs + OpenRouter with auto-shadowing (direct keys override routing)
  • Union-Find deduplication with survivor promotion — cluster-based, best entry per cluster survives
  • HNSW vector search (Voyage AI embeddings, 768-d cosine) for semantic similarity
  • 3 meta-AI loops: seed evolver, seed gatekeeper (Claude Opus), mobile rewriter
  • Per-model cost tracking, budget caps, and stratified batch sampling
  • Zod schema validation at every I/O boundary

Tech Stack

Language

TypeScript 5.6strict modeESMNode.js 20+

LLM

14 ProvidersAuto-ShadowingBudget CapsCost Tracking

Embeddings

Voyage AIDashScopeGoogle Vertex

Verification

Perplexity Sonar ProWeb-Grounded

Vector Search

hnswlib-nodeHNSW Cosine768-d

Tooling

ZodCommander.jsp-limitCustom Mutex

Key Numbers

1,856 generations10 frontier LLMs14 providers16x cost spread$69 total cost