Pathfinder gap-report remediation: hybrid search + docs repoint + ag-ui sources + chunker/durability#93
Merged
Conversation
…point docs source to shell-docs tree
…windowed analytics
…ctness and content integrity Unify heading and fence detection behind one CommonMark-correct predicate, make overlap and the line-split fallback fence-aware, normalize CRLF and re-normalize inlined snippet bytes before chunking, and bound the from-import brace so half-open fences, severed multi-backtick spans, and unclosed imports can no longer corrupt served chunk text. De-duplicate identical snippet import lines and strip every copy on inline. Add a comprehensive chunker-invariant oracle covering split completeness and fenced-content preservation.
…eries correct Hold the state token on transient incremental read failures and propagate index failures instead of advancing over or deleting unindexed items, make chunk delete+upsert atomic, run deletion detection before the no-matching-changes short-circuit, fix path:'.' indexing data-loss, include file size in the change-detection hash so mtime-preserving edits reindex, and clear stale chunks on zero-chunk items. Guard FAQ confidence casts with rollback, fetch FAQ metadata by result id, over-fetch before the confidence filter, and order FAQ browse by global recency. Coerce p95 latency and webhook by_decision counts, surface stat and extract-fallback failures, and span all data for the All time range. Add coverage across pipeline, file-provider, state-token, analytics, knowledge, and schema.
Span all data in the All time view, cap the range-mode per-day series, and exclude the browse sentinel so the dashboard charts match the corrected analytics queries.
…filter fixture Exclude the ag-ui .md sources whose derived URLs 404 from the docs deploy config, and update the test-path-filter fixture to match the current code excludes and docs repoint.
2aafd40 to
cc5c8c5
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What this PR actually is
This is the Pathfinder gap-report remediation (the 30-day docs-MCP gap analysis). Merging it deploys to the live docs-MCP (
mcp.copilotkit.ai) via push-to-main → Docker → Railway, and runs a small additive DB migration. It bundles the engine-code fixes and thedeploy/copilotkit-docs.yamlconfig quick-wins — so merging flips production behavior, not just code.Config quick-wins (
deploy/copilotkit-docs.yaml)search_mode: hybrid) +min_score: 0.3floor — the report's Add collect tool type #1 lever (prod was pure-vector).docs/content/docs/tree to the liveshowcase/shell-docs/src/content/docs/(+strip_prefix/webhook.path_triggers).examples/**,showcase/**,**/.next/**,**/*.d.ts(was ~79% boilerplate).integrations/**(Python adapters),sdks/python/,sdks/community/(JVM + ports); excludegenerated/+*.pb.*..mdxdocs files that derived 404 URLs.Engine changes (
src/)<Snippet/>inlining (snippets.ts) — recovers snippet-composed pages (v2 Migration Guide, etc.).title+headingPathalongside content (pipeline.ts), not content-only.X-Pathfinder-Source→query_log.request_source+session_id) — anti-self-inflation groundwork. Migration isADD COLUMN IF NOT EXISTS+CREATE INDEX IF NOT EXISTSon the append-onlyquery_log(additive, idempotent, online-safe).After merge
deploy-health-check(auto on push-to-main) +index-health-monitor(4h cron).Not included (follow-up)