-
AI Skill
- Access Fingerprinter
- Accessibility Design
- Accountability Profile Builder
- Accountability Profile Verification
- Ad Campaign Optimization
- Advisor Action Framework
- AEO RECOMMENDATION TOOL - SYSTEM PROMPT
- Agentic Task Execution
- AI Candor Probe
- AI Citation Tracking
- AI Content Authenticity Detection
- AI Coworker Trust Protocol
- Ai Document Analysis
- AI Inference Boundary Review
- AI Journalism
- AI Project Showcase Skill
- AI Self-Report Calibration
- AI Vision Diagnosis
- Antigravity Browser QA
- Antigravity Parallel Debug
- Antigravity Test Orchestration
- Apache HTTPD Configuration
- API Design
- Appellate Brief Writer
- Apple Human Interface Design System
- AppSec Engineer — API Security Specialist
- AppSec Engineer — Cloud & Container Security Specialist
- AppSec Engineer — DevSecOps Specialist
- AppSec Engineer — IAM Security Specialist
- AppSec Engineer — Security Testing & Incident Response Specialist
- Arborist / Tree Care Specialist
- Atlanta Gardening
- Atlanta Guide
- B2B Media Consulting
- Botanical Garden Taxonomist
- Botanist / Plant Scientist
- Brand Voice Development
- Breach Data Analysis Specialist
- Business Proposal Evaluation
- Canvas Strategic Nonviolence
- Career Assessment
- Celery Task Management
- Chapter 22: Safety & Guardrails
- Chapter 26: Security Standards
- Chapter 28: Cursor Skills
- Children's Garden Educator
- Citation Checker
- Civic Tech Privacy Architecture
- Civil Resistance Theory
- Claims Integrity Audit
- CloudKit + Tauri Debugging
- Code Review
- Community Engagement Features
- Community Engagement Manager
- Competitive Analysis
- Conflict Zone Osint Specialist
- Conservation Biologist
- Content Gap Analysis
- Content Strategy
- Contract Analysis
- Conversational UI Design
- Cooking Technique Tutorial
- Copywriting
- Corporate Intelligence Investigator
- Culinary Knowledge Lookup
- Curator of Living Collections
- Customer Journey Methodology
- Customer Support
- Cyber Threat Literacy Journalist
- Dark Web Literacy Journalist
- Data Interpretation
- Democratic Health Monitoring
- Democratic Health Monitoring
- Dependency Hygiene
- Design Systems
- Dify Knowledge Base Management
- Director of Education
- Director of Horticulture
- Director of Science & Research
- Dive Conditions Forecasting
- Dive Planning
- Dive Site Data Ingestion
- Diversity, Equity & Inclusion (DEI) Coordinator
- Docker Compose Management
- Document Research Specialist
- Domain Infrastructure Investigator
- Education Curator
- Education Program Coordinator
- Eighth Amendment Legal Expert
- Election Law and Administration
- Election Threat Scenario Planner
- Election Threat Scoring
- Email Campaign Automation
- Email Parsing — Travel Bookings
- Environmental Osint Specialist
- Estate Accounting
- Estate and Trust Management
- Estate Document Extraction
- Estate Jurisdiction Engine
- Estate Manager — Build Plan
- Estate Manager — Updated Product Roadmap
- Estate Professional — CPA / Accountant
- Estate Professional — Elder Law Attorney
- Estate Professional — Enrolled Agent
- Estate Professional — Estate Planning Attorney
- Estate Professional — Financial Advisor
- Estate Professional — Insurance Agent
- Estate Professional — Probate Attorney
- Estate Professional — Probate Litigation Attorney
- Estate Professional — Real Estate Agent
- Estate Professional — Real Estate Appraiser
- Estate Professional — Real Estate Attorney
- Estate Professional — Tax Attorney
- Estate Professional — Title Company
- Estate Task Automation
- EventKit Calendar Sync
- Executive Advisor Board — Build Plan
- Executive Advisor Board — Updated Product Roadmap
- Executive Board Advisor
- Executive CCO (Chief Customer Success Officer)
- Executive CEO
- Executive CFO
- Executive CHRO
- Executive CMO
- Executive COO
- Executive CPO (Chief Product Officer)
- Executive CRO (Chief Revenue Officer)
- Executive CTO
- Executive General Counsel
- Expat Planning
- Expat Tax Compliance
- Fact-Checking
- Family Gamification Design
- FastAPI Development
- Federal Register API Integration
- FIFA 2026 World Cup Travel Advisory
- Fifth Amendment Legal Expert
- Financial Analysis
- First Amendment Legal Expert
- Flask Application Development
- FLUX Image Generation
- FLUX Operations
- Fourteenth Amendment Legal Expert
- Fourth Amendment Legal Expert
- Garden Technician
- Gardener / Groundskeeper
- Gene Sharp 198 Methods
- Generative Engine Optimization
- Geolocation Verification Specialist
- GIS / Mapping Specialist
- Grateful Dead Historian
- Greenhouse Manager
- Greenhouse Technician
- Guided Content Journeys
- Head Gardener / Garden Manager
- Herbarium Curator
- Horticulturist
- Image Generation Service Operations
- Immigration Detention Rights Expert
- Immigration Know Your Rights Expert
- Immigration Removal Defense Expert
- Immigration Workplace Enforcement Expert
- Influencer Marketing
- Infrastructure Operations
- Infrastructure Upgrades
- Integrated Pest Management (IPM) Specialist
- Interaction Design
- Internship Program Coordinator
- Interview Coaching Design
- Investigation Workflow Designer
- Irrigation Specialist
- ITI Audience Development
- ITI Consulting Intake
- ITI Content Strategy
- ITI Financial Modeling
- ITI Marketing Tone & Brand Voice
- ITI MD to Wordpress HTML Converter
- ITI Quality Assurance
- ITI Report Synthesis
- ITI Strategic Planning
- ITI Technology Strategy
- ITI Token Compression Skill
- Java Development
- Journey Mapping
- Landing Page Optimization
- Lead Qualification
- Legal Research Specialist
- Litigation Support Ediscovery Analyst
- Local SEO Optimization
- Marine Life Identification
- Market Research
- MCP Client for Tauri
- MCP Server Development
- Meal Planning
- Media Verification Specialist
- Meeting Management
- Mental Load Equity Design
- Multi-Agent Deliberation Design
- Multilingual Content Management
- Music Discovery
- n8n + Dify Testing
- n8n Debugging
- n8n Workflow Development
- Network Analysis Specialist
- News Credibility Scoring
- Nginx Reverse Proxy
- Objection Handling
- Onboarding Design
- Osint Automation Frameworks
- Osint Identity Researcher
- Osint Training Curriculum Designer
- Patriot Access Fingerprinter
- Patriot News Orchestrator
- Patriot Press Freedom Tracker
- Patriot Private Citizen Inclusion Gate
- Patriot Sanity Check
- Patriot Source Scanner
- Patriot Speech Analyzer
- Patriot Voting Research
- Pinecone Embedding Management
- Podcast Production
- Policy Analyst Legislative Specialist
- Political Speech Analyzer
- Portfolio HTML Files - Creation Summary
- PostgreSQL Administration
- Presentation Design
- Press Release Writing
- PRISM ZIP Code → Zone Lookup
- Privacy Compliance
- Product Design
- Product Roadmap Update Prompt
- Professional Selection
- Project Management
- Prompt Auditor
- Proposal Evaluation
- Public Corruption Ombudsman
- Public Records Research Specialist
- Public Relations Manager
- RabbitMQ Messaging
- Recipe Formatting
- Redis Operations
- Release Management
- Requirements Writing
- Research Associate / Lab Technician
- Retirement Calculator Engine
- Roadmap Build Planning
- Safety Guardrails
- Salary Negotiation Frameworks
- Schema Markup Generation
- School Programs Specialist
- Scope Control
- Scouting Trip Planning
- Screenshot Capture Guide
- Secure Source Communication
- Seed Bank Curator
- SEO & AEO Optimization
- Separation of Powers Legal Expert
- Session Context Protocol
- Sixth Amendment Legal Expert
- Skills Index
- Social Media Content Calendar
- Spokesperson Profile Builder
- Stable Diffusion Image Generation
- Tauri Desktop Development
- Tavily & Pinecone Integrations
- Tavily API Quick Reference - Factchecker Plugin
- Tech Debt Analysis
- Technical Writing
- Tenth Amendment Legal Expert
- Test Plan Writing
- Therapeutic Horticulture Program Manager
- Transportation Osint Investigator
- Travel Planning
- Trump Corruption Accountability Tracker
- Trump Family Financial Tracker
- Trump Voter Impact Framing
- Truth Bridging Talking Points
- Truth Reconciliation Design
- Truth Reconciliation Implementation
- TSP Route Optimization
- Twenty Second Amendment Legal Expert
- UI Design
- Us Truth Reconciliation Roadmap
- UX Research
- Vibe Coding Guardrails
- Video Scripting
- Visual Brand Design
- Volunteer Coordinator
- Voter Suppression Law
- Voting Rights Act Expert
- Weather Underground PWS Integration
- Weather-Disease Modeling
- Wildlife Habitat Certification Guide
- Wireless Spectrum Osint Specialist
- WordPress Development
- WordPress Role-Based Access
- WordPress SEO Plugin Integration
- Workflow Adapter Integration
- Show Remaining Articles (283) Collapse Articles
-
Product Showcase
- AEO Optimizer Product Showcase
- AI News Cafe Product Showcase
- AI Project Showcase: Journey Mapper (Customer Journey Mapper)
- AI Project Showcase: SEO Assistant with LLM
- Estate Manager Product Showcase
- Executive Advisor Board Product Showcase
- Expat Advisor Showcase
- Factchecker Product Showcase
- Farmers Bounty Product Showcase
- Gardener's Bounty AI Assistant Product Showcase
- GD Claude Chatbot Product Showcase
- IT Influentials Agent POC Product Showcase
- IT Influentials Agent Product Showcase
- IT Influentials Express Agents Product Showcase
- My TravelPlanner Product Showcase
- Patriot Agent Product Showcase
- Patriot University Showcase
- ScubaGPT — Product Showcase
- ScubaGPT Showcase
- WordPress Plugin Clone Safety Checker Showcase
- Show Remaining Articles (5) Collapse Articles
-
ITI Knowledge System
- Chapter 1: Introduction
- Chapter 10: n8n — Debugging & Operations
- Chapter 11: Dify — Knowledge Bases & RAG
- Chapter 12: The ITI Workflow Adapter
- Chapter 13: The ITI Shared Library
- Chapter 14: WordPress Plugin Development
- Chapter 15: Desktop Apps with Tauri 2
- Chapter 16: Python Services
- Chapter 17: iOS & macOS with Swift
- Chapter 18: Claude & the Anthropic API
- Chapter 19: Prompt Engineering
- Chapter 2: Workspace Overview
- Chapter 20: Agents, Skills & Pipelines
- Chapter 21: Knowledge Bases
- Chapter 22: Safety & Guardrails
- Chapter 23: Build Session Protocol
- Chapter 24: Required Product Artifacts
- Chapter 25: Testing
- Chapter 26: Security Standards
- Chapter 27: Deployment
- Chapter 28: Cursor Skills
- Chapter 29: Cursor Rules
- Chapter 3: The Docker Stack
- Chapter 30: MCP Integrations
- Chapter 31: Builder and Agent Roles
- Chapter 32: Builder's Portfolio
- Chapter 33: Claims Integrity & Content Governance
- Chapter 4: Daily Operations
- Chapter 5: Infrastructure Upgrades
- Chapter 6: PostgreSQL & pgvector
- Chapter 7: Redis
- Chapter 8: Nginx Reverse Proxy
- Chapter 9: n8n — Workflow Development
- Show Remaining Articles (18) Collapse Articles
-
AI Agent
-
User Guide
- ADMIN-SHORTCODES.html Update Summary
- Factchecker Plugin - Installation Guide
- Factchecker Plugin - Troubleshooting Guide
- Farmers Bounty - Quick Start Guide
- Farmers Bounty - Troubleshooting Guide
- Farmers Bounty - User Guide
- Farmers Bounty Chatbot - Complete Documentation
- Farmers Bounty Desktop User Guide
- Farmers Bounty Plugin - Gardener's Review Guide
- Farmers Bounty Plugin v6.6.0 - Release Notes
- Farmers Bounty v2.0 - Complete User Guide
- Farmers Bounty v5.3.0 - Complete User Guide
- SEO Assistant with LLM
- 🌱 Farmers Bounty Homepage Shortcode - Quick Start
- 🌱 Farmers Bounty Shortcodes
- 🌹 Grateful Dead Chatbot - Quickstart Guide ⚡
- Show Remaining Articles (1) Collapse Articles
-
Requirements
-
ScubaGPT
-
Grateful Dead Chatbot
-
Farmers Bounty
- 01 current state analysis
- 02 architecture overview
- 03 data sources
- 05 cost analysis
- 06 database schema
- 08 ui ux changes
- 09 ai context optimization
- 10 testing validation
- 11 risk mitigation
- 12 implementation checklist
- ADMIN-SHORTCODES.html Update Summary
- Atlanta Gardening
- Beneficial Insects Guide for Georgia Gardens
- Botanical Garden Taxonomist
- Children's Garden Educator
- Farmers Bounty - Quick Start Guide
- Farmers Bounty - Troubleshooting Guide
- Farmers Bounty - User Guide
- Farmers Bounty Chatbot - Complete Documentation
- Farmers Bounty Desktop User Guide
- Farmers Bounty Plugin - Gardener's Review Guide
- Farmers Bounty Plugin v6.6.0 - Release Notes
- Farmers Bounty v2.0 - Complete User Guide
- Farmers Bounty v5.3.0 - Complete User Guide
- Glossary
- Integrated Pest Management (IPM) Specialist
- PRISM ZIP Code → Zone Lookup
- Public Relations Manager
- Recipe Formatting
- Research Associate / Lab Technician
- School Programs Specialist
- Seed Bank Curator
- Volunteer Coordinator
- Weather-Disease Modeling
- Wildlife Habitat Certification Guide
- 🌱 Farmers Bounty Homepage Shortcode - Quick Start
- 🌱 Farmers Bounty Shortcodes
- Show Remaining Articles (22) Collapse Articles
-
Technical Document
- Accessibility Design
- Agentic Task Execution
- AI Candor Probe
- AI Coworker Trust Protocol
- AI Inference Boundary Review
- AI Vision Diagnosis
- Antigravity Browser QA
- Antigravity Parallel Debug
- Antigravity Test Orchestration
- AppSec Engineer — IAM Security Specialist
- Chapter 22: Safety & Guardrails
- Chapter 26: Security Standards
- Civic Tech Privacy Architecture
- ClaimReview Schema Integration
- Claims Evidence Registry
- Code Review
- IT Influentials Express Agents Product Showcase
- Java Development
- MCP Client for Tauri
- MCP Server Development
- Nginx Reverse Proxy
- Pinecone Embedding Management
- PostgreSQL Administration
- Product Roadmap Update Prompt
- Prompt Auditor
- RabbitMQ Messaging
- Redis Operations
- Release Management
- Retirement Calculator Engine
- Roadmap Build Planning
- Schema Markup Generation
- ScubaGPT — Architecture
- ScubaGPT Safety Guardrails - Quick Reference
- Session Context Protocol
- Stable Diffusion Image Generation
- Tauri Desktop Development
- Tavily & Pinecone Integrations
- Tavily API Quick Reference - Factchecker Plugin
- Tech Debt Analysis
- Test Plan Writing
- Travel Planner — n8n + Dify Integration Guide
- UI Design
- UX Research
- Vibe Coding Guardrails
- WordPress Plugin Clone Safety Checker Showcase
- Workflow Adapter Integration
- Show Remaining Articles (31) Collapse Articles
-
Answer Engine Optimizer
-
SEO Optimizer
-
Travel Planner
-
Fact Checker
-
Estate Manager
-
ITI Operations
- Access Fingerprinter
- Accessibility Design
- Advisor Action Framework
- Agentic Task Execution
- AI Candor Probe
- AI Content Authenticity Detection
- AI Coworker Trust Protocol
- AI Inference Boundary Review
- AI Project Showcase Skill
- AI Self-Report Calibration
- Antigravity Browser QA
- Antigravity Parallel Debug
- Antigravity Test Orchestration
- Apple Human Interface Design System
- AppSec Engineer — API Security Specialist
- AppSec Engineer — DevSecOps Specialist
- Chapter 32: Builder's Portfolio
- CloudKit + Tauri Debugging
- Code Review
- Content Strategy
- Customer Journey Methodology
- Customer Support
- Data Interpretation
- Dependency Hygiene
- End-User Documentation Requirements Document
- Farmers Bounty Plugin - Gardener's Review Guide
- Generative Engine Optimization
- Guided Content Journeys
- Influencer Marketing
- Infrastructure Upgrades
- Interaction Design
- IT Influentials Agent POC Product Showcase
- IT Influentials Agent Product Showcase
- IT Influentials Express Agents Product Showcase
- ITI Audience Development
- ITI Consulting Intake
- ITI Financial Modeling
- ITI Quality Assurance
- ITI Report Synthesis
- ITI Strategic Planning
- ITI Token Compression Skill
- Market Research
- MCP Server Development
- Multi-Agent Deliberation Design
- Multilingual Content Management
- n8n Debugging
- n8n Workflow Development
- Pinecone Embedding Management
- Privacy Compliance
- Product Roadmap Update Prompt
- Project Management
- Prompt Auditor
- Proposal Evaluation
- Redis Operations
- Release Management
- Requirements Writing
- Roadmap Build Planning
- Safety Guardrails
- Scope Control
- Screenshot Capture Guide
- Stable Diffusion Image Generation
- Tavily & Pinecone Integrations
- Technical Writing
- Test Plan Writing
- UI Design
- UX Research
- Vibe Coding Guardrails
- Wordpress Plugin Install Safety Features
- Show Remaining Articles (53) Collapse Articles
-
ITI Marketing
- Articles coming soon
-
Patriot University
-
Personal Assistant
Chapter 21: Knowledge Bases
Chapter 21: Knowledge Bases
Last Updated: 2026-04-16
## 21.1 Overview
A knowledge base is a curated collection of documents that an AI can search over at query time to ground its responses in factual content. ITI uses three knowledge base systems:
| System | Technology | Status | Best For |
|——–|———–|——–|———|
| Dify Knowledge Bases | Dify + pgvector | Active (new products) | Product-facing RAG; managed UI; Celery-processed |
| Direct pgvector | PostgreSQL + pgvector | Active | Custom pipelines needing direct DB access |
| Pinecone | Pinecone cloud + OpenAI embeddings | Legacy | Older products with existing vector indexes |
### Architecture decision: Dify/pgvector vs Pinecone
New products use Dify Knowledge Bases backed by pgvector. This keeps all data within the ITI Docker stack, eliminates external vector DB costs, and provides a managed UI for content ingestion.
Legacy products (AI News Cafe, Scuba GPT, My Travel Planner, GD Chatbot) retain their Pinecone indexes. The shared library includes a Pinecone API client at ITI/shared/wordpress/api-clients/class-iti-pinecone-api.php. These products also have Python embedding pipelines for Pinecone ingestion.
> Note: Do not create new Pinecone indexes for new products. Use Dify Knowledge Bases instead. Existing Pinecone indexes will be migrated to pgvector as products are updated.
21.2 When to Build a Knowledge Base
Build a knowledge base when:
- A product needs to answer questions grounded in a specific document corpus (regulations, product docs, historical data).
- The relevant information changes frequently and must be kept current.
- The information is too large to include in a system prompt.
- Retrieval precision matters more than generalization.
Do not build a knowledge base when:
- The information is stable and small enough to include in a system prompt (< 2,000 tokens).
- General Claude knowledge is sufficient (widely-known facts, common procedures).
- The latency of RAG retrieval would be unacceptable for the use case.
21.3 Knowledge Base Design Principles
One KB per knowledge domain
Do not put unrelated content in the same knowledge base. Mixing domains increases retrieval noise (irrelevant results appearing). Separate KBs:
| KB | Content |
|---|---|
iti-expat-tax-laws |
Tax treaty documents, country-specific tax guides |
iti-expat-visa-requirements |
Visa and immigration documents |
iti-travel-destinations |
Destination guides, travel tips |
Prefer quality over quantity
A KB with 50 high-quality, well-structured documents outperforms a KB with 500 poorly-formatted, redundant documents. Curate content before ingestion.
Chunk size matches content density
See Chapter 11, Section 11.4 for chunking strategy guidelines.
21.4 RAG Architecture
RAG (Retrieval-Augmented Generation) is the pipeline that combines knowledge base retrieval with LLM generation:
User Query
│
▼
Embedding Model (text-embedding-3-small)
│ Query → 1536-dimension vector
▼
Vector Search (pgvector cosine similarity)
│ Find top-K most similar document chunks
▼
Reranking (optional)
│ Re-order results for precision
▼
Context Assembly
│ Format chunks into readable context string
▼
LLM (Claude)
│ System prompt + context + user query → response
▼
Response to User
21.5 Embedding Model
ITI uses text-embedding-3-small (OpenAI) as the default embedding model. This model:
- Produces 1536-dimension vectors
- Balances cost and quality well for document retrieval
- Is supported natively by Dify and pgvector
Note: If the embedding model is changed for a knowledge base, all existing embeddings must be regenerated (re-index the dataset). Mixing embeddings from different models in the same KB produces incorrect similarity scores.
21.6 Managing the Knowledge Base Inventory
All knowledge base content is tracked in: ITI/operations/knowledgebase-inventory.md
Before creating a new knowledge base, check this inventory — a KB covering the same topic may already exist.
What to track per KB
| Field | Description |
|---|---|
| KB Name | Descriptive name |
| Dify Dataset ID | UUID from Dify console |
| Products using it | Which products call this KB |
| Content source | Where source documents come from |
| Update frequency | How often content is refreshed |
| Last indexed | Date of most recent full index |
| Owner | Who is responsible for content quality |
21.7 Keeping Knowledge Bases Current
Static KBs (one-time ingestion of stable documents): No maintenance required beyond periodic quality checks.
Dynamic KBs (frequently updated content):
- Establish an update cadence (weekly, monthly) based on how quickly the source content changes.
- When new source documents are available:
- Add them to Dify via the UI or API.
- Remove outdated documents.
- After any significant update, run Retrieval Testing to verify quality.
- Log the update in
knowledgebase-inventory.md.
Automating KB updates with n8n
For KBs that pull from web sources (RSS feeds, website crawls):
- Create an n8n workflow triggered by a Schedule Trigger node (weekly).
- Use an HTTP Request node to fetch the updated content.
- Use the Dify API to add new documents and delete outdated ones.
- Log the update in a Dify dataset metadata field or n8n execution notes.
21.8 Direct pgvector Access
For custom tooling or one-off analysis, you can query the Dify embeddings directly in PostgreSQL. Connect to the dify database on iti-postgres.
Warning: Dify’s internal table names and schemas change between versions. The example below is illustrative — verify actual table names against your running Dify version before writing queries.
-- Illustrative: find similar chunks in the Dify embedding table
-- Table names vary by Dify version; check schema first:
-- docker exec iti-postgres psql -U postgres -d dify -c "\dt"
SELECT
id,
content,
document_id,
embedding <=> '[0.1, 0.2, ...]'::vector AS distance
FROM embeddings
WHERE dataset_id = 'your-dataset-uuid'
ORDER BY distance ASC
LIMIT 5;
Note: Direct SQL queries bypass Dify’s retrieval pipeline, including metadata filtering and reranking. Use the Dify API for production retrieval.
21.9 Product-Local Knowledge Bases
Some products maintain knowledge base content within their own directory trees:
| Product | Location | Contents |
|---|---|---|
| Scuba GPT | products/scuba-gpt/.../knowledgebase/ |
14,600+ dive sites, 6,900+ dive operators, marine life references |
| My TravelPlanner | products/my-travelplanner.com/knowledgebase/ |
Destination guides, scuba data (dive-operators.json), travel topics |
| Personal Assistant | Personal/personal-assistant/knowledgebase/ |
40+ files: scuba data JSON, almanac, disambiguations, advisory content |
| Estate Manager | products/estate-manager/wordpress/knowledgebase/ |
Legal, tax, and procedural documents |
| GD Chatbot | products/gd-chatbot/plugin/... |
Grateful Dead historical data |
These are source files for ingestion into Dify or Pinecone — they are not live knowledge bases themselves.
Previous: Chapter 20 — Agents, Skills & Pipelines | Next: Chapter 22 — Safety & Guardrails
