SageChat API Documentation Integration

Overview

Enhanced SageChat with schema-aware text-to-workflow capabilities by integrating OpenAPI spec fragments for real-time LLM context.

Implementation Date

January 7, 2025

Components Created

1. Backend API: ApiDocsFragmentController

Location: valkyrai/src/main/java/com/valkyrlabs/valkyrai/controller/ApiDocsFragmentController.java

Endpoints:

GET /api/docs/fragments/workflow - Returns workflow-related paths, schemas, and ExecModules
GET /api/docs/fragments/execmodules - Returns list of all available ExecModule class names
GET /api/docs/fragments/schemas/{schemaName} - Returns specific schema definition
GET /api/docs/fragments/minimal - Returns minimal context optimized for LLM (compressed, workflow-focused)

Features:

No authentication required (public access for easier integration)
Reads from classpath OpenAPI spec (YAML or JSON)
Extracts workflow-relevant fragments dynamically
Falls back to hardcoded ExecModule list if spec parsing fails
Efficient caching via client-side sessionStorage

2. Frontend Service: apiDocsService

Location: web/typescript/valkyr_labs_com/src/services/apiDocsService.ts

Key Functions:

fetchWorkflowFragment() - Fetch complete workflow-related documentation
fetchMinimalFragment() - Fetch LLM-optimized minimal context
fetchExecModules() - Fetch list of available ExecModules
fetchSchema(schemaName) - Fetch specific schema definition
buildLLMContext(fragment) - Build compact markdown context for LLM
clearApiDocsCache() - Clear all cached documentation
preloadFragments() - Background preload on app initialization

Features:

SessionStorage caching with 1-hour expiry
Automatic JWT authentication when available
Graceful fallback on errors
TypeScript interfaces for type safety

3. SageChat Integration

Location: web/typescript/valkyr_labs_com/src/components/SageChat/index.tsx

Enhancements:

Automatically fetches API docs fragment when LLM is selected
Enriches workflow-related messages with schema context
Updates VALOR_COMMAND_PROMPT to reference available ExecModules
Stores fragment in component state for reuse

Context Injection: When user mentions "workflow" in their message, SageChat automatically prepends:

## ValkyrAI Workflow Schema

### Available ExecModules:
- com.valkyrlabs.workflow.modules.EmailModule
- com.valkyrlabs.workflow.modules.RestApiModule
...

### Workflow Structure:
{...}

### Task Structure:
{...}

### Module Structure:
{...}

---

User Request: [original message]

4. App-Level Preloading

Location: web/typescript/valkyr_labs_com/src/App.tsx

Enhancement:

Preloads API docs fragments on app initialization
Runs in background without blocking UI
Ensures cached data is ready when SageChat opens

Usage Flow

App Initialization:
- App.tsx calls preloadFragments() on mount
- Fetches minimal fragment and ExecModules list
- Caches in sessionStorage for 1 hour
SageChat Opens:
- User opens SageChat dock
- Component fetches cached fragment (if available)
- Falls back to API if cache miss or expired
Workflow Creation:
- User types: "Create a workflow to send welcome emails"
- SageChat detects "workflow" keyword
- Automatically enriches message with schema context
- LLM receives complete ExecModule list and schemas
- LLM generates workflow command with valid modules
Command Execution:
- Existing workflow creation logic processes command
- Validates against actual schemas
- Creates workflow with proper graph structure
- Simulates and opens in designer

Benefits

Schema Awareness: LLM always knows current available modules
Reduced Hallucination: ExecModule class names are validated
Better Workflow Generation: LLM follows exact schema structures
Efficient Caching: 1-hour cache reduces API calls
Graceful Degradation: Works without cache, falls back on errors
Type Safety: Full TypeScript interfaces for all data structures

Testing Checklist

Test /api/docs/fragments/minimal endpoint returns expected structure
Test /api/docs/fragments/execmodules returns current module list
Test sessionStorage caching (check expiry after 1 hour)
Test SageChat workflow creation with valid ExecModules
Test error handling when API endpoint unavailable
Test preloading on app init (check console logs)
Verify LLM receives enriched context for workflow queries

Security Considerations

✅ Endpoints are public (no sensitive data exposed)
✅ Only returns API schema structure (no user data)
✅ JWT token passed when available for audit trails
✅ No write operations exposed
✅ Rate limiting handled by Spring Boot defaults

Performance

Initial Load: ~100-500ms for fragment fetch (only once per hour)
Cache Hit: <1ms (sessionStorage read)
Memory: ~50-100KB per cached fragment
Network: Minimal (only on cache miss or expiry)

Future Enhancements

Add WebSocket for real-time schema updates
Add fragment versioning for cache invalidation
Add compression (LZ-string) for large fragments
Add analytics for most-used ExecModules
Add schema validation before workflow creation
Add auto-complete suggestions based on cached schemas

valkyrai/src/main/java/com/valkyrlabs/valkyrai/controller/ApiDocsFragmentController.java
web/typescript/valkyr_labs_com/src/services/apiDocsService.ts
web/typescript/valkyr_labs_com/src/components/SageChat/index.tsx
web/typescript/valkyr_labs_com/src/App.tsx

Dependencies Added

None! Implementation uses existing Jackson, Spring Web, and sessionStorage APIs.

Configuration

No configuration required. Works out-of-the-box with existing OpenAPI spec in classpath.

Optional: Set custom cache expiry by modifying CACHE_EXPIRY_MS in apiDocsService.ts.

Overview​

Implementation Date​

Components Created​

1. Backend API: ApiDocsFragmentController​

2. Frontend Service: apiDocsService​

3. SageChat Integration​

4. App-Level Preloading​

Usage Flow​

Benefits​

Testing Checklist​

Security Considerations​

Performance​

Future Enhancements​

Related Files​

Dependencies Added​

Configuration​