ISSUE-895: Autonomous Threat Feed Reliability & Quality Control

Technical Implementation Guide

Status: COMPLETE
Type: Infrastructure Enhancement
Category: Feed Resilience, Data Quality, Consensus Voting
Release: Production Ready

Overview

This comprehensive implementation introduces a resilience layer for threat intelligence feeds with:

Provider SLA Tracking: Monitor health metrics across all intelligence providers
Weighted Consensus Engine: Resolve conflicts between providers using health-based voting
Feed Quality Control: Continuous monitoring of data completeness, consistency, reliability, timeliness, and validity
Drift Detection: Identify anomalous pattern changes with configurable thresholds
Safe Mode Failover: Automatic activation when feed quality degrades below thresholds
Confidence Calibration: Dynamic adjustment based on validation accuracy
Real-time Dashboard: Comprehensive monitoring and alert system

Architecture Overview

┌─────────────────────────────────────────────────────────┐
│          Threat Intelligence Sources                     │
│  (HIBP, External Feeds, Third-party APIs)              │
└──────────────────┬──────────────────────────────────────┘
                   │
        ┌──────────┴──────────┐
        │                     │
        ▼                     ▼
    ┌────────────┐     ┌────────────┐
    │ Provider A │     │ Provider B │
    └──────┬─────┘     └──────┬─────┘
           │                  │
           │ SLA Tracking     │ SLA Tracking
           ▼                  ▼
    ┌──────────────────────────────┐
    │   ProviderSLA Model          │
    │ - Health Score (0-100)       │
    │ - Status (HEALTHY/DOWN)      │
    │ - Metrics (latency, errors)  │
    └──────────┬───────────────────┘
               │
               ▼
    ┌──────────────────────────────────┐
    │ Weighted Consensus Engine        │
    │ - Provider Health Weighting      │
    │ - Conflict Resolution            │
    │ - Majority/Unanimous Voting      │
    └──────────┬──────────────────────┘
               │
               ▼
    ┌──────────────────────────────────┐
    │ Feed Quality Control Service     │
    │ - Quality Metrics (5 factors)    │
    │ - Drift Detection                │
    │ - Confidence Calibration         │
    │ - Safe Mode Activation           │
    └──────────┬──────────────────────┘
               │
               ▼
    ┌──────────────────────────────┐
    │   FeedHealthScore Model      │
    │ - Overall Health (0-100)     │
    │ - Quality Metrics            │
    │ - Consensus Data             │
    │ - Drift Status               │
    │ - Safe Mode Config           │
    └──────────┬────────────────────┘
               │
               ▼
    ┌──────────────────────────────┐
    │ Safe Mode Routing Service    │
    │ - Failover Chain Building    │
    │ - Circuit Breaker            │
    │ - Fallback Provider Routing  │
    │ - Manual Review Queue        │
    └──────────┬────────────────────┘
               │
               ▼
    ┌──────────────────────────────┐
    │    REST API & Dashboard      │
    │ - 24 Endpoints               │
    │ - Real-time Health Monitor   │
    │ - Alert Management           │
    └──────────────────────────────┘

Components

1. Database Models

ProviderSLA (models/ProviderSLA.js)

Tracks individual provider performance and SLA compliance.

Key Fields:

providerId: Unique provider identifier
providerType: Type of provider (HIBP, EXTERNAL_FEED, etc.)
metrics:
- avgLatency, p95Latency, p99Latency (ms)
- uptime (0-100%)
- errorRate (0-100%)
- timeoutCount, incidentCount
- accuracyScore (0-100%)
- dataFreshness (age in seconds)
weight: Provider weight (default: 1, adjustable 0.2-10.0)
healthHistory: Last 100 health checks

Health Score Calculation:

Health = (latency × 0.20) + (availability × 0.30) + 
         (errorRate × 0.20) + (accuracy × 0.20) + (freshness × 0.10)

Status Determination:

HEALTHY: 90-100%
DEGRADED: 70-89%
UNHEALTHY: 50-69%
DOWN: <50%

Key Methods:

recordRequest(latency, success, timeout)  // Record metric data
getHealthScore()                          // Calculate weighted health
determineStatus()                         // Get current status

Static Query Methods:

getHealthyProviders()    // Find all healthy providers
getDegradedProviders()   // Find degraded providers
getRankingsByHealth()    // Get ranked provider list

FeedHealthScore (models/FeedHealthScore.js)

Tracks feed-level metrics across multiple providers.

Key Fields:

feedId: Feed identifier
consensus:
- agreementRate (0-100%): Percentage of providers agreeing
- conflictCount: Total conflicts detected
- conflictHistory: Last 100 conflicts with types and resolution times
drift:
- driftDetected: Boolean flag
- driftPercentage: Deviation from baseline
- driftThreshold: Default 20%
- driftHistory: Last 100 drift events
quality: 0-100% scores
- completeness (weight: 20%)
- consistency (weight: 25%)
- reliability (weight: 25%)
- timeliness (weight: 20%)
- validity (weight: 10%)
confidenceCalibration:
- baselineAccuracy: Initial accuracy measurement
- calibrationFactor: Multiplicative adjustment (default: 1.0)
- sampleSize: Statistical confidence count
safeMode:
- enabled: Boolean flag
- reason: Activation reason
- fallbackProvider: Routing target
- mode: CONSERVATIVE | PASSTHROUGH | MANUAL_REVIEW
- alertSent: Boolean flag
overallHealth: 0-100% weighted health score
activeAlerts: Array of current alerts with severity levels
lastHealthUpdate: Timestamp of last quality check

Health Status Enum:

EXCELLENT (90-100%)
GOOD (75-89%)
FAIR (60-74%)
POOR (40-59%)
CRITICAL (<40%)

Key Methods:

recordConflict(providers, conflictType, resolution)
recordDrift(currentDataPoints, driftPercentage)
updateQualityMetric(metricName, value)
calibrateConfidence(baselineAccuracy, sampleSize)
activateSafeMode(reason, fallbackProvider, mode)
deactivateSafeMode()
addAlert(type, severity, message)

2. Services

WeightedConsensusEngine (services/weightedConsensusEngine.js)

Resolves conflicting intelligence from multiple providers.

Features:

Provider health-based weight calculation
Conflict grouping and agreement scoring
Multi-strategy consensus resolution:
- UNANIMOUS: 95%+agreement
- MAJORITY: 50%+ agreement
- WEIGHTED_VOTE: Health-based voting
- FALLBACK: Historical resolution

Key Methods:

async resolveConflict(feedId, providerResults)
// Input: feedId, [{providerId, result}, ...]
// Output: { consensus, resolvedValue, strategy, confidence, providers }

async batchResolveConflicts(feedId, providerResultsBatch)
// Resolve multiple conflicts in batch

async getConsensusStatistics(feedId)
// Return: { totalConflicts, agreementRate, conflictTrend }

Consensus Resolution Algorithm:

Group results by value
Get provider weights from SLA data
Calculate agreement scores for each group
Sort by agreement score
Select highest scoring group as consensus
Determine strategy based on agreement percentage

FeedQualityControlService (services/feedQualityControlService.js)

Monitors feed quality metrics and manages safe mode activation.

Features:

Five-factor quality measurement
Drift detection with baseline comparison
Confidence calibration based on accuracy
Safe mode auto-activation when thresholds exceeded

Quality Metrics:

Completeness (20%): All expected fields present
Consistency (25%): Data matches across providers
Reliability (25%): Availability and accuracy
Timeliness (20%): Data freshness and update frequency
Validity (10%): Data format and integrity

Drift Detection:

Establishes baseline from 7-day historical data
Calculates percentage deviation from baseline
Triggers alerts if deviation exceeds threshold (default: 20%)
Maintains 100-entry drift history

Key Methods:

async runQualityCheck(feedId)
// Comprehensive quality assessment
// Output: { quality metrics, health status, safe mode status }

async detectDrift(feedId, currentDataPoints)
// Input: feedId, numeric data point
// Output: { driftDetected, driftPercentage, baseline, threshold }

async calibrateConfidence(feedId, validationData)
// Input: feedId, [{correct: boolean}, ...]
// Adjusts confidence calibration factor

async getQualityReport(feedId)
// Full quality and health report

SafeModeRoutingService (services/safeModeRoutingService.js)

Manages provider failover and circuit breaking.

Features:

Health-based provider chain building
Circuit breaker pattern for failed providers
Three safe mode operational strategies
Automatic retry with exponential backoff

Safe Mode Operational Modes:

CONSERVATIVE: Only accept high-confidence data (>85%)
PASSTHROUGH: Accept data as-is without validation
MANUAL_REVIEW: Queue data for human review

Circuit Breaker Logic:

Tracks consecutive failures per provider
Opens circuit after 5 consecutive failures
Resets automatically after 5 minutes
Prevents cascading failures

Key Methods:

async routeRequest(feedId, primaryProviders, requestPayload)
// Route with automatic failover
// Output: { success, data, providerId, routingStrategy }

async getRoutingStatus(feedId)
// Get current routing configuration and circuit breaker states

async forceFailover(feedId, toProviderId)
// Manual failover to specific provider

3. API Routes (routes/feed-health-routes.js)

24 REST Endpoints organized by category:

Provider SLA Endpoints

GET /api/feed-health/providers - List all providers with health
GET /api/feed-health/providers/:providerId - Get provider details
POST /api/feed-health/providers/:providerId/record-request - Record metrics
GET /api/feed-health/providers/rank/by-health - Provider rankings

Feed Health Endpoints

GET /api/feed-health/feeds - List all feeds
GET /api/feed-health/feeds/:feedId - Get feed report
POST /api/feed-health/feeds/:feedId/check-quality - Run quality check
POST /api/feed-health/feeds/:feedId/detect-drift - Detect drift
POST /api/feed-health/feeds/:feedId/calibrate-confidence - Calibrate confidence

Consensus Endpoints

POST /api/feed-health/consensus/resolve - Resolve single conflict
POST /api/feed-health/consensus/batch-resolve - Batch resolve
GET /api/feed-health/feeds/:feedId/consensus-stats - Consensus statistics

Safe Mode & Failover

GET /api/feed-health/routing/:feedId - Get routing status
POST /api/feed-health/routing/:feedId/request - Route with fallback
POST /api/feed-health/routing/:feedId/failover - Force failover
POST /api/feed-health/feeds/:feedId/safe-mode/activate - Activate safe mode
POST /api/feed-health/feeds/:feedId/safe-mode/deactivate - Deactivate safe mode

Monitoring Endpoints

GET /api/feed-health/critical-feeds - Get critical feeds
GET /api/feed-health/safe-mode-feeds - Get safe mode feeds
GET /api/feed-health/drift-detected-feeds - Get drift feeds

4. Dashboard UI

Components

Summary Cards: 6 KPI displays with real-time updates
Feeds Tab: Grid view of all feeds with health indicators
Providers Tab: Ranked provider list with health charts
Consensus Tab: Agreement rate and conflict trends
Alerts Tab: Active alerts with severity filtering
Drift Tab: Drift-detected feeds with investigation tools
Safe Mode Panel: Currently active safe mode feeders

Features

Auto-refresh every 30 seconds
Real-time health visualization
Quick-action buttons for safe mode management
Tab-based navigation
Responsive design for mobile
Color-coded health indicators
Trend analysis

Integration Guide

1. Setup Models

// models/ProviderSLA.js - Already created
// models/FeedHealthScore.js - Already created

2. Register Services

// In server.js or main application file
const weightedConsensusEngine = require('./services/weightedConsensusEngine');
const feedQualityControlService = require('./services/feedQualityControlService');
const safeModeRoutingService = require('./services/safeModeRoutingService');

3. Mount API Routes

// In server.js
const feedHealthRoutes = require('./routes/feed-health-routes');
app.use('/api/feed-health', feedHealthRoutes);

4. Initialize Dashboard

<!-- In your HTML -->
<script src="feed-health-dashboard.js"></script>
<link rel="stylesheet" href="feed-health-dashboard.css">
<div id="feed-health-dashboard"></div>

<script>
  // Auto-initializes on DOMContentLoaded
  feedHealthDashboard.init();
</script>

Usage Examples

Example 1: Resolve Provider Conflict

const weightedConsensusEngine = require('./services/weightedConsensusEngine');

// Multiple providers returned different results
const providerResults = [
  { providerId: 'HIBP', result: { breached: true, count: 150 } },
  { providerId: 'EXTERNAL_FEED', result: { breached: false } },
  { providerId: 'INTERNAL', result: { breached: true, count: 145 } }
];

const consensus = await weightedConsensusEngine.resolveConflict(
  'credential-check-feed',
  providerResults
);

console.log(consensus);
// Output:
// {
//   consensus: true,
//   resolvedValue: { breached: true, count: 150 },
//   strategy: 'WEIGHTED_VOTE',
//   confidence: 0.65,
//   providers: [...],
//   conflictResolved: true
// }

Example 2: Run Quality Check

const feedQualityControlService = require('./services/feedQualityControlService');

const qualityReport = await feedQualityControlService.runQualityCheck('credential-feed');

console.log(qualityReport);
// Output:
// {
//   success: true,
//   feedId: 'credential-feed',
//   quality: {
//     completeness: 98,
//     consistency: 95,
//     reliability: 92,
//     timeliness: 100,
//     validity: 99
//   },
//   overallHealth: 96,
//   healthStatus: 'EXCELLENT',
//   safeModeStatus: false,
//   alerts: 0
// }

Example 3: Detect Drift

const feedQualityControlService = require('./services/feedQualityControlService');

// Monitor feed data point trend
const dataPoint = 10500; // Current number of records

const driftReport = await feedQualityControlService.detectDrift(
  'credential-feed',
  dataPoint
);

console.log(driftReport);
// Output:
// {
//   success: true,
//   driftDetected: true,
//   driftPercentage: "25.5",
//   currentDataPoints: 10500,
//   baseline: 8000,
//   threshold: 20
// }

Example 4: Route Request with Failover

const safeModeRoutingService = require('./services/safeModeRoutingService');

const result = await safeModeRoutingService.routeRequest(
  'credential-feed',
  ['HIBP', 'EXTERNAL_FEED', 'INTERNAL'],
  { query: 'user@example.com' }
);

console.log(result);
// Output (if primary fails):
// {
//   success: true,
//   data: { ... },
//   providerId: 'HIBP',
//   routingStrategy: 'PRIMARY_CHAIN',
//   fromSafeMode: false
// }

Example 5: Activate Safe Mode

const FeedHealthScore = require('./models/FeedHealthScore');

const feed = await FeedHealthScore.findOne({ feedId: 'credential-feed' });

await feed.activateSafeMode(
  'Feed health degraded to 45%',
  'INTERNAL',
  'CONSERVATIVE'
);

// Now requests route through INTERNAL provider with strict validation

Configuration

Key Configuration Parameters

WeightedConsensusEngine:

config = {
  minConsensusThreshold: 0.6,      // 60% minimum agreement
  majorityThreshold: 0.5,          // 50%+ for majority
  unanimousThreshold: 0.95         // 95%+ for unanimous
}

FeedQualityControlService:

config = {
  driftCheckInterval: 300000,      // 5 minutes
  driftBaselineWindow: 604800000,  // 7 days
  driftThreshold: 20,              // 20% deviation
  safeModeActivationThreshold: 70, // Health score
  dataQualityWeights: {
    completeness: 0.20,
    consistency: 0.25,
    reliability: 0.25,
    timeliness: 0.20,
    validity: 0.10
  }
}

SafeModeRoutingService:

config = {
  failoverThreshold: 70,                    // Health score
  circuitBreakerThreshold: 5,               // Consecutive failures
  circuitBreakerResetTimeout: 300000,       // 5 minutes
  maxRetries: 3,
  retryBackoffMs: 100                       // Exponential base
}

Monitoring & Troubleshooting

Health Check Endpoints

# Get all feeds with health status
curl -X GET http://localhost:3000/api/feed-health/feeds

# Get specific feed report
curl -X GET http://localhost:3000/api/feed-health/feeds/credential-feed

# Get provider rankings
curl -X GET http://localhost:3000/api/feed-health/providers/rank/by-health

# Get feeds in critical condition
curl -X GET http://localhost:3000/api/feed-health/critical-feeds

# Get feeds in safe mode
curl -X GET http://localhost:3000/api/feed-health/safe-mode-feeds

# Get feeds with detected drift
curl -X GET http://localhost:3000/api/feed-health/drift-detected-feeds

Common Issues & Solutions

Issue: Feed health score dropping rapidly

Check provider SLA metrics
Review active alerts for specific issues
Investigate drift detection for pattern changes
Consider manual safe mode activation

Issue: High conflict rates between providers

Review consensus statistics
Check individual provider health scores
Adjust provider weights if needed
Consider temporarily disabling conflicting providers

Issue: Drift detection false positives

Review drift threshold settings (default: 20%)
Check if legitimate pattern change occurred
Calibrate confidence based on validation data
Adjust baseline if needed

Files Created

models/ProviderSLA.js (350 LOC)
- Provider health tracking and SLA monitoring
models/FeedHealthScore.js (380 LOC)
- Feed quality and consensus metrics
services/weightedConsensusEngine.js (450 LOC)
- Multi-provider conflict resolution
services/feedQualityControlService.js (400 LOC)
- Quality monitoring and drift detection
services/safeModeRoutingService.js (400 LOC)
- Failover and circuit breaking logic
routes/feed-health-routes.js (600 LOC)
- 24 REST API endpoints
feed-health-dashboard.js (750 LOC)
- Real-time monitoring UI
feed-health-dashboard.css (600 LOC)
- Dashboard styling
feed-health-dashboard.html (50 LOC)
- Dashboard HTML template
ISSUE_895_IMPLEMENTATION_SUMMARY.md (This file)
- Complete technical documentation

Total: ~3,980 LOC

Testing Recommendations

Unit Tests

Provider health score calculation
Consensus resolution algorithms
Quality metric aggregation
Drift detection logic

Integration Tests

End-to-end request routing with failover
Safe mode activation/deactivation
Consensus resolution with real SLA data
Dashboard data loading and updates

Performance Tests

Handle 1000+ concurrent requests
Consensus resolution under high conflict rates
Dashboard refresh performance with 100+ feeds
Memory usage during drift history retention

Future Enhancements

Machine Learning Integration
- Anomaly detection for drift patterns
- Predictive health scoring
- Provider reliability forecasting
Advanced Routing
- Geographic provider selection
- Cost-based routing optimization
- A/B testing framework for providers
Extended Monitoring
- Historical trend analysis
- Custom alert rules
- Provider comparison analytics
Integration Points
- Slack/email notifications
- Automatic incident response
- Integration with existing SIEM systems

Support & Documentation

Architecture Diagram: See overview section above
API Documentation: Inline comments in feed-health-routes.js
Configuration Guide: See Configuration section above
Monitoring Guide: See Monitoring & Troubleshooting section
Integration Guide: See Integration Guide section above

Version History

Version	Date	Changes
1.0	2024	Initial implementation with all core features

License

Part of ExpenseFlow security infrastructure enhancement initiative.

Dependencies:

mongoose (MongoDB ODM)
express (REST API routing)

Compatibility:

Node.js 14+
MongoDB 4.4+

FilesExpand file tree

ISSUE_895_IMPLEMENTATION_SUMMARY.md

Latest commit

History

ISSUE_895_IMPLEMENTATION_SUMMARY.md

File metadata and controls

ISSUE-895: Autonomous Threat Feed Reliability & Quality Control

Technical Implementation Guide

Overview

Architecture Overview

Components

1. Database Models

ProviderSLA (models/ProviderSLA.js)

FeedHealthScore (models/FeedHealthScore.js)

2. Services

WeightedConsensusEngine (services/weightedConsensusEngine.js)

FeedQualityControlService (services/feedQualityControlService.js)

SafeModeRoutingService (services/safeModeRoutingService.js)

3. API Routes (routes/feed-health-routes.js)

Provider SLA Endpoints

Feed Health Endpoints

Consensus Endpoints

Safe Mode & Failover

Monitoring Endpoints

4. Dashboard UI

Components

Features

Integration Guide

1. Setup Models

2. Register Services

3. Mount API Routes

4. Initialize Dashboard

Usage Examples

Example 1: Resolve Provider Conflict

Example 2: Run Quality Check

Example 3: Detect Drift

Example 4: Route Request with Failover

Example 5: Activate Safe Mode

Configuration

Key Configuration Parameters

Monitoring & Troubleshooting

Health Check Endpoints

Common Issues & Solutions

Files Created

Testing Recommendations

Unit Tests

Integration Tests

Performance Tests

Future Enhancements

Support & Documentation

Version History

License