Personalized Memory-Aware RAG with Cognitive Context Evolution (C-RAG)

A Comprehensive Framework for Longitudinal Intent Trajectory Modeling and Cognitive Disentanglement

Project Overview

Traditional Retrieval-Augmented Generation (RAG) architectures operate under the Stateless Assumption. Every query is processed as an isolated, independent and identically distributed (i.i.d.) event. While this is efficient for generic Q&A, it fails in high-stakes research environments where a user’s knowledge base evolves over time.

C-RAG (Cognitive-RAG) rejects the stateless paradigm. Instead, it proposes a system that models the user’s Intent Trajectory. By treating every interaction as a node in a evolving semantic graph, C-RAG achieves a level of personalization that mirrors the human executive function.

Cognitive Disentanglement Architecture

The system is built on a tripartite cognitive model, separating raw data from extracted knowledge.

Episodic Short-Term Memory (STM)

The STM serves as the high-fidelity input buffer. Unlike standard chat history, the STM in C-RAG is Weighted by Salience.

Sliding Window: Consists of the last $N$ interactions (default $N=5$).
Salience Scoring: Each item is assigned a confidence score based on the Research Efficiency Index (REI) at the time of its creation.

Semantic Long-Term Memory (LTM)

The LTM is the "Axiom Registry." It does not store raw chat; it stores Distilled Research Axioms. When the STM reaches its limit, the system undergoes a Consolidation Cycle.

Knowledge Distillation: The most relevant item is summarized into a foundational axiom.
Durable Persistence: All LTM items are synced to a production-grade JSON layer for multi-session stability.

Information Bottleneck Analysis (IBA)

During the transition from STM to LTM, the system applies IBA Distillation. The goal is to maximize Semantic Retainment while minimizing Token Complexity.

Calculus: $IBA = 1 - \frac{T_{final}}{T_{initial}}$ where $T$ represents the token count.

Technical Illustrations (Live Workspace Capture)

Ultimate Research Dashboard

The C-RAG Evolution Dashboard featuring Vanguard Glass-CSS and real-time intent visualization.

Cognitive Memory Evolution

Visualization of the STM-to-LTM transition (Episodic to Semantic) and IBA Compression Efficiency.

Scientific Laboratory (AQE)

Real-time NDCG and Hallucination Probability reports from a 10-trajectory DailyDialog simulation.

Autonomous Researcher Persona

Expertise discovery and identity export based on longitudinal interaction history.

Dataset Integration

The DailyDialog dataset is utilized for modeling Natural Conversational Intent. Research interactions are rarely one-off. DailyDialog provides multi-turn trajectories that allow us to test if the system can follow shift in logic.

The PersonaChat dataset is utilized for Semantic Identity Verification. A personalized RAG system must maintain a consistent "self-image" of the user. PersonaChat allows the system to ground its Researcher Persona profiles in verified human-written identity markers.

Evaluation Methodology

This project includes a dedicated AQE (Automated Quantitative Evaluation) Suite:

Metric	Scientific Importance	Description
NDCG	Retrieval Precision	Measures how effectively the system ranks internal history over noise.
Hallucination Index	Factuality	Based on contradiction with Long-Term Memory grounding.
AMW Stability	Adaptive Weighting	Adjusts retrieval math based on historical REI success.

Core Module Breakdown

memory_engine.py: The heart of the system. Implements the UltimateMemoryEngine class.
rag_pipeline.py: Orchestrates the dual-retrieval track (Internal Personal vs. External Research).
data_loader.py: Handles asynchronous downloading of Kaggle research datasets.
eval_engine.py: The laboratory module. Calculates the NDCG and Hallucination metrics.
main.py: The API Gateway and static dashboard host.

How to Run

Backend: python backend/main.py.
Frontend: Open frontend/index.html.
Simulation: Click "Run AQE Benchmark" in the sidebar.

Future Research Roadmap

Phase 5: Relational Knowledge Graphs

Migration of the Semantic LTM from a JSON flat-store to a Neo4j Graph Database to enable multi-hop reasoning.

Phase 6: Active Forgetting Loops

Introduction of a Saliency Decay Algorithm that autonomously "forgets" low-utility data points.

Phase 7: Collaborative Memory

Implementing a privacy-preserved Federated Memory Layer where multiple C-RAG instances can share insights.

R&D Ecosystem & Institutional Integration

This framework is strategically architected for Reproducible Research (R2) and stands as a baseline for longitudinal memory studies. It is currently optimized for collaborative integration within:

Global Research Context: Optimized for large-scale integration within top-tier academic and industrial AI laboratories.
MNC Research Divisions: Pioneering agentic memory architectures in a decentralized, multi-user environment.

Technical Contribution & Peer Review

The C-RAG protocol establishes a novel benchmark for Cognitive Context Evolution. We invite the global research community and institutional stakeholders to audit the AMW (Adaptive Memory Weighting) algorithms and contribute to the evolution of persistent cognitive memory layers.

Core Research Initiative: Personalized Memory Architectures

Research Trajectory: Agentic Cognitive Architectures | Information Bottleneck Distillation

Built for the next generation of personalized knowledge synthesis. 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md
render.yaml		render.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Personalized Memory-Aware RAG with Cognitive Context Evolution (C-RAG)

A Comprehensive Framework for Longitudinal Intent Trajectory Modeling and Cognitive Disentanglement

Project Overview