Skip to content

Akshitha0118/ContextCore-Domain-Aware-RAG-Chatbot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ContextCore-Domain-Aware-RAG-Chatbot

Turn any PDF into an intelligent AI-powered Q&A system using Retrieval-Augmented Generation (RAG).


πŸ“Œ Overview

This project implements a Retrieval-Augmented Generation (RAG) pipeline that allows users to ask questions from a PDF document and receive context-aware, accurate answers along with source references.

Instead of relying solely on LLM knowledge, this system retrieves relevant chunks from the document and feeds them into the model β€” ensuring grounded and reliable responses.


✨ Features

  • πŸ“„ Load PDF directly from a URL
  • πŸ” Intelligent text chunking for better retrieval
  • 🧠 Semantic search using embeddings
  • πŸ—‚οΈ Vector storage using ChromaDB
  • ⚑ Fast inference using Groq LLM
  • πŸ”— RetrievalQA pipeline with LangChain
  • πŸ“Œ Returns answers with source context
  • πŸ’‘ Scalable and modular design

πŸ› οΈ Tech Stack

  • Python
  • LangChain
  • ChromaDB (Vector Database)
  • HuggingFace Embeddings
  • Groq API (LLM - LLaMA 3)
  • Unstructured (PDF Loader)

πŸ—οΈ Architecture

        β”‚   PDF Document     β”‚
        
                 β”‚
                 β–Ό
  
  β”‚ Unstructured File Loader    β”‚
  
           β”‚
           β–Ό

β”‚ Text Chunking (Splitter)   β”‚

         β”‚
         β–Ό

β”‚ HuggingFace Embeddings β”‚

      β”‚
      β–Ό

β”‚ Chroma Vector Database β”‚

      β”‚
      β–Ό

 β”‚ Retriever       β”‚

           β”‚
           β–Ό

 β”‚ Groq LLM (LLaMA 3)    β”‚

      β”‚
      β–Ό

βœ… Final Answer + Sources

About

RAG-based Question Answering system using LangChain, Groq LLM, and ChromaDB that processes PDFs and retrieves context-aware answers with source references.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors