A production-style Retrieval-Augmented Generation (RAG) implementation in .NET using PostgreSQL pgvector, .NET Aspire, and pluggable LLM providers (Ollama / OpenAI).
This project demonstrates how to build a semantic search + RAG pipeline using modern .NET practices:
- Vector similarity search with pgvector
- LLM-based answer generation (Ollama / OpenAI)
- Clean Architecture (Domain, Application, Infrastructure)
- .NET Aspire orchestration (Postgres + Ollama + API)
- Provider-agnostic design (LLM + vector store)
## Architecture

```text
User Query
    ↓
Embedding Service (Ollama / OpenAI)
    ↓
Vector Store (pgvector)
    ↓
Top-K Retrieved Documents
    ↓
LLM (RAG Prompt)
    ↓
Generated Answer
```
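Each stage of the flow maps to one of the abstractions in `Application/Abstractions`. A rough sketch of their shape — the signatures are illustrative and the actual interfaces may differ:

```csharp
// Illustrative shapes only - the real interfaces live in
// Application/Abstractions and may use different signatures.
public interface IEmbeddingService
{
    Task<float[]> GenerateAsync(string text, CancellationToken ct = default);
}

public interface IVectorStore
{
    Task<IReadOnlyList<RetrievedDocument>> SearchAsync(
        float[] queryEmbedding, int topK = 2, CancellationToken ct = default);
}

public interface ILLMService
{
    Task<string> GenerateAsync(string prompt, CancellationToken ct = default);
}
```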
## Project Structure

```text
DotNetRagPgvector/
│
├── AppHost/            # Aspire orchestration
├── Api/                # Minimal API (entry point)
├── Application/        # Use cases + abstractions
│   ├── Abstractions/   # IEmbeddingService, IVectorStore, ILLMService
│   ├── Services/       # RagService, IngestionService
│   └── DTOs/           # UserInput
│
├── Domain/             # Core models
│   └── Models/
│       ├── Document
│       └── RetrievedDocument
│
├── Infrastructure/     # Implementations
│   ├── Entities/       # EF Core entities - DesignPatterns
│   ├── Persistence/    # EF Core + pgvector
│   ├── VectorStores/   # PgVectorStore
│   ├── Embeddings/     # Ollama / OpenAI
│   └── LLM/            # Ollama / OpenAI
```
## Features

- ✅ Semantic search using embeddings
- ✅ Retrieval-Augmented Generation (RAG)
- ✅ pgvector integration with PostgreSQL
- ✅ Clean architecture with strict separation of concerns
- ✅ Pluggable LLM providers (Ollama / OpenAI)
- ✅ Aspire-based container orchestration
- ✅ Scalar UI for API testing
## Tech Stack

- .NET 10
- ASP.NET Core Minimal APIs
- Entity Framework Core
- PostgreSQL + pgvector
- .NET Aspire
- Ollama (local LLM)
- OpenAI (optional)
## Prerequisites

- .NET 10 SDK
- Docker Desktop (or Podman - set the Aspire container runtime to `podman`)
- .NET Aspire workload

## Getting Started

Install the Aspire workload:

```bash
dotnet workload install aspire
```

Run the AppHost:

```bash
dotnet run --project AppHost
```

This starts:
- PostgreSQL (pgvector)
- Ollama
- API
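The AppHost composition is roughly along these lines — the resource names and the community Ollama hosting package are assumptions, and the actual `AppHost/Program.cs` may differ:

```csharp
// Illustrative Aspire composition - resource names are assumptions.
var builder = DistributedApplication.CreateBuilder(args);

var db = builder.AddPostgres("postgres")   // pgvector-enabled Postgres container
                .AddDatabase("ragdb");

var ollama = builder.AddOllama("ollama");  // CommunityToolkit.Aspire.Hosting.Ollama

builder.AddProject<Projects.Api>("api")
       .WithReference(db)
       .WithReference(ollama);

builder.Build().Run();
```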
Navigate to:

```text
http://localhost:<api-port>/scalar
```
## Example Query

```json
{
  "query": "How to handle distributed transactions?"
}
```

Other queries to try:

- How to prevent cascading failures?
- How to scale read-heavy systems?
- How do microservices communicate asynchronously?
## Configuration

```json
{
  "AI": {
    "Provider": "Ollama",
    "Ollama": {
      "BaseUrl": "http://localhost:11434",
      "EmbeddingModel": "nomic-embed-text",
      "ChatModel": "phi4-mini"
    },
    "OpenAI": {
      "ApiKey": "",
      "EmbeddingModel": "text-embedding-3-small",
      "ChatModel": "gpt-4o-mini"
    }
  }
}
```

Switch between Ollama and OpenAI:

```json
"Provider": "Ollama"
// or
"Provider": "OpenAI"
```

No code changes required.
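Under the hood, this presumably comes down to a configuration check at the composition root. A hypothetical sketch — the concrete service class names are assumptions:

```csharp
// Hypothetical provider switch in the API composition root;
// the concrete class names are illustrative.
var provider = builder.Configuration["AI:Provider"];

if (string.Equals(provider, "OpenAI", StringComparison.OrdinalIgnoreCase))
{
    builder.Services.AddSingleton<IEmbeddingService, OpenAIEmbeddingService>();
    builder.Services.AddSingleton<ILLMService, OpenAILLMService>();
}
else
{
    builder.Services.AddSingleton<IEmbeddingService, OllamaEmbeddingService>();
    builder.Services.AddSingleton<ILLMService, OllamaLLMService>();
}
```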
## Vector Search

- Uses PostgreSQL with pgvector
- Embeddings stored in a `vector` column
- Cosine similarity used for retrieval:

```csharp
var results = await dbContext.DesignPatterns
    .OrderBy(x => x.Embedding.CosineDistance(questionVector))
    .Take(2)
    .ToListAsync();
```

Generated SQL:

```sql
ORDER BY "Embedding" <=> @queryVector
LIMIT 2
```

## Data Seeding

- Automatic on startup
- Uses a real-world architecture patterns dataset
- Embeddings generated via the selected provider
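For reference, the backing schema is roughly equivalent to the following. The table and column names are illustrative — the real schema comes from EF Core migrations — and `vector(768)` assumes `nomic-embed-text`'s 768-dimensional output:

```sql
-- Illustrative schema; the actual table is created by EF Core migrations.
CREATE EXTENSION IF NOT EXISTS vector;

CREATE TABLE "DesignPatterns" (
    "Id"        uuid PRIMARY KEY,
    "Content"   text NOT NULL,
    "Embedding" vector(768)   -- dimension must match the embedding model
);
```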
## RAG Pipeline

```csharp
// 1. Embed the query
var queryEmbedding = await _embedding.GenerateAsync(query);

// 2. Retrieve relevant documents
var docs = await _vectorStore.SearchAsync(queryEmbedding);

// 3. Build the prompt
// 4. Generate the response using the LLM
```

## Extensibility

You can easily extend:
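Steps 3 and 4 might be fleshed out like this — the prompt wording and the `_llm` field and method name are illustrative:

```csharp
// 3. Build a grounded prompt from the retrieved documents
//    (prompt wording is illustrative)
var context = string.Join("\n\n", docs.Select(d => d.Content));
var prompt = $"""
    Answer the question using only the context below.
    If the context is insufficient, say so.

    Context:
    {context}

    Question:
    {query}
    """;

// 4. Generate the response using the configured LLM provider
var answer = await _llm.GenerateAsync(prompt);
```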
Vector stores:

- pgvector ✅
- Pinecone (future)
- FAISS (future)

LLM providers:

- Ollama ✅
- OpenAI ✅
- Azure OpenAI (easy to add)
## Design Principles

- Domain is persistence-agnostic
- Infrastructure handles external dependencies
- Application orchestrates the RAG pipeline
- API acts as composition root
## Future Enhancements

- Hybrid search (BM25 + vector)
- HNSW indexing for pgvector
- Streaming LLM responses
- Evaluation & benchmarking
- Metadata filtering
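On the HNSW item: pgvector supports approximate indexes natively, so the index would be created roughly like this (table and column names are illustrative, following the examples above):

```sql
-- Illustrative HNSW index for cosine distance (requires pgvector >= 0.5.0).
CREATE INDEX ON "DesignPatterns"
USING hnsw ("Embedding" vector_cosine_ops);
```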
## Contributing

Contributions are welcome! Feel free to open issues or PRs.
## License

MIT License
This repository demonstrates a production-ready, provider-agnostic RAG architecture in .NET.
## Acknowledgments

- pgvector
- .NET Aspire
- Ollama