Build a Production-Ready RAG System from Scratch
Agentic RAG System
Phase 1 of The Mother of AI (MOAI) Project. You'll build a production-grade Agentic RAG system end-to-end — an arxiv paper curator powered by Airflow pipelines, hybrid retrieval (BM25 + vectors with RRF), LLM generation via Ollama, and an agentic layer for query validation, document grading, and adaptive retrieval. Full observability with Langfuse, Redis caching, streaming APIs, and multiple client interfaces. No toy demos — 23+ tools, real architecture, real code you can run and ship.
Substack Annual Premium subscribers get 50% off — Contact us
What You'll Learn
7 weeks. From zero infrastructure to a production RAG system with observability.
Infrastructure & Data Engineering
The foundation real systems depend on.
Docker-orchestrated services, FastAPI, OpenSearch, PostgreSQL, and Airflow. Production habits from day one — service boundaries, retries, failure handling, and health checks.
Data Ingestion & Real-World Pipelines
Where most quality gains come from.
Automated pipelines that fetch, parse, and normalize academic PDFs. Messy inputs, unreliable APIs, parsing failures — the problems that dominate real systems.
Retrieval as a First-Class Problem
Not an afterthought — the core of RAG.
BM25 keyword search, semantic embeddings, structured chunking, and hybrid ranking. Keyword and semantic signals combined deliberately for stable, explainable retrieval.
Observable, Agentic Production System
Measure and iterate, not guess.
LLMs as a controlled component, not the centerpiece. Streaming, citations, full tracing, and caching. Then agentic workflows with LangGraph — query validation, document grading, and adaptive retrieval.
Tools & Technologies
23+ production tools you'll master
Docker-based service orchestration
FastAPI with health checks & docs
PostgreSQL for structured storage
OpenSearch setup
Airflow integration
Local LLM with Ollama
All course code is open source
Complete notebooks and production-ready Python scripts — browse the code before you enroll.
Learn from experienced practitioners — 18+ years building and deploying AI/ML systems at real companies.
How It Works
Self-paced curriculum that fits your schedule. Live support when you need it.
Self-Paced Recorded Curriculum
Six weeks of production-focused lessons. Learn at your own pace, on your own schedule. All labs are in Python scripts, runnable and production-ready.
Discord Support
Join our private Discord community for participants. Get help from peers, share progress, and collaborate daily.
Completion Certificate
A formal certificate recognizing your achievement, suitable for L&D budgets and professional development.
Lifetime Access
Retain access to all course materials, recordings, and future updates forever.
6-8 Hours Per Week
Build real systems alongside your full-time job. Designed to fit your busy schedule.
What you need
Prerequisites
Make sure you have these covered before starting.
Ready to Build?
Get lifetime access to 12 hours of hands-on content, real code, and production patterns.
Substack Annual Premium subscribers get 50% off — Contact us
Course Curriculum
8 modules · 29 lessons · 12 hours of content · Free Preview Available
01 Kickoff Session
1 lessons
Kickoff Session
1 lessons
Welcome session to kick off the course, set expectations, and walk through the roadmap.
- Kickoff Session Recording from 16th Nov 2025
02 Week 1 — The Infrastructure That Powers RAG Systems
4 lessons
Week 1 — The Infrastructure That Powers RAG Systems
4 lessons
Set up the production infrastructure foundation: Docker, FastAPI, PostgreSQL, OpenSearch, Airflow, and Ollama.
- Pre-Read (Must read before watching)
- Introduction to RAG
- Understanding the Architecture & Project Structure
- Infrastructure Walkthrough & Question & Answers
03 Week 2 — Bringing Your RAG System to Life: The Data Pipeline
5 lessons
Week 2 — Bringing Your RAG System to Life: The Data Pipeline
5 lessons
Build automated pipelines that fetch, parse, and normalize academic PDFs with real-world error handling.
- Pre-Read (Must read before watching)
- Introduction to Data Pipeline & Ingestion in RAG
- Question & Answers Part 1
- Ingestion Walkthrough
- Question & Answers Part 2
04 Week 3 — The Search Foundation Every RAG System Needs
4 lessons
Week 3 — The Search Foundation Every RAG System Needs
4 lessons
Implement keyword search with BM25, OpenSearch index design, and build a debuggable retrieval flow.
- Pre-Read (Must read before watching)
- Understanding the Basics of Search (BM25)
- Code Walkthrough & OpenSearch Keyword Retrieval
- Question & Answers
05 Week 4 — Chunking Strategies & Hybrid RAG System
5 lessons
Week 4 — Chunking Strategies & Hybrid RAG System
5 lessons
Explore chunking strategies, generate embeddings, and combine BM25 + vector search with Reciprocal Rank Fusion.
- Pre-Read (Must read before watching)
- Introduction to Chunking Strategies
- Question & Answers Part 1
- Code Walkthrough
- Question & Answers Part 2
06 Week 5 — The Complete RAG System
4 lessons
Week 5 — The Complete RAG System
4 lessons
Wire hybrid retrieval to LLM generation with streaming APIs, optimized prompts, and a chat interface.
- Pre-Read + File Downloads (Must read before watching)
- Architecture Revisit
- Code Walkthrough
- Question & Answers
07 Week 6 — Monitoring & Caching
4 lessons
Week 6 — Monitoring & Caching
4 lessons
Add end-to-end RAG tracing, latency and error monitoring, Redis caching, and production reliability patterns.
- Pre-Read + File Downloads (Must read before watching)
- LLM Observability Overview
- Question & Answers
- Code Walkthrough
08 Week 7 — Agentic RAG (Bonus)
2 lessons
Week 7 — Agentic RAG (Bonus)
2 lessons
Build agentic workflows with LangGraph — query validation, document grading, and adaptive retrieval.
- Pre-Read (Must read before watching)
- Agentic RAG with Code Walkthrough
Learn from people who've actually built these systems
This isn't theory from a textbook. Every module comes from real experience building and deploying RAG systems in production. We've made the mistakes so you don't have to.
We've intentionally priced this course low because we want people to actually take advantage of it and learn. Put your money in the right place — invest in skills that compound, taught by engineers who ship.
Frequently Asked Questions
Ready to Build?
Get lifetime access to 12 hours of hands-on content, real code, and production patterns.
Substack Annual Premium subscribers get 50% off — Contact us