AI Features

Scoping and Planning a Production RAG Application

Get introduced to the Support Bot project and learn how it aligns with our 4D framework.

We have spent the last few lessons building a working model of LLMOps: the 4D life cycle and the reference architecture for production RAG systems. Now we need to apply those ideas to a concrete build. The goal of this course is to ship a RAG application that is reliable, measurable, and maintainable. That requires a small set of tools with clear responsibilities, plus explicit quality gates we can test before deployment.

The RAG application we will build is an HR support assistant for a fictional company. We will build upon this throughout the remainder of the course. We also map the 4D framework directly to the engineering work we will execute.

This will provide a consistent framework for determining what to build next, measuring progress, and identifying when a phase is complete.

An HR support assistant

Assume we are the engineering team at a fictional company called Halluli that does not have a dedicated HR department. All company policies and processes are documented. New hires currently have to search through hundreds of pages of Markdown documentation to determine whether the company offers work-from-home options.

During ...

Ask