← Home
AI Healthcare RAG Assistant
Retrieval-Augmented Generation system built on Google Cloud

This project is a fully functional, end-to-end Retrieval-Augmented Generation (RAG) system built entirely on the Google Cloud Platform. It features a conversational AI agent, powered by Dialogflow CX and Google's Gemini models, that can answer user questions about public health topics (specifically, the flu) based on real, up-to-date information scraped from the Centers for Disease Control and Prevention (CDC) website.

The entire system is serverless, event-driven, and designed to be a low-cost, scalable solution for providing factual, AI-driven answers from a trusted knowledge base.

Features

Architecture

The project is composed of two main asynchronous workflows: a daily data ingestion pipeline and a real-time query pipeline.

RAG Architecture Diagram

Technology Stack

Google Cloud Platform Services

Key Python Libraries

The Journey: A Note on Real-World Implementation

Built and stabilized a modern cloud stack on a bleeding-edge platform by systematically uncovering and fixing deep, often undocumented issues, for example, stubborn container start-up failures, non-intuitive console navigation for setting up workflows, persistent agent misbehaviors and API endpoints failures. The work highlights a persistent, methodical, and deeply technical approach to cloud engineering and debugging.

Future Improvements

Demo

A short walkthrough of the assistant experience and how the RAG pipeline produces grounded answers from the knowledge base.

Video file: Open or download.