Conversational AI app demonstrating accurate LLM inference with RAG over complex datasets. Tech stack: Serverless FastAPI backend on GCP (provisioned via Terraform), local ONNX embeddings, Firestore vector search, and LangChain orchestration.
Conversational AI app demonstrating accurate LLM inference with RAG over complex datasets. Tech stack: Serverless FastAPI backend on GCP (provisioned via Terraform), local ONNX embeddings, Firestore vector search, and LangChain orchestration.