
RAG or Retrieval-Augmented Generation
This is a topic of its own. Basically, you can build up your AI’s memory and then exchange and even add information. You can use it for basic functions like fetching relevant information from documents, or feeding instructions to perform functions based on context you’ve provided.

Different parameters, components, and models in a simple RAG application. Source: Gradient Flow
I made a basic one for myself to test out. My experiments all ended with the free tiers of GPT’s OpenRouter, Groq, Gemini 2.5 Pro and Flash models. Qwen’s API needed my card even though they claimed a free tier. Broadening the context is what I’m struggling with. Like if you build a bot for appointment setting, it can only do that in intelligent ways. You give it more functionalities and it will work within those boundaries. You can program it to do those exact operational functions.
My goal was to make it give me strategic outputs.
My attempt to create a realtime GraphRAG that can give answers inspired by the divine. Source: self
But I have to preempt what I tell the agent to do. It can’t be random and I can’t expect the AI to know what I want. For RAG to work strategically within a given academic context, it will either come back empty handed or hallucinate on its own. Then it gets deeper and beyond regular RAG, you give it tools and so on, for example search tools.
The LLMs I find are built with these typical limitations: Western-European frameworks that dominate higher-level institutions and analytical thinking. I’ve been trying to get AI to stop defaulting to certain perspectives, to align with specific lines of academic thinking, but there’s bias baked in. There are uncensored models though, which are fun to experiment with but they will fry my GPU.
Although I found this to be an evolving space, with the evolution of GraphRAG and other hybrids this remains an interesting space to watch. And this will have more utilities to come, example Google’s NotebookLM. I made one for myself and tested it but I could not produce a divinely-oriented non-secular output because of LLM bias.