Want better LLM results? Then it's time for AI evaluation tools – learning from Galileo's RAG and agentic metrics

Whether we should trust AI – particularly generative AI – remains a worthy debate. But if you want a better LLM result, you need two things: better data, and better evaluation tools. Here’s how a chip on my shoulder about a missing RAG evaluation metric led me to Galileo, and this deep dive.