LLM | Ben Karciauskas

Why AI's Biggest Leap This Month Isn't About Model Size

You know those weeks when you glance at the AI headlines and think, “Okay, that’s neat, but it’ll be years before we feel it on the ground?” Then something drops that makes you double-take. That’s exactly how the Samsung Tiny Recursive Model (TRM) news hit me. And just as I was digesting that, the Thinking Machines Lab’s work on deterministic LLM inference showed up. Both are challenging what we thought AI needed to get better and more reliable. ...

Getting Started with Databricks LLM Chatbot with RAG and Vector Search Demo

Introduction I recently set up and have been playing around with the “Deploy LLM Chatbots with RAG and Databricks Vector Search” demo from Databricks. While not a complete step-by-step guide, this article offers insights into what to expect, the necessary steps, how to fix issues that may arise, and how to maximize your trial experience quickly. Getting access to Databricks You can get a free 2-week trial from Databricks. I used AWS, and if you want to follow along, you will need to do the same. Here are the main things to know for setting up the trial. ...

RAG vs Fine-Tuning: Exploring the Options

Retrieval-Augmented Generation (RAG) seems to be the new cool thing in the fast-changing world of generative AI, especially in language models. I’ve been really getting into LlamaIndex lately, and I’ve noticed that sometimes the answers my apps give aren’t as good as I’d like. Is it because of how I made the app, or should I try fine-tuning instead? Maybe I just need to use more specific and better prompts? I’ve decided to dig deeper into this. ...