Llama 3.2 is INSANE - But Does it Beat GPT as an AI Agent?
Cole Medin Cole Medin
11.1K subscribers
3,113 views
124

 Published On Sep 29, 2024

Meta recently released their latest suite of LLMs - Llama 3.2 - and they are CRUSHING it on the benchmarks! The 11b and 90b parameter versions even have vision capabilities.

But what I care about the most is how well they do as AI agents - LLMs that are able to send emails, message in Slack, do RAG, etc. Local LLMs have historically been not so great as AI agents (specifically because of poor function calling performance), so when Llama 3.2 came out, I was super excited to see how it fared as a function calling agent. And that's exactly what I test out in this video!

Llama 3.2 90b is pretty on par with GPT-4o-mini according to the benchmarks, so I pit the two models against each other to set a good baseline for seeing how good Llama 3.2 really is. Cage match!

00:00 - Plan of Attack
01:52 - Showcasing the AI Agent Code
05:03 - The Agent Capabilities (Tools)
06:52 - Testing GPT-4o-mini as an Agent
11:35 - Testing Llama 3.2 90b as an Agent
15:29 - Outro

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

All the code for the AI Agent that I used in this video to test Llama 3.2 90b and GPT-4o-mini can be found here:

https://github.com/coleam00/ai-agents...

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Artificial Intelligence is no doubt the future of not just software development but the whole world. And I'm on a mission to master it - focusing first on mastering AI Agents.

Join me as I push the limits of what is possible with AI. I'll be uploading videos three times a week - Sundays, Wednesdays, and Fridays at 7:00 PM CDT! Sundays and Wednesdays are for everything AI, Fridays are specifically for platform showcases (sometimes sponsored, always creative in approach!).

show more

Share/Embed