AI agent + Vision = Incredible

115K subscribers

53,586 views

About
Share

Published On Oct 16, 2023

A step by step tutorial of how to build vision powered AI agent via autogen + llava + stable diffusion AND Break down of 160-page analysis of GPT4V capabilities

🤘 Get 15% off on sceneXplain via my code AIJASON : https://go.jina.ai/scenexplainjason

🔗 Links
- Follow me on twitter: / jasonzhou1993
- Join my AI email list: https://www.ai-jason.com/
- My discord: / discord
- sceneXplain: https://go.jina.ai/scenexplainjason
- Vision-agent Github: https://github.com/JayZeeDesign/visio...

⏱️ Timestamps
0:00 Intro
1:15 What is multi-modal model
2:12 GPT4V ability break down
4:34 sceneXplain
6:00 Visual prompt techniques
10:53 Use cases
13:00 Build vision agent #1 - Setup
14:20 Build vision agent #2 - Use Llava model
15:58 Build vision agent #3 - Use Stable diffusion
16:52 Build vision agent #4 - Set agent system via autogen
18:53 Build vision agent #5 - Demo

👋🏻 About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! [email protected]

#gpt4 #autogen #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #bestaiagent #chatgpt #agentgpt #agent #babyagi #llava #stablediffusion

Published On Oct 16, 2023

Share/Embed

Video Link