Multi-modal RAG: Chat with Docs containing Images
Prompt Engineering Prompt Engineering
169K subscribers
18,770 views
605

 Published On Premiered Jul 9, 2024

Learn how to build a multimodal RAG system using CLIP mdoel.

LINKS:
Notebook: https://tinyurl.com/pfc64874
Flow charts in the paper:
https://tinyurl.com/4pp78xuf
https://tinyurl.com/5yeww5py
https://tinyurl.com/4un6y6x5
https://tinyurl.com/2jkbb3ma


💻 RAG Beyond Basics Course:
https://prompt-s-site.thinkific.com/c...

Let's Connect:
🦾 Discord:   / discord  
☕ Buy me a Coffee: https://ko-fi.com/promptengineering
|🔴 Patreon:   / promptengineering  
💼Consulting: https://calendly.com/engineerprompt/c...
📧 Business Contact: [email protected]
Become Member: http://tinyurl.com/y5h28s6h

💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off).

Signup for Newsletter, localgpt:
https://tally.so/r/3y9bb0


00:00 Introduction to Multimodal RAC Systems
01:24 First Approach: Unified Vector Space
02:23 Second Approach: Grounding Modalities to Text
03:57 Third Approach: Separate Vector Stores
06:26 Code Implementation: Setting Up
09:05 Code Implementation: Downloading Data
11:13 Code Implementation: Creating Vector Stores
14:00 Querying the Vector Store


All Interesting Videos:
Everything LangChain:    • LangChain  

Everything LLM:    • Large Language Models  

Everything Midjourney:    • MidJourney Tutorials  

AI Image Generation:    • AI Image Generation Tutorials  

show more

Share/Embed