Chat with PDF charts and tables using Gemini Pro Vision and GPT4 Vision models.

135 subscribers

137 views

About
Share

Published On Feb 3, 2024

In this video, I use 2 state of the art vision models to explore how they do on tasks such as answering questions based on charts and tables. Plus, 2 other cool use-cases for leveraging these vision models.
Please LIKE and SUBSCRIBE.

Links:
Code: https://github.com/mktaop/qna_vision
Langchain CLIP multimodal embeddings: https://python.langchain.com/docs/int...

Published On Feb 3, 2024

Share/Embed

Video Link