LLaVA 1.6 is here...but is it any good? (via Ollama)
Learn Data with Mark

Published on Feb 4, 2024

LLaVA (Large Language and Vision Assistant) recently released version 1.6. In this video, with help from Ollama, we compare it with version 1.5 to see how it has improved over the last few months. We'll see how well it describes a photo of me, whether it can create a caption for an image, how well it extracts text and code from images, and whether it can understand a diagram.

Resources
* Blog - https://www.markhneedham.com/blog/202...
* LLaVA 1.6 release - https://llava-vl.github.io/blog/2024-...
* Ollama - https://ollama.ai/
* Ollama Python Library - https://pypi.org/project/ollama/
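
For anyone who wants to try a similar side-by-side comparison, here is a minimal sketch using the Ollama Python library listed above. It is not the code from the video: the function name `compare_models`, the file name `photo.png`, and the model tags `llava:7b-v1.5-q4_0` and `llava:v1.6` are assumptions for illustration, and it expects a local Ollama server with both models already pulled.

```python
from typing import Callable, Dict, List, Optional


def compare_models(
    image_path: str,
    prompt: str,
    models: List[str],
    generate: Optional[Callable[..., dict]] = None,
) -> Dict[str, str]:
    """Send the same image and prompt to each model and collect the replies."""
    if generate is None:
        # Imported lazily so the function can be exercised without Ollama installed.
        import ollama

        generate = ollama.generate

    answers: Dict[str, str] = {}
    for model in models:
        # `images` takes a list of file paths (or raw bytes) for multimodal models.
        reply = generate(model=model, prompt=prompt, images=[image_path])
        answers[model] = reply["response"]
    return answers


if __name__ == "__main__":
    # Assumed tags and file name; run `ollama list` to see what you have pulled.
    results = compare_models(
        "photo.png",
        "Describe this image",
        ["llava:7b-v1.5-q4_0", "llava:v1.6"],
    )
    for model, answer in results.items():
        print(f"--- {model} ---\n{answer}\n")
```

Swapping the prompt for something like "Create a caption for this image" or "What text is written on this image?" covers the other tasks mentioned in the description. The optional `generate` argument is only there so the loop can be checked with a stand-in function instead of a running server.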
