Published On Feb 4, 2024
LLaVA (Large Language and Vision Assistant) recently released version 1.6. In this video, with help from Ollama, we compare it with version 1.5 to see how it has improved over the last few months. We look at how well it describes a photo of me, whether it can write a caption for an image, how well it extracts text and code from images, and whether it can understand a diagram.
Resources
* Blog - https://www.markhneedham.com/blog/202...
* LLaVA 1.6 release - https://llava-vl.github.io/blog/2024-...
* Ollama - https://ollama.ai/
* Ollama Python Library - https://pypi.org/project/ollama/
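
If you want to try the comparison yourself, here is a minimal sketch using the Ollama Python Library. It is not the exact code from the video. The helper name `compare_models` is hypothetical, and it assumes Ollama is running locally with both LLaVA versions already pulled.

```python
from typing import Any, Callable, Dict, List, Optional


def compare_models(
    image_path: str,
    prompt: str,
    models: List[str],
    generate: Optional[Callable[..., Any]] = None,
) -> Dict[str, str]:
    """Send the same prompt and image to each model and collect the replies.

    `generate` defaults to `ollama.generate`; passing another callable with
    the same keyword arguments makes it possible to try the helper offline.
    """
    if generate is None:
        # Imported lazily so the helper can be loaded without Ollama installed.
        import ollama

        generate = ollama.generate

    answers: Dict[str, str] = {}
    for model in models:
        # `images` takes a list of file paths (or raw bytes) for multimodal models.
        result = generate(model=model, prompt=prompt, images=[image_path])
        answers[model] = result["response"]
    return answers
```

For example, `compare_models("photo.jpg", "Describe this image.", ["llava:7b-v1.5-q4_0", "llava:7b-v1.6"])` would return one answer per model. The file name and both model tags are assumptions, so check `ollama list` for the tags you actually have.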