Run your own large language model with Mozilla's Llamafile

1.23K subscribers

9,234 views

316

About
Share

Published On Dec 1, 2023

How to run your own large language model with #mozilla #llamafile

* LLamafile Repo: https://github.com/Mozilla-Ocho/llama...

* Mistral Model Weights File: https://huggingface.co/TheBloke/Mistr...
* 0.2.1 release binary - https://github.com/Mozilla-Ocho/llama...

* Simon Willison Blog Article on Getting Started - https://simonwillison.net/2023/Nov/29...

* Mozilla Llamafile announcement - https://hacks.mozilla.org/2023/11/int...

About Me: I am a Staff Design Technologist on Mozilla's Innovation Team. Learn more about Mozilla Innovation at future.mozilla.org

0:00 Run a large language model locally on your computer from a single file
0:15 Why that's useful
0:20 Llamafile Demo - prompting for a poem about llamas
0:45 Adjusting model parameters
0:56 Getting up and running with Llamafile
1:26 Getting model weights from Hugging Face
2:11 Getting the latest llamafile release
3:06 Running llama server from the command line with your chosen model
3:50 Testing it out!
4:10 Running a single file executable instead of the llamafile server
4:47 Testing it out with a prompt
5:03 Getting help with llamafile and a list of options
5:20 Link to further instructions on Llamafile via Simon Willison's blog

Published On Dec 1, 2023

Share/Embed

Video Link