Run your own large language model with Mozilla's Llamafile
Practical AI through Prototypes Practical AI through Prototypes
1.23K subscribers
9,234 views
316

 Published On Dec 1, 2023

How to run your own large language model with #mozilla #llamafile

* LLamafile Repo: https://github.com/Mozilla-Ocho/llama...

* Mistral Model Weights File: https://huggingface.co/TheBloke/Mistr...
* 0.2.1 release binary - https://github.com/Mozilla-Ocho/llama...

* Simon Willison Blog Article on Getting Started - https://simonwillison.net/2023/Nov/29...

* Mozilla Llamafile announcement - https://hacks.mozilla.org/2023/11/int...

About Me: I am a Staff Design Technologist on Mozilla's Innovation Team. Learn more about Mozilla Innovation at future.mozilla.org

0:00 Run a large language model locally on your computer from a single file
0:15 Why that's useful
0:20 Llamafile Demo - prompting for a poem about llamas
0:45 Adjusting model parameters
0:56 Getting up and running with Llamafile
1:26 Getting model weights from Hugging Face
2:11 Getting the latest llamafile release
3:06 Running llama server from the command line with your chosen model
3:50 Testing it out!
4:10 Running a single file executable instead of the llamafile server
4:47 Testing it out with a prompt
5:03 Getting help with llamafile and a list of options
5:20 Link to further instructions on Llamafile via Simon Willison's blog

show more

Share/Embed