StreamingLLM - Extend Llama-2 to 4 million tokens & 22x faster inference?
AI Jason

 Published On Oct 7, 2023

It's hard to get an LLM to generate large amounts of content or take in large inputs. To solve this, meet StreamingLLM: it extends Llama-2 & Falcon up to 4 million tokens, with 22x faster inference than a standard LLM ⚡️

Now you can even generate an entire book with an LLM!
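The trick behind StreamingLLM is keeping a few initial "attention sink" tokens in the KV cache alongside a sliding window of the most recent tokens, so the cache stays small no matter how long generation runs. Here's a minimal sketch of that eviction policy in plain Python; the function name `evict` and the parameters `n_sink`/`window` are illustrative, not the actual mit-han-lab API.

```python
# Sketch of the StreamingLLM attention-sink cache policy (illustrative,
# not the real library API): always keep the first `n_sink` token
# positions plus the most recent `window` positions, evict the middle.
def evict(cache, n_sink=4, window=8):
    """Return the token positions kept in the KV cache."""
    if len(cache) <= n_sink + window:
        return list(cache)          # cache still fits, keep everything
    return list(cache[:n_sink]) + list(cache[-window:])

positions = list(range(20))         # 20 generated token positions
kept = evict(positions)             # sinks 0-3 plus the last 8 positions
print(kept)                         # [0, 1, 2, 3, 12, 13, ..., 19]
```

Because the kept cache is bounded (`n_sink + window` entries) instead of growing with every generated token, attention cost per step stays constant, which is where the speedup over a standard ever-growing cache comes from.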

🔗 Links
- Follow me on twitter:   / jasonzhou1993  
- Join my AI email list: https://www.ai-jason.com/
- My discord:   / discord  
- StreamingLLM Github: https://github.com/mit-han-lab/stream...

👋🏻 About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! [email protected]

#llama2 #meta #gpt #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #largelanguagemodels #largelanguagemodel #chatgpt #gpt4 #machinelearning
