Powering AI with the World's Largest Computer Chip with Joel Hestness - 684
The TWIML AI Podcast with Sam Charrington

Published on May 13, 2024

Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learning team at Cerebras. We discuss Cerebras’ custom silicon for machine learning, the Wafer Scale Engine 3 (WSE-3), and how the latest version of the company’s single-chip platform for ML has evolved to support large language models. Joel shares how the WSE-3 differs from other AI hardware solutions, such as GPUs, TPUs, and AWS Inferentia, and talks through the homogeneous design of the WSE chip and its memory architecture. We discuss software support for the platform, including support for open source ML frameworks like PyTorch and for different types of transformer-based models. Finally, Joel shares some of the research his team is pursuing to take advantage of the hardware's unique characteristics, including weight-sparse training, optimizers that leverage higher-order statistics, and more.

🔔 Subscribe to our channel for more great content just like this: https://youtube.com/twimlai?sub_confi...


🗣️ CONNECT WITH US!
===============================
Subscribe to the TWIML AI Podcast: https://twimlai.com/podcast/twimlai/
Follow us on Twitter: @twimlai
Follow us on LinkedIn: /twimlai
Join our Slack Community: https://twimlai.com/community/
Subscribe to our newsletter: https://twimlai.com/newsletter/
Want to get in touch? Send us a message: https://twimlai.com/contact/


📖 CHAPTERS
===============================
00:00 - Introduction
03:00 - Cerebras and AI hardware
18:32 - Cerebras vs alternatives such as GPUs, TPUs, and AWS Inferentia
21:28 - How does the WSE work?
33:56 - Who benefits from such a device?
47:31 - Building hardware around transformers
52:07 - Benefits of Cerebras research
54:38 - Conclusion


🔗 LINKS & RESOURCES
===============================
nanoGPT - https://github.com/karpathy/nanoGPT
Cerebras - https://www.cerebras.net/
TensorFlow - https://www.tensorflow.org/
PyTorch - https://pytorch.org/
JAX - https://jax.readthedocs.io/en/latest/...
AWS Inferentia - https://aws.amazon.com/machine-learni...
TPU - https://cloud.google.com/tpu


