From Idea to Production: AI Infra for Scaling LLM Apps
MLOps World: Machine Learning in Production MLOps World: Machine Learning in Production
2.59K subscribers
154 views
0

 Published On May 16, 2024

Speaker: Guy Eshet, Product manager, Qwak

AI applications have to adapt to new models, more stakeholders and complex workflows that are difficult to debug.

Add prompt management, data pipelines, RAG, cost optimization, and GPU availability into the mix, and you're in for a ride.

How do you smoothly bring LLM applications from Beta to Production? What AI infrastructure is required?

Join Guy in this exciting talk about strategies for building adaptability into your LLM applications.

We'll be diving into:

The challenges in building Generative AI and LLM apps
Adding adaptability into the design and deployment of LLM applications
Build LLM applications ready for the next best model

show more

Share/Embed