Using memory with LLM applications in production
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
When you implement memory for LLM chatbots, they can recall user interactions and learn from them over time. But getting this to work in production can be challenging. In this session, we'll review how you can use Metal to give your LLM chatbots both short and long-term memory, allowing them to support more complex queries and powerful retrieval augmentation in production.
Speaker:
Sergio Prada is the co-founder and CTO at Metal. Prior to joining Metal, Sergio worked in machine learning at Meta, developer tools and engineering at Datadog, and has spent +10 years in enterprise software.
Arcee AI, a tiny US startup that recently developed a completely open contender to Meta's Llama 4 Maverick, recently released Trinity-Large-Thinking, an open weights reasoning model claimed to be "the strongest open model ever released outside of China."
Microsoft AI announced the availability of three foundational models for transcription and voice and image generation. The launch highlights these models' competitive pricing and practical business value as Microsoft shifts its AI strategy towards business and productivity solutions.
Mistral raised $830 million in debt financing from a consortium of seven banks to purchase 13,800 Nvidia GPUs for its first data center near Paris, positioning itself as Europe's sovereign AI alternative to US proprietary models.
Mercor recently confirmed it was victim to a security incident related to the compromise of the popular open-source project LiteLLM. Extortion group Lapsus$ has claimed responsibility for the attack, allegedly posting a sample of the stolen data on its leak site.
Legal tech startup Harvey has recently confirmed that it successfully raised $200 million at an $11 billion valuation. The round was co-led by returning investors GIC and Sequoia, with participation from Andreessen Horowitz, Coatue, Conviction Partners, and Elad Gil, among others.
Data Phoenix is a live media platform for AI and Data professionals, covering technologies under the hood, best practices, and live demos from the builders shaping the industry, via original shows.
Copyright © 2026 Data Phoenix. Published with Ghost and Data Phoenix.
Privacy Policy | Terms of Service | Cookie Preferences
Comments