Using memory with LLM applications in production
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
When you implement memory for LLM chatbots, they can recall user interactions and learn from them over time. But getting this to work in production can be challenging. In this session, we'll review how you can use Metal to give your LLM chatbots both short and long-term memory, allowing them to support more complex queries and powerful retrieval augmentation in production.
Speaker:
Sergio Prada is the co-founder and CTO at Metal. Prior to joining Metal, Sergio worked in machine learning at Meta, developer tools and engineering at Datadog, and has spent +10 years in enterprise software.
Legal tech startup Harvey has recently confirmed that it successfully raised $200 million at an $11 billion valuation. The round was co-led by returning investors GIC and Sequoia, with participation from Andreessen Horowitz, Coatue, Conviction Partners, and Elad Gil, among others.
Wikipedia has implemented a new policy that bans AI-generated article creation and substantial rewrites. The ban on LLM usage was voted by Wikipedia's editor community, who found themselves increasingly overwhelmed by the non-stop flood of AI slop making its way into the encyclopedia.
Granola recently raised a $125M Series C at a $1.5B valuation. In addition to the round announcement, Granola also shared the three new features coming to the app: personal and enterprise APIs, an updated MCP, and Spaces for sharing and collaboration.
Interloom has raised a $16.5M seed round to develop a platform that captures undocumented operational expertise and transforms it into a permanent context layer for AI agents. With its "Context Graph", Interloom aims to address the critical knowledge gap that affects enterprise AI deployment.
Mistral has launched Forge, a platform that lets enterprises train AI models from scratch on their own proprietary data for greater accuracy and control.
Data Phoenix is a live media platform for AI and Data professionals, covering technologies under the hood, best practices, and live demos from the builders shaping the industry, via original shows.
Copyright © 2026 Data Phoenix. Published with Ghost and Data Phoenix.
Privacy Policy | Terms of Service | Cookie Preferences
Comments