Using memory with LLM applications in production
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
When you implement memory for LLM chatbots, they can recall user interactions and learn from them over time. But getting this to work in production can be challenging. In this session, we'll review how you can use Metal to give your LLM chatbots both short and long-term memory, allowing them to support more complex queries and powerful retrieval augmentation in production.
Speaker:
Sergio Prada is the co-founder and CTO at Metal. Prior to joining Metal, Sergio worked in machine learning at Meta, developer tools and engineering at Datadog, and has spent +10 years in enterprise software.
Nscale raised $2 billion in Europe's largest Series C at a $14.6 billion valuation to accelerate AI infrastructure buildout globally. In parallel, Nscale announced the appointment of Sheryl Sandberg, Nick Clegg, and Susan Decker to its board.
Replit raised $400 million at a $9 billion valuation, effectively tripling its valuation since its last funding round. Replit also launched Agent 4, a faster AI coding agent that can be run in multiple parallel instances and that can handle more complex workflows than its predecessors.
Recent AI translations of Wikipedia articles have been found to contain substantial errors and hallucinations, causing outrage amongst the Wikipedia volunteers tasked with fighting the endless stream of AI slop that threatens the encyclopedia's survival and integrity.
GPT-5.3 Instant, OpenAI's most recent model update, brings improved conversational tone, flow, and relevance after widespread frustration with GPT-5.2's overbearing tone and unwarranted assumptions about its users' intent and emotional states.
OpenAI raised $110 billion at a $730 billion pre-money valuation from SoftBank, NVIDIA, and Amazon. The startup also secured strategic partnerships for infrastructure and scaling with Amazon and NVIDIA, as OpenAI continues to serve 900M weekly ChatGPT users and 1.6M weekly Codex users.
SF Bay Area media and education platform focused on AI and Data. As a voice of AI industry, Data Phoenix delivers news, practical knowledge, and helps companies be heard in the community.
Copyright © 2026 Data Phoenix. Published with Ghost and Data Phoenix.
Privacy Policy | Terms of Service | Cookie Preferences
Comments