Using memory with LLM applications in production
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
This talk will review how you can use Metal to give your LLM chatbots both short and long-term memory.
When you implement memory for LLM chatbots, they can recall user interactions and learn from them over time. But getting this to work in production can be challenging. In this session, we'll review how you can use Metal to give your LLM chatbots both short and long-term memory, allowing them to support more complex queries and powerful retrieval augmentation in production.
Speaker:
Sergio Prada is the co-founder and CTO at Metal. Prior to joining Metal, Sergio worked in machine learning at Meta, developer tools and engineering at Datadog, and has spent +10 years in enterprise software.
DeepSeek released DeepSeek-V4, an open-source 1.6-trillion-parameter model with a one-million-token context window that achieves near-frontier performance at roughly one-sixth the API cost of GPT-5.5 and Claude Opus 4.7.
ComfyUI raised $30M to scale its open-source platform that gives creators granular, node-based control over AI-generated media, addressing the limitations of prompt-based tools. ComfyUI now serves over 4M users and has become essential infrastructure for production studios and creative agencies.
Cohere and Germany's Aleph Alpha are merging to create a $20 billion transatlantic AI powerhouse focused on sovereign AI solutions, targeting the $600 billion market for organizations seeking independence from dominant US and Chinese AI providers.
OpenAI released GPT-5.5, a model that achieves state-of-the-art performance across coding, knowledge work, and scientific research while preserving efficiency. GPT-5.5 marks progress toward OpenAI's vision of an AI "super app" combining ChatGPT, Codex, and browser capabilities.
Tomorrow marks the opening of Imagine Next — Silicon Valley’s Global Climate Tech Capital Summit, bringing together founders, investors, corporates, and system leaders to accelerate planet-first innovation.
Data Phoenix is a live media platform for AI and Data professionals, covering technologies under the hood, best practices, and live demos from the builders shaping the industry, via original shows.
Copyright © 2026 Data Phoenix. Published with Ghost and Data Phoenix.
Privacy Policy | Terms of Service | Cookie Preferences
Comments