Using memory with LLM applications in production
This talk reviews how you can use Metal to give your LLM chatbots both short- and long-term memory.
When you implement memory for LLM chatbots, they can recall past user interactions and learn from them over time. Getting this to work in production, however, can be challenging. In this session, we'll review how you can use Metal to give your LLM chatbots both short- and long-term memory, so they can support more complex queries and more powerful retrieval augmentation in production.
Speaker:
Sergio Prada is the co-founder and CTO at Metal. Before Metal, Sergio worked in machine learning at Meta and in developer tools and engineering at Datadog. He has more than 10 years of experience in enterprise software.
Copyright © 2026 Data Phoenix. Published with Ghost and Data Phoenix.