PaperPulse logo
FeedTopicsAI Researcher FeedBlogPodcastAccount

Stay Updated

Get the latest research delivered to your inbox

Platform

  • Home
  • About Us
  • Search Papers
  • Research Topics
  • Researcher Feed

Resources

  • Newsletter
  • Blog
  • Podcast
PaperPulse•

AI-powered research discovery platform

© 2024 PaperPulse. All rights reserved.

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

arXivSource

Haozhen Zhang, Haodong Yue, Tao Feng, Quanyu Long, Jianzhu Bao, Bowen Jin, Weizhi Zhang, Xiao Li, Jiaxuan You, Chengwei Qin, Wenya Wang

cs.LG
|
Feb 5, 2026
4 views

One-line Summary

BudgetMem is a runtime memory framework for LLMs that optimizes performance-cost trade-offs using query-aware budget-tier routing across memory modules.

Plain-language Overview

BudgetMem is a new approach designed to improve how large language model (LLM) agents use memory when they need to process information beyond a single context. Unlike traditional methods that prepare memory in advance without considering specific queries, BudgetMem allows for real-time memory management that adjusts to the needs of the task at hand. It uses a system of memory modules, each available in different budget levels, to balance performance and cost effectively. By training a neural network to route tasks through these modules, BudgetMem achieves better accuracy while managing costs efficiently.

Technical Details