Headlines 24.nl collects current news via the RSS feeds of online newspapers. At any moment we present all the latest news in a clear overview.

Microsoft unveils Memora to tackle AI agents’ memory problem

05/07 18:15
With AI agents increasingly expected to remember conversations, preferences, and decisions over extended periods, Microsoft Research has developed Memora, a memory system designed to provide more scalable and reliable long-term recall than existing approaches.

AI agents are now expected to retain context across weeks or months rather than within individual chat sessions. As stored knowledge grows, memory can become fragmented, leading to duplicate information and slower retrieval. Microsoft Research claimed in a blog post that Memora addresses this problem by decoupling what the AI remembers from how it looks up that information, reducing context token usage by up to 98% while matching or exceeding full-context accuracy.

Limitations of today’s memory architectures

As AI assistants and autonomous agents move into long-horizon deployments, the absence of a principled memory system has become a critical bottleneck. While modern LLMs are powerful reasoners, they still start every session from scratch. Long conversations require models to repeatedly re-read their entire history, while new information is either stored as raw text or compressed into summaries where important details may be lost.

Approaches that address these problems exist, but they have limitations of their own. For instance, systems like Mem0 extract atomic facts from conversations, retrieval-augmented generation (RAG) approaches index raw text fragments for later recall, and graph-based memory systems such as Zep and GraphRAG impose structure through entity relations.

Most of these fall into two extremes. Content-fragmentation systems, such as RAG and Mem0, embed extracted facts or text fragments directly. This preserves detail but produces brittle, isolated entries that lose narrative coherence. Coarse-abstraction systems compress experience into compact summaries but strip away the constraints, edge cases, and numeric details that make memory useful in the first place.
Graph-based systems add structure on top of content but still rely on the content itself for retrieval, and they typically require rigid ontologies that don’t generalize across domains.

Decoupling memory from retrieval

The Memora architecture aims to address this by decoupling what is stored from how it is retrieved. Each memory entry has two components. The first is a primary abstraction, a short phrase (6–8 words) that captures what the memory is fundamentally about. The second is a memory value, which holds the rich content itself. Because of this separation, new information about an evolving topic is merged into the existing memory entry under the same primary abstraction rather than fragmented into a chain of partial duplicates.

Complementing primary abstractions, cue anchors are short, context-aware tags extracted from each memory’s value, providing alternative access paths to the same memory. They function as flexible, organically generated metadata, according to the post.

Memora also introduces a policy-guided retriever that, rather than returning the top-k semantically similar items in a single shot, iteratively refines its query, expands through cue anchors to surface related-but-not-similar memories, and decides when to stop.

“The deepest flaw in current agent memory is that it mistakes retrieval for memory. A vector store is superb at finding text that looks relevant. An enterprise agent needs more than resemblance. It needs to know what has changed, what still holds true, and what should never be recalled in the task at hand,” said Sanchit Vir Gogia, chief analyst at Greyhound Research.

Memora is interesting precisely because it refuses that shortcut, Gogia noted. It separates the rich detail of a memory from the handle used to find it, indexing a stable abstraction and a set of cue anchors while keeping the full content intact beneath them.
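
To make the separation between what is stored and how it is found more concrete, here is a minimal Python sketch. It is not Memora's code or API: the names `MemoryEntry`, `MemoryStore`, and `retrieve`, the keyword matching, and the fixed stopping threshold are all assumptions made for illustration, and the sample data is invented. It only shows the ideas described above: entries are keyed by a primary abstraction, updates merge into the existing entry, and retrieval expands through cue anchors over several steps instead of a single lookup.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryEntry:
    abstraction: str   # short phrase naming what the memory is about
    value: str         # the rich content itself
    cue_anchors: set[str] = field(default_factory=set)  # alternative access tags


class MemoryStore:
    """Hypothetical store: only abstractions and cue anchors are indexed."""

    def __init__(self):
        self.entries: dict[str, MemoryEntry] = {}

    def write(self, abstraction, content, cues=()):
        key = abstraction.lower().strip()
        entry = self.entries.get(key)
        if entry is None:
            entry = MemoryEntry(abstraction, content)
            self.entries[key] = entry
        else:
            # Same topic: merge into the existing entry instead of duplicating.
            entry.value += "\n" + content
        entry.cue_anchors.update(c.lower() for c in cues)
        return entry

    def search(self, query):
        # Toy matcher (assumption): word overlap with the index terms.
        # The memory value is never searched, only the handles above it.
        words = set(query.lower().split())
        hits = []
        for entry in self.entries.values():
            index_terms = set(entry.abstraction.lower().split()) | entry.cue_anchors
            if words & index_terms:
                hits.append(entry)
        return hits


def retrieve(store, query, max_steps=3, enough=2):
    """Iterative retrieval; the fixed 'enough' threshold stands in for a policy."""
    found = {}
    frontier = [query]
    for _ in range(max_steps):
        next_frontier = []
        for q in frontier:
            for entry in store.search(q):
                if entry.abstraction not in found:
                    found[entry.abstraction] = entry
                    # Expand through cue anchors to reach related memories.
                    next_frontier.extend(entry.cue_anchors)
        if len(found) >= enough or not next_frontier:
            break  # stop once there is enough, or nothing new to follow
        frontier = next_frontier
    return list(found.values())


# Invented example data, for illustration only.
store = MemoryStore()
store.write("database migration plan", "Move to the new cluster.", cues=["deadline"])
store.write("database migration plan", "Deadline moved by one month.", cues=["october"])
store.write("vendor contract renewal", "Renewal depends on the deadline.", cues=["deadline", "legal"])

results = retrieve(store, "migration")
```

In this sketch the second `write` call extends the first entry rather than creating a new one, and the query "migration" reaches the contract entry only through the shared `deadline` cue anchor, which is the related-but-not-similar path the article describes.
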
Retrieval then becomes an act of navigation rather than a single hopeful guess, as the system re-queries, widens its search, or stops once it has enough, he added.

Benchmarking Memora

Microsoft evaluated Memora on two long-context benchmarks: LoCoMo, where dialogues average 600 turns, and LongMemEval, which uses 115,000-token contexts. According to the company, Memora achieved 86.3% LLM-judge accuracy on LoCoMo and 87.4% on LongMemEval, outperforming RAG, Mem0, Nemori, Zep, LangMem, and even full-context inference. It also stored nearly half as many memory entries per conversation as Mem0 (344 versus 651) while reducing token consumption by up to 98% compared with full-context inference.

While the benchmark results suggest significant efficiency gains, enterprises should not assume that lower token consumption will automatically translate into lower infrastructure costs. Gogia cautioned against taking the token reduction number at face value. It is a benchmark context reduction, not a promise that an enterprise bill will fall by 98%, he said. “Real cost also includes memory construction, indexing, storage, and the audit logging that governance demands.”

He warned that Memora’s strongest retrieval mode is also its slowest. Its policy retriever takes roughly five to six seconds per query across several model-calling steps, against under a second for the simpler semantic mode. The saving in prompt tokens is partly repaid as retrieval latency and extra inference. So the memory crunch does not disappear; it moves. Instead of paying only for longer prompts, enterprises must now manage what is written, updated, and forgotten, along with the indexing and testing that govern it.

Enterprise implications

Memora is currently an active Microsoft Research project, but the company has made the research code available on GitHub, enabling developers to experiment with the architecture and adapt it for their own AI applications.
However, portability on paper should not be confused with production readiness. While a memory layer of this design can, in principle, sit above models from any major provider, Gogia suggests that until the code is fully verifiable, maintained, and supportable under enterprise controls, the prudent posture for IT leaders is to study Memora as an architecture rather than operationalize it as software.

Beyond the technology, organizations will need governance and compliance policies to ensure AI memories are managed securely and remain auditable.

“An enterprise must decide who may write to memory, who may read it, how long it persists, and how an auditor reconstructs why a memory shaped an action. ‘The agent remembered it’ will not satisfy a regulator under the European Union’s AI Act traceability duties, nor a customer under India’s Digital Personal Data Protection Act,” Gogia said.

The article originally appeared on InfoWorld. ...


 
 


 
