Headlines 24.nl verzamelt actueel nieuws via de rss feeds van online kranten. Op elk moment geven wij al het laatste nieuws overzichtelijk weer.

Tevens kunt u inloggen om uw eigen nieuws pagina samen te stellen en zo alleen het nieuws te zien dat u interesseert.


 
 

Microsoft unveils Memora to tackle AI agents’ memory problem

05/07 08:15 - Microsoft unveils Memora to tackle AI agents’ memory problem
With AI agents increasingly expected to remember conversations, preferences, and decisions over extended periods, Microsoft Research has developed Memora, a memory system designed to provide more scalable and reliable long-term recall than existing approaches. AI agents are increasingly expected to retain context across weeks or months rather than individual chat sessions. Memory can become fragmented, leading to duplicate information and slower retrieval as knowledge grows. According to Microsoft, Memora can solve this problem by decoupling what the AI remembers from how it looks up that information, ultimately reducing context token usage by up to 98% while matching or exceeding full-context accuracy, Microsoft Research claimed in a blog post. Limitations of today’s memory architectures As AI assistants and autonomous agents move into long-horizon deployments, the absence of a principled memory system has become a critical bottleneck. While modern LLMs are powerful reasoners, they still start every session from scratch. Long conversations require models to repeatedly re-read their entire history, while new information is either stored as raw text or compressed into summaries where important details may be lost. Solutions to address these are available, but they too have limitations. For instance, systems like Mem0 extract atomic facts from conversations, retrieval-augmented (RAG) approaches index raw text fragments for later recall, and graph-based memory systems such as Zep and GraphRAG impose structure through entity relations. But these mostly fall into two extremes. Content-fragmentation systems, such as RAG and Mem0, embed extracted facts or text fragments directly. This preserves detail but produces brittle, isolated entries that lose narrative coherence. Coarse-abstraction systems compress experience into compact summaries but strip away the constraints, edge cases, and numeric details that make memory useful in the first place. Graph-based systems add structure on top of content but still rely on the content itself for retrieval and typically require rigid ontologies that don’t generalize across domains. Decoupling memory from retrieval Memora architecture claims to address this by decoupling what is stored from how it is retrieved. For this, each memory entry will have two components. The first will be a primary abstraction, which is a short phrase (6–8 words) that will capture what the memory is fundamentally about. The second will be a memory value, which will hold the rich content itself. As a result of this separation, new information about an evolving topic will be merged into the existing memory entry under the same primary abstraction and will not be fragmented into a chain of partial duplicates. Complementing primary abstractions, cue anchors are short, context-aware tags extracted from each memory’s value, providing alternative access paths to the same memory. They will function as flexible, organically-generated metadata, claimed the post. Memora also introduces a policy-guided retriever that, rather than returning the top-k semantically similar items in a single shot, iteratively refines its query, expands through cue anchors to surface related-but-not-similar memories, and decides when to stop. “The deepest flaw in current agent memory is that it mistakes retrieval for memory. A vector store is superb at finding text that looks relevant. An enterprise agent needs more than resemblance. It needs to know what has changed, what still holds true, and what should never be recalled in the task at hand,” said Sanchit Vir Gogia, chief analyst at Greyhound Research. Memora is interesting precisely because it refuses that shortcut, Gogia noted. It separates the rich detail of a memory from the handle used to find it, indexing a stable abstraction and a set of cue anchors while keeping the full content intact beneath them. Retrieval then becomes an act of navigation rather than a single hopeful guess, as the system re-queries, widens its search, or stops once it has enough, he added. Benchmarking Memora Microsoft evaluated Memora on two long-context benchmarks. LoCoMo, where dialogues average 600 turns, and LongMemEval, which uses 115,000-token contexts. According to the company, Memora achieved 86.3% LLM-judge accuracy on LoCoMo and 87.4% on LongMemEval, outperforming RAG, Mem0, Nemori, Zep, LangMem, and even full-context inference. It also stored nearly half as many memory entries per conversation as Mem0 (344 versus 651) while reducing token consumption by up to 98% compared with full-context inference. While the benchmark results suggest significant efficiency gains, enterprises should not assume lower token consumption will automatically translate into lower infrastructure costs. Gogia cautioned against taking the token reduction number at face value. It is a benchmark context reduction, not a promise that an enterprise bill will fall by 98%, he said. “Real cost also includes memory construction, indexing, storage, and the audit logging that governance demands.” He warned that Memora’s strongest retrieval mode is also its slowest. Its policy retriever runs at between roughly five and six seconds per query across several model-calling steps, against under a second for the simpler semantic mode. The saving in prompt tokens is partly repaid as retrieval latency and extra inference. So the memory crunch does not disappear but moves. Instead of paying only for longer prompts, enterprises must now manage what is written, updated, and forgotten, and the indexing and testing that govern it. Enterprise implications Memora is currently an active Microsoft Research project, but the company has made the research code available on GitHub, enabling developers to experiment with the architecture and adapt it for their own AI applications. However, portability on paper should not be confused with production readiness. While a memory layer of this design can, in principle, sit above models from any major provider, Gogia suggests that until the code is fully verifiable, maintained, and supportable under enterprise controls, the prudent posture for IT leaders is to study Memora as an architecture rather than operationalize it as software. Beyond the technology, organizations will need governance and compliance policies to ensure AI memories are managed securely and remain auditable. He noted an enterprise must decide who may write to memory, who may read it, how long it persists, and how an auditor reconstructs why a memory shaped an action. “An enterprise must decide who may write to memory, who may read it, how long it persists, and how an auditor reconstructs why a memory shaped an action. ‘The agent remembered it’ will not satisfy a regulator under the European Union’s AI Act traceability duties, nor a customer under India’s Digital Personal Data Protection Act,” Gogia said. The article originally appeared on InfoWorld. ...


 
 

Meer over computer

05/07 16:30  schrijven voor miljoenen fans

05/07 16:30 De WK-fanwalks als masterclass cultuurvorming

05/07 16:30 Iedereen knikt, toch verandert er de valkuil van schijninstemming

05/07 16:30 Van gestopte start-up naar tweede kans, mét AI

05/07 16:30 Vertrouwen is de snelste route naar B2B-groei [5 stappen]

05/07 16:30 De meest opvallende WK-inhakers van 2026

05/07 16:30 Zo voorkom je AI-hallucinaties met een slimme controlelaag

05/07 16:30 Zo maak je influencer marketing meetbaar [5 stappen]

05/07 16:30 10 redenen waarom je marketing nu écht niet meer zonder video kan

05/07 16:30 5 misverstanden over AI die ondernemers tijd kosten

05/07 16:30 Van Asch tot dit maakt sociale bewijskracht zo krachtig

05/07 16:30 B1-teksten invoeren? Begin met overtuigen, niet met schrijven

05/07 16:30 Ongevraagd klanten bellen met een commercieel aanbod mag sinds 1 juli niet meer

05/07 16:30 Hoe houd je grip op content als AI steeds meer taken overneemt?

05/07 16:30 Niet de luidste stem dialogisch leiderschap voor inclusieve samenwerking

05/07 16:30 Zo activeer je medewerkers op LinkedIn (ook als ze zeggen geen tijd te hebben)

05/07 16:30 Meer interactie in je trainingen? Zo pak je dat aan [5 leerpunten]

05/07 16:30 Waarom bezoekers afhaken terwijl ze wél geïnteresseerd zijn

05/07 16:30 Van online winkel naar AI- de transformatie van de webshop

05/07 16:30 Meta, Google en Wall Street zetten in op voorspellingsmarkten. Moet jij ook opletten?

05/07 16:30 Deze zomer pilot mobiele politiebureau in Gooi en Vechtstreek

05/07 16:30 MOBOTIX behaalt certificering voor thermische branddetectie

05/07 16:30 Nieuwe manager het CCV kiest voor impact

05/07 16:30 10 jaar geëist voor aanslag met vuurwerkbom

05/07 16:30 Securitas behaalt NIS2 Supply Chain-certificaat

05/07 16:30 KNVB en beveiligingsbranche krijgen steun voor motie evenementenbeveiliging

05/07 16:30 In Rien van der Linden

05/07 16:30 Zware criminelen krijgen strafkorting door trage rechtsgang

05/07 16:30 Chiu Man inspecteur-generaal Inspectie Justitie en Veiligheid

05/07 16:30 Digitalisering toegangsbeheer verlaagt kosten en versterkt beveiliging

05/07 16:15 Maak met je smartphone kiekjes die op Game Boy Camera-foto's lijken

05/07 16:15 Jim Carrey keert mogelijk terug als de Grinch

05/07 16:15 Toch afzien van je online aankoop? Webshops moeten hier nu een knop voor hebben

05/07 16:15 Lezen of luisteren op waarom je niet hóeft te kiezen tussen e-books en audioboeken

05/07 16:15 Je barbecue is pas compleet met deze 8 onmisbare BBQ-accessoires

05/07 16:15 Amazon weigert om deze film rondom OpenAI-baas uit te brengen

05/07 16:15 Review Philips 5000 airfryer single basket – Grote mand, handige rails en fijne stoomfunctie

05/07 16:15 Google Agenda krijgt eindelijk veel meer kleurtjes

05/07 16:15 Nothing brengt dit jaar geen nieuw model in budgettelefoonmerk CMF uit

05/07 16:15 Inzicht in de volg je internetsnelheid

05/07 16:15 Taylor Swift voor de verandering eens niet de populairste artiest op Apple Music

05/07 16:15 Nog eens drie nieuwe acteurs voor derde Fallout-seizoen bevestigd

05/07 16:15 AI in herschrijf, vat samen en verbeter

05/07 16:15 Netflix mag Sesamstraat verfilmen

05/07 16:15 Review ASUS Zenbook A16 - Lichte laptop is zware aanslag op je portemonnee

05/07 16:15 NASA werkt aan snellere Marsrover

05/07 16:15 Microsoft heeft eindelijk een antwoord op de MacBook Air

05/07 16:15 Nothing schrapte CMF Phone 3 Pro, maar komt wel met Nothing Phone 4b

05/07 16:15 Aardman toont meer van Pokémon The Misadventures of Sirfetch’d and Pichu

05/07 16:15 Apple Watch Ultra 4 wordt waarschijnlijk later dit jaar onthuld

 

login Member login

Emailadres

Wachtwoord