Back to Glossary
Architectures

Memory augmented neural networks

Memory-augmented neural networks are neural network architectures that incorporate external memory modules, allowing them to read from and write to memory locations independent of the network's weights. This external memory enables the network to store and retrieve information over extended periods, enhancing its ability to handle complex tasks that require long-term dependencies and reasoning.

Explanation

Memory-augmented neural networks address the limitations of traditional neural networks in handling sequential data and tasks requiring long-term memory. Unlike recurrent neural networks (RNNs) which rely on hidden states to maintain information, memory-augmented networks use an explicitly addressable external memory. This memory is typically a matrix where each row represents a memory location. The network learns to 'read' from and 'write' to these memory locations through attention mechanisms. These mechanisms determine which parts of the memory are relevant for a given input. The controller network (often an RNN or feedforward network) processes the input and generates read/write keys and strengths, which are used to access the external memory. Different architectures, such as Neural Turing Machines (NTMs) and Differentiable Neural Computers (DNCs), implement different memory addressing schemes and update mechanisms. NTMs use content-based and location-based addressing, while DNCs incorporate temporal linkages between memory locations to improve memory management and reasoning. Memory-augmented networks are particularly effective in tasks such as question answering, machine translation, and program synthesis, where the ability to store and retrieve information is crucial.

Related Terms