So far, we have seen how to perform read and write operation and we saw how the weight vector is used to perform those operations. But how do we compute this weight vector? We use an attention mechanism and different addressing schemes to compute it. We use two types of addressing mechanisms to access information from the memory:
- Content-based addressing
- Location-based addressing