Diagnostic Classifiers: Revealing how Neural Networks Process Hierarchical Structure
| Authors | |
|---|---|
| Publication date | 2016 |
| Host editors |
|
| Book title | Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 |
| Book subtitle | co-located with the 30th Annual Conference on Neural Information Processing Systems (NIPS 2016) : Barcelona, Spain, December 9, 2016 |
| Series | CEUR Workshop Proceedings |
| Event | Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 |
| Article number | 6 |
| Number of pages | 9 |
| Publisher | Aachen: CEUR-WS |
| Organisations |
|
| Abstract |
We investigate how neural networks can be used for hierarchical, compositional semantics. To this end, we define the simple but nontrivial artificial task of processing nested arithmetic expressions and study whether different types of neural networks can learn to add and subtract. We find that recursive neural networks can implement a generalising solution, and we visualise the intermediate steps: projection, summation and squashing. We also show that gated recurrent neural networks, which process the expressions incrementally, perform surprisingly well on this task: they learn to predict the outcome of the arithmetic expressions with reasonable accuracy, although performance deteriorates with increasing length. To analyse what strategy the recurrent network applies, visualisation techniques are less insightful. Therefore, we develop an approach where we formulate and test hypotheses on what strategies these networks might be following. For each hypothesis, we derive predictions about features of the hidden state representations at each time step, and train ’diagnostic classifiers’ to test those predictions. Our results indicate the networks follow a strategy similar to our hypothesised ’incremental strategy’.
|
| Document type | Conference contribution |
| Language | English |
| Published at | http://ceur-ws.org/Vol-1773/CoCoNIPS_2016_paper6.pdf |
| Other links | http://ceur-ws.org/Vol-1773/ |
| Downloads |
CoCoNIPS_2016_paper6
(Final published version)
|
| Permalink to this page | |