Convolution and pooling
Projections use by default a set of weights per post-synaptic neuron. Some networks, including convolutional networks, define a single operation (convolution or pooling) to be applied systematically on all pre-synaptic neurons. In such cases, it would be a waste of resources to allocate weights for each post-synaptic neuron.
The extension convolution
(see its API) allows to implement such specific projections. It has to be imported explicitly at the beginning of the script:
import ANNarchy as ann
from ANNarchy.extensions.convolution import *
Shared weights are only implemented for rate-coded networks. The only possible backend is currently OpenMP, CUDA will be implemented later.
Simple convolutions
The simplest case of convolution is when the pre- and post-synaptic population have the same number of dimensions, for example:
= ann.Population(geometry=(100, 100),
pre =ann.Neuron(parameters="r = 0.0"))
neuron= ann.Population(geometry=(100, 100),
post =ann.Neuron(equations="r = sum(exc)")) neuron
Contrary to normal projections, the geometry of the populations (number of dimensions and neurons in each dimension) has a great influence on the operation to be performed. In particular the number of dimensions will define how the convolution will be applied.
If for example the pre-synaptic population represents an 2D image, you may want to apply a vertical edge detector to it and get the result in the post-synaptic population. Such a filter can be defined by the following Numpy array:
= np.array(
vertical_filter
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0]
[
] )
With 2 dimensions, the convolution operation (or more exactly, cross-correlation) with a 3*3 filter is defined for all neurons in the post-synaptic population by:
\text{post}[i, j] = \sum_{c_i=-1}^1 \sum_{c_j=-1}^1 \text{filter}[c_i][c_j] \cdot \text{pre}[i + c_i, j + c_j]
Such a convolution is achieved by creating a Convolution
object and using the connect_filter()
method to create the connection pattern:
= Convolution(pre=pre, post=post, target='exc')
proj =vertical_filter) proj.connect_filter(weights
Each neuron of the post-synaptic population will then receive in sum('exc')
(or whatever target name is used) the convolution between the kernel and a sub-region of the pre-syanptic population. ANNarchy defines the convolution operation for populations having 1, 2, 3, or 4 dimensions.
Several options can be passed to the convolve()
method:
padding
defines the value of the pre-synaptic firing rates which will be used when the coordinates are out-of-bounds. By default zero-padding is used, but you can specify another value with this argument. You can also use the'border'
value to repeat the firing rate of the neurons on the border (for example, if the filter tries to reach a neuron of coordinates (-1, -1), the firing rate of the neuron (0, 0) will be used instead).subsampling
. In convolutional networks, the convolution operation is often coupled with a reduction in the number of neurons in each dimension. In the example above, the post-synaptic population could be defined with a geometry (50, 50). For each post-synaptic neuron, the coordinates of the center of the applied kernel would be automatically shifted from two pre-synaptic neurons compared to the previous one. However, if the number of neurons in one dimension of the pre-synaptic population is not exactly a multiple of the number of post-synaptic neurons in the same dimension, ANNarchy can not guess what the correct correspondance should be. In this case, you have to specify this mapping by providing to thesubsampling
argument a list of pre-synaptic coordinates defining the position of the center of the kernel for each post-synaptic neuron. The list is indexed by the rank of the post-synaptic neurons (use therank_from_coordinates()
method) and must have the same size as the population. Each element should be a list of coordinates in the pre-synaptic population’s geometry (with as many elements as dimensions). It is possible to provide a Numpy array instead of a list of lists.
One can access the coordinates in the pre-synaptic geometry of the center of the filter corresponding to a particular post-synaptic neuron by calling the center()
method of Convolution
with the rank or coordinates of the post neuron:
= ann.Population(geometry=(100, 100), neuron = Whatever)
pre = ann.Population(geometry=(50, 50), neuron = Whatever)
post
= Convolution(pre=pre, post=post, target='exc')
proj =vertical_filter)
proj.connect_filter(weights
= proj.center(10, 10) # returns (20, 20) pre_coordinates
In some cases, the post-synaptic population can have less dimensions than the pre-synaptic one. An example would be when the pre-synaptic population has three dimensions (e.g. (100, 100, 3)), the last representing the R, G and B components of an image. A 3D filter, with 3 components in the last dimension, would result in a (100, 100, 1) post-synaptic population (or any subsampling of it). ANNarchy accepts in this case the use of a 2D population (100, 100), but it will be checked that the number of elements in the last dimension of the filter equals the number of pre-synaptic neurons in the last dimension:
= ann.Population(geometry=(100, 100, 3), neuron=ann.Neuron(parameters="r = 0.0"))
pre = ann.Population(geometry=(100, 100), neuron=ann.Neuron(equations="r = sum(exc)"))
post
= np.array(
red_filter
[
[2.0, -1.0, -1.0]
[
]
]
)
= Convolution(pre=pre, post=post, target='exc')
proj =red_filter) proj.connect_filter(weights
Non-linear convolutions
Convolution
uses by default a regular cross-correlation, summing w * pre.r
over the extent of the kernel. As for regular synapses, you can change this behavior when creating the projection:
the
psp
argument defines what will be summed. It isw*pre.r
by default but can be changed to any combination ofw
andpre.r
,such asw * log(1+pre.r)
:= Convolution(pre=pre, post=post, target='exc', psp='w*log(1+pre.r)') proj
the
operation
argument allows to change the summation operation. You can set it to ‘max’ (the maximum value ofw*pre.r
over the extent of the filter will be returned), ‘min’ (minimum) or ‘mean’ (same as ‘sum’, but normalized by the number of elements in the filter). The default is ‘sum’:= Convolution(pre=pre, post=post, target='exc', operation='max') proj
Layer-wise convolutions
It is possible to define kernels with less dimensions than the pre-synaptic population. A 2D filter can for example be applied on each color component independently:
= ann.Population(geometry=(100, 100, 3), neuron=ann.Neuron(parameters="r = 0.0"))
pre = ann.Population(geometry=(50, 50, 3), neuron=ann.Neuron(equations="r = sum(exc)"))
post
= np.array(
vertical_filter
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0]
[
]
)
= Convolution(pre=pre, post=post, target='exc')
proj =vertical_filter, keep_last_dimension=True) proj.connect_filter(weights
The important parameter in this case is keep_last_dimension
which tells the code generator that the last dimension of the input should not be used for convolution. The important constraint is that the post-synaptic population must have the same number of neurons in the last dimension than the pre-synaptic one (no subsampling is possible by definition).
Bank of filters
Convolutional networks often use banks of filters to perform different operations (such as edge detection with various orientations). It is possible to specify this mode of functioning by using the connect_filters()
method:
= ann.Population(geometry=(100, 100), neuron=ann.Neuron(parameters="r = 0.0"))
pre = ann.Population(geometry=(50, 50, 4), neuron=ann.Neuron(equations="r = sum(exc)"))
post
= np.array(
bank_filters
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0],
[1.0, 0.0, -1.0]
[
],
[-1.0, 0.0, 1.0],
[-1.0, 0.0, 1.0],
[-1.0, 0.0, 1.0]
[
],
[-1.0, -1.0, -1.0],
[0.0, 0.0, 0.0],
[ 1.0, 1.0, 1.0]
[
],
[1.0, 1.0, 1.0],
[ 0.0, 0.0, 0.0],
[ -1.0, -1.0, -1.0]
[
]
)
= Convolution(pre=pre, post=post, target='exc')
proj =bank_filters) proj.connect_filters(weights
Here the filter has three dimensions. The first one must correspond to each filter. The last dimension of the post-synaptic population must correspond to the total number of filters. It cannot be combined with keep_last_dimension
.
Current limitation: Each filter must have the same size, it is not possible yet to convolve over multiple scales.
Pooling
Another form of atypical projection for a neural network is the pooling operation. In max-pooling, each post-synaptic neuron is associated to a region of the pre-synaptic population and responds like the maximum firing rate in this region. This is already possible by defining the operation
argument of the synapse type, but it would use instantiated synapses, what would be a waste of memory.
The Pooling
class allows to define such an operation without defining any synapse:
= ann.Population(geometry=(100, 100), neuron=ann.Neuron(parameters="r = 0.0"))
pre = ann.Population(geometry=(50, 50), neuron=ann.Neuron(equations="r = sum(exc)"))
post
= Pooling(pre=pre, post=post, target='exc', operation='max')
proj proj.connect_pooling()
The pooling region of a post-synaptic region is automatically determined by comparing the dimensions of the two populations: here each post-synaptic neuron will cover an area of 2*2 neurons.
If the number of dimensions do not match, you have to specify the extent
argument to pooling()
. For example, you can pool completely over one dimension of the pre-synaptic population:
= ann.Population(geometry=(100, 100, 10), neuron=ann.Neuron(parameters="r = 0.0"))
pre = ann.Population(geometry=(50, 50), neuron=ann.Neuron(equations="r = sum(exc)"))
post
= Pooling(pre=pre, post=post, target='exc', operation='max')
proj =(2, 2, 10)) proj.connect_pooling(extent