Quickstart Sinabs#

If you’re familiar with how SNNs work, you might find this quick overview about Sinabs useful.

Sinabs is based on PyTorch#

All of Sinabs’ layers inherit from torch.nn.Module. Thus you will be able to access your parameters, wrap layers in a nn.Sequential module and all the other things that you would do with a normal PyTorch layer.

How to define your network#

We want to re-use as much PyTorch functionality as possible. We use Linear, Conv2d and AvgPool layers to define weight matrices, whereas Sinabs layers add state as well as the non-linear activation to each of those weight layers. This is a definition of a simple SNN which takes as an input a tensor of (Batch, Time, Channels):

import torch
import torch.nn as nn

import sinabs.activation
import sinabs.layers as sl

model = nn.Sequential(
    nn.Linear(16, 64),
    sl.LIF(
        tau_mem=10.0,
        activation_fn=sinabs.activation.ActivationFunction(
            surrogate_grad_fn=sinabs.activation.SingleExponential()
        ),
    ),
    nn.Linear(64, 4),
    sl.LIF(
        tau_mem=10.0,
        activation_fn=sinabs.activation.ActivationFunction(
            surrogate_grad_fn=sinabs.activation.SingleExponential()
        ),
    ),
)

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
/tmp/ipykernel_315/1188395460.py in <module>
      9     sl.LIF(
     10         tau_mem=10.0,
---> 11         activation_fn=sinabs.activation.ActivationFunction(
     12             surrogate_grad_fn=sinabs.activation.SingleExponential()
     13         ),

AttributeError: module 'sinabs.activation' has no attribute 'ActivationFunction'

Inference with SNNs#

For simple inference using SNNs, you just use the model like any other torch model

# Define an input (Batch, Time, Channels)
input = (torch.rand(1, 100, 16) > 0.2).float()

# Compute output with the model
with torch.no_grad():
    output = model(input)

print(output.shape)

You can see above that the output of the SNN model defined above has the shape (batch, time, neurons), where neurons is the number of neurons in the final layer of the model.

Note that the network state is retained after any forward pass/inference. If you require resetting of the states/gradient, you can do so using the corresponding methods layer.reset_states() or layer.zero_grad().

Training with BPTT#

BPTT (Back-Propagation-Through-Time) refers to training a model with data that spans several time steps. Crucially, to train models on such data, the model needs to learn the temporal dependence in the data and therefore, the computed gradients need to be propagated back in time in addition to the propagation along its layers.

Sinabs enables you to train SNNs using BPTT to take full advantage of the temporal computation and memory afforded by spiking neurons. You see below a small example of how you can train your Sinabs models using BPTT.

We first start with a couple of helper functions that loop over all the layers in our model and reset their states and gradients. You will see how they come handy in the next code block.

# Some helper functions to reset our model during the training loops
def reset_model_states(seq_model: nn.Sequential, randomize: bool = False):
    """
    Method to reset the internal states of a model
    """
    for lyr in seq_model:
        if isinstance(lyr, sl.LIF):
            lyr.reset_states(randomize=randomize)
    return


def zero_grad_states(seq_model: nn.Sequential):
    """
    Method to reset the gradients of the internal states of a model
    """
    for lyr in seq_model:
        if isinstance(lyr, sl.LIF):
            lyr.zero_grad()
    return

For the purpose of this demonstration, we define a very simple toy task:

Train the model to produce 10 spikes in response to an input spike pattern from 16 spiking neurons.

For simplicity, we generate a random spike train and use that as our input spike pattern.

Like with any standard training loop in pytorch, we start by defining an optimizer and loop over several training epochs.

In each training loop, the following steps are carried out.

Reset the parameter gradients.
Reset the state/vmem gradients.
Reset the model state/vmem to an initial condition.
Perform a forward pass.
Calculate the loss.
Backpropagate gradients based on the computed loss.
Update parameters.

Note the additional steps 2 and 3. These are additional required inorder to account for the stateful nature of spiking layers in our model.

# Define an input (Batch, Time, Channels)
input_data = (torch.rand(1, 100, 16) > 0.2).float()

# Training routine
optim = torch.optim.RMSprop(model.parameters(), lr=1e-3)
num_epochs = 100
target_num_spikes = 10

for epoch in range(num_epochs):
    # Reset the gradients of the parameters
    optim.zero_grad()

    # We will also need to reset the gradients of neuron states.
    zero_grad_states(model)
    # Alternatively you could also reset the states themselves.
    reset_model_states(model, randomize=False)

    # Forward pass
    out = model(input_data)
    print(f"Epoch {epoch}: Output spikes: {out.sum().item()}")

    # Compute loss
    loss = (out.sum() - target_num_spikes) ** 2

    # Back-propagate the gradients.
    loss.backward()

    # Update parameters
    optim.step()

    # Early stopage
    if not loss:
        break

out.sum(), out.shape

We see above that the model trains to produce 10 spikes as intended.

That is it! Now you know everything you need to know about training models with Sinabs!

Working with Convolutional networks#

When working with convolutional connectivity, a nn.Conv2d layer only takes as input a tensor of (Batch, Channels, Height, Width). If we feed a tensor that has an additional time dimension (Batch, Time, Channels, Height, Width) to such a layer, we will receive an error. In order for us to apply 2D convolutions across time, we have to make use of a small trick where we flatten batch and time dimension before feeding it to the Conv layer. If the input is flattened, the Squeeze versions of spiking Sinabs layers understand and take care of expanding the time dimension appropriately, without any major changes to your model definition.

batch_size = 8
time_steps = 100

conv_model = nn.Sequential(
    nn.Conv2d(2, 16, kernel_size=3),
    sl.LIFSqueeze(tau_mem=20.0, batch_size=batch_size),
    nn.Conv2d(16, 32, kernel_size=3),
    sl.LIFSqueeze(tau_mem=20.0, batch_size=batch_size),
    nn.Flatten(),
    nn.Linear(512, 4),
)

# (Batch, Time, Channels, Height, Width)
data = torch.rand(batch_size, time_steps, 2, 8, 8)

# Data reshaped to fit the flattened model definition
input = data.view(batch_size * time_steps, 2, 8, 8)

print(input.shape)

The rest of the forward pass or training loops remain the same as described in the above sections.

with torch.no_grad():
    output = conv_model(input)

This output has to then be reshaped to split and restore batch and time dimensions.

output_spike_raster = output.view(batch_size, time_steps, 4)
print(output_spike_raster.shape)

0.3.3

Quickstart Sinabs

Contents