
Different convolutions within same TCN layer #14

Open
FabianB98 opened this issue May 22, 2024 · 3 comments

@FabianB98

Hi,

I'm currently trying to implement the network architecture described in the paper "User-Driven Fine-Tuning for Beat Tracking" by Pinto et al., 2021. In this architecture, the authors propose a TCN in which each layer uses two separate dilated convolutions, the second with twice the dilation rate of the first. Figure 2 of the paper depicts their TCN layout as follows:

[Screenshot of Figure 2 from Pinto et al., 2021: TCN layer layout]

As you can see, there are two dilated convolutions per TCN layer: "Dilated Convolution 1" with a dilation rate of dr1, and "Dilated Convolution 2" with a dilation rate of dr2 = 2 * dr1. The outputs of these convolutions are concatenated before the activation function, dropout, and a 1x1 convolution (which keeps the channel dimensionality constant across TCN layers) are applied.

From what I have found so far, it appears that this package only supports a single dilation rate within each TCN layer, which leads me to believe that this architecture cannot be implemented with this Python package. Is my understanding correct? Or am I missing something (potentially obvious), and it is in fact possible to implement the proposed architecture with this package?

@paul-krug
Owner

The architecture of the residual block is indeed different from the one implemented in this package. However, you could fork the repo and modify the temporal block to include two parallel convolutions with different dilation rates. That should be relatively straightforward, as in the sketch below.
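
For illustration, here is a minimal sketch of such a modified block in plain PyTorch, written independently of this package's internals. The class name, the ELU activation, the non-causal "same" padding, and the default sizes are assumptions for the sketch; details like causality and residual connections should follow the paper.

```python
import torch
import torch.nn as nn


class DualDilationBlock(nn.Module):
    """One TCN layer with two parallel dilated convolutions whose outputs
    are concatenated, followed by activation, dropout, and a 1x1 convolution.
    Names and defaults are illustrative, not taken from this package."""

    def __init__(self, channels, kernel_size=5, dilation=1, dropout=0.1):
        super().__init__()
        # Two parallel dilated convolutions; the second uses twice the
        # dilation rate of the first. Padding keeps the sequence length
        # unchanged (non-causal "same" padding for simplicity).
        self.conv1 = nn.Conv1d(
            channels, channels, kernel_size,
            dilation=dilation,
            padding=(kernel_size - 1) // 2 * dilation,
        )
        self.conv2 = nn.Conv1d(
            channels, channels, kernel_size,
            dilation=2 * dilation,
            padding=(kernel_size - 1) // 2 * 2 * dilation,
        )
        self.activation = nn.ELU()
        self.dropout = nn.Dropout(dropout)
        # The 1x1 convolution maps the concatenated 2 * channels back to
        # channels, keeping the dimensionality constant across layers.
        self.conv1x1 = nn.Conv1d(2 * channels, channels, kernel_size=1)

    def forward(self, x):
        # x has shape (batch, channels, time).
        y = torch.cat([self.conv1(x), self.conv2(x)], dim=1)
        y = self.dropout(self.activation(y))
        return self.conv1x1(y)
```

Stacking several of these blocks with the dilation rate doubling from layer to layer would then approximate the layout shown in Figure 2.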

@FabianB98
Author

Thank you for your response. I'll adjust the temporal block in a fork later this week or next week, when I find time to do so. Would you mind if I tried to incorporate these changes in a non-API-breaking way, so that we could merge them in a PR?

@FabianB98
Author

It took a bit longer than anticipated, but I think my changes are now ready to be reviewed in a PR (see #15). I wanted to be sure the changes actually work, so I trained a network with the modified package (and I found and tweaked a few things along the way). Who would have thought that training a neural network on just a regular consumer GPU might take a while :D
