Some questions about the SViT method #14
I have an interesting question: since most token pruning methods only prune layers 4 to 12 of the ViT model, have you tried pruning the early layers (layer 1 ~ layer 3)? And how was the performance? Hope to get your reply, thanks!

Yes, we tried pruning tokens in layers 1~3 during some initial experiments on classification, and the performance dropped significantly. We also tested some other pruning ratios for different layers, but didn't easily find one that outperformed the default setting provided by DynamicViT and EViT. However, there is a relevant paper on pruning ratios for different layers: "DiffRate: Differentiable Compression Rate for Efficient Vision Transformers". I hope this might be helpful!

@kaikai23 Thanks for your reply! May I ask how you set the token keep ratio for all the layers? I think most token pruning methods follow a [k, k^2, k^3] setting in layers 4 ~ 12.

@kaikai23 Oh, I mean the settings of all 12 layers (layer 1 ~ layer 12).

Hi, we kept all the tokens in layer 1 ~ layer 3, and kept 70%, 70%, 70%, 49%, 49%, 49%, 34.3%, 34.3%, 34.3% of the tokens in layer 4 ~ layer 12.
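For reference, here is a minimal sketch of how that schedule could be generated in Python, assuming a 12-layer ViT and the [k, k^2, k^3] convention with k = 0.7; the `keep_ratio_schedule` helper is hypothetical and not part of the SViT repository:

```python
# Minimal sketch: per-layer token keep ratios (relative to the original
# token count) for a 12-layer ViT. Pruning is applied at layers 4, 7, and
# 10, each time keeping `base_keep` of the remaining tokens, which yields
# the 100% / 70% / 49% / 34.3% schedule described above.
# `keep_ratio_schedule` is a hypothetical helper, not from the SViT repo.

def keep_ratio_schedule(num_layers=12, base_keep=0.7, prune_start=4, period=3):
    """Return the fraction of the original tokens kept at each layer (1-indexed)."""
    ratios = []
    keep = 1.0
    for layer in range(1, num_layers + 1):
        # Prune once every `period` layers, starting at `prune_start`.
        if layer >= prune_start and (layer - prune_start) % period == 0:
            keep *= base_keep
        ratios.append(round(keep, 4))
    return ratios

if __name__ == "__main__":
    for layer, r in enumerate(keep_ratio_schedule(), start=1):
        print(f"layer {layer:2d}: keep {r:.1%} of tokens")
    # layers 1-3: 100.0%, layers 4-6: 70.0%,
    # layers 7-9: 49.0%, layers 10-12: 34.3%
```

Note that each pruning step is multiplicative on the surviving tokens, so the per-layer fractions are powers of k rather than fixed decrements.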