File "/home/tmp/SAE_BoardGameEval/circuits/dictionary_learning/buffer.py", line 404, in refresh
self.activations = t.cat([self.activations, hidden_states.to(self.device)], dim=0)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: Sizes of tensors must match except in dimension 0. Expected size 2048 but got size 512 for tensor number 1 in the list.
Apologies for this! I fixed chess_sae_trainer.py. There was a mismatch between submodule_type (mlp, dim 2048) and submodule (resid_post, dim 512).
However, after further thought I believe train_saes_parallel.py is the better default training script. I have archived chess_sae_trainer.py and othello_sae_trainer.py, and updated the training README with instructions for using train_saes_parallel.py.
I have tested this script on both ChessGPT and OthelloGPT using a variety of SAE training types (TopK, P_Anneal, Gated, and Standard).
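For anyone hitting the same RuntimeError: the mismatch described above (mlp, dim 2048 vs. resid_post, dim 512) can be caught before the buffer tries to concatenate. Below is a minimal pure-Python sketch of the shape rule the error message describes, not code from this repo. The helper name check_cat_shapes and the batch sizes 1000 and 256 are hypothetical; only the 2048 and 512 come from the traceback. It assumes a non-negative dim.

```python
from typing import Sequence, Tuple

Shape = Tuple[int, ...]


def check_cat_shapes(shapes: Sequence[Shape], dim: int = 0) -> Shape:
    """Return the concatenated shape, or raise ValueError on a mismatch.

    Hypothetical helper: every dimension except `dim` must agree across
    the shapes being concatenated (simplified, non-negative `dim` only).
    """
    if not shapes:
        raise ValueError("need at least one shape")
    first = shapes[0]
    total = first[dim]
    for idx, shape in enumerate(shapes[1:], start=1):
        if len(shape) != len(first):
            raise ValueError(
                f"tensor number {idx} has {len(shape)} dims, expected {len(first)}"
            )
        for axis, (expected, got) in enumerate(zip(first, shape)):
            if axis != dim and expected != got:
                raise ValueError(
                    f"Expected size {expected} but got size {got} "
                    f"for tensor number {idx} (axis {axis})"
                )
        total += shape[dim]
    out = list(first)
    out[dim] = total
    return tuple(out)


# Illustrative shapes: a buffer sized for the mlp width (2048) receiving
# resid_post activations (512). The row counts are made up.
buffer_shape = (1000, 2048)
new_batch_shape = (256, 512)
try:
    check_cat_shapes([buffer_shape, new_batch_shape], dim=0)
except ValueError as err:
    print(err)
```

Running a check like this on the buffer's configured activation width and the width of the first batch from the hooked submodule would surface the mismatch at setup time rather than partway through refresh.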
Trying to repro the chess SAE training:
After modifying this line to pass the meta.pkl from circuits/resources/meta.pkl:
https://github.com/adamkarvonen/SAE_BoardGameEval/blob/master/circuits/sae_training/chess_sae_trainer.py#L65
I get: