Traceback (most recent call last): File "", line 11, in File "/home/ubuntu/.local/lib/python3.10/site-packages/safetensors/torch.py", line 286, in save_file serialize_file(_flatten(tensors), filename, metadata=metadata) File "/home/ubuntu/.local/lib/python3.10/site-packages/safetensors/torch.py", line 488, in _flatten raise RuntimeError( RuntimeError: Some tensors share memory, this will lead to duplicate memory on disk and potential differences when loading them again: [{'ema_model.transformer.time_embed.time_mlp.0.weight', 'transformer.time_embed.time_mlp.0.weight'}, {'ema_model.transformer.time_embed.time_mlp.0.bias', 'transformer.time_embed.time_mlp.0.bias'}, {'transformer.time_embed.time_mlp.2.weight', 'ema_model.transformer.time_embed.time_mlp.2.weight'}, {'transformer.time_embed.time_mlp.2.bias', 'ema_model.transformer.time_embed.time_mlp.2.bias'}, {'transformer.text_embed.text_embed.weight', 'ema_model.transformer.text_embed.text_embed.weight'}, {'ema_model.transformer.text_embed.text_blocks.0.dwconv.weight', 'transformer.text_embed.text_blocks.0.dwconv.weight'}, {'ema_model.transformer.text_embed.text_blocks.0.dwconv.bias', 'transformer.text_embed.text_blocks.0.dwconv.bias'}, {'transformer.text_embed.text_blocks.0.norm.weight', 'ema_model.transformer.text_embed.text_blocks.0.norm.weight'}, {'transformer.text_embed.text_blocks.0.norm.bias', 'ema_model.transformer.text_embed.text_blocks.0.norm.bias'}, {'transformer.text_embed.text_blocks.0.pwconv1.weight', 'ema_model.transformer.text_embed.text_blocks.0.pwconv1.weight'}, {'transformer.text_embed.text_blocks.0.pwconv1.bias', 'ema_model.transformer.text_embed.text_blocks.0.pwconv1.bias'}, {'ema_model.transformer.text_embed.text_blocks.0.grn.gamma', 'transformer.text_embed.text_blocks.0.grn.gamma'}, {'ema_model.transformer.text_embed.text_blocks.0.grn.beta', 'transformer.text_embed.text_blocks.0.grn.beta'}, {'ema_model.transformer.text_embed.text_blocks.0.pwconv2.weight', 'transformer.text_embed.text_blocks.0.pwconv2.weight'}, {'transformer.text_embed.text_blocks.0.pwconv2.bias', 'ema_model.transformer.text_embed.text_blocks.0.pwconv2.bias'}, {'ema_model.transformer.text_embed.text_blocks.1.dwconv.weight', 'transformer.text_embed.text_blocks.1.dwconv.weight'}, {'ema_model.transformer.text_embed.text_blocks.1.dwconv.bias', 'transformer.text_embed.text_blocks.1.dwconv.bias'}, {'ema_model.transformer.text_embed.text_blocks.1.norm.weight', 'transformer.text_embed.text_blocks.1.norm.weight'}, {'transformer.text_embed.text_blocks.1.norm.bias', 'ema_model.transformer.text_embed.text_blocks.1.norm.bias'}, {'ema_model.transformer.text_embed.text_blocks.1.pwconv1.weight', 'transformer.text_embed.text_blocks.1.pwconv1.weight'}, {'transformer.text_embed.text_blocks.1.pwconv1.bias', 'ema_model.transformer.text_embed.text_blocks.1.pwconv1.bias'}, {'transformer.text_embed.text_blocks.1.grn.gamma', 'ema_model.transformer.text_embed.text_blocks.1.grn.gamma'}, {'transformer.text_embed.text_blocks.1.grn.beta', 'ema_model.transformer.text_embed.text_blocks.1.grn.beta'}, {'ema_model.transformer.text_embed.text_blocks.1.pwconv2.weight', 'transformer.text_embed.text_blocks.1.pwconv2.weight'}, {'ema_model.transformer.text_embed.text_blocks.1.pwconv2.bias', 'transformer.text_embed.text_blocks.1.pwconv2.bias'}, {'transformer.text_embed.text_blocks.2.dwconv.weight', 'ema_model.transformer.text_embed.text_blocks.2.dwconv.weight'}, {'transformer.text_embed.text_blocks.2.dwconv.bias', 'ema_model.transformer.text_embed.text_blocks.2.dwconv.bias'}, {'ema_model.transformer.text_embed.text_blocks.2.norm.weight', 'transformer.text_embed.text_blocks.2.norm.weight'}, {'ema_model.transformer.text_embed.text_blocks.2.norm.bias', 'transformer.text_embed.text_blocks.2.norm.bias'}, {'transformer.text_embed.text_blocks.2.pwconv1.weight', 'ema_model.transformer.text_embed.text_blocks.2.pwconv1.weight'}, {'transformer.text_embed.text_blocks.2.pwconv1.bias', 'ema_model.transformer.text_embed.text_blocks.2.pwconv1.bias'}, {'ema_model.transformer.text_embed.text_blocks.2.grn.gamma', 'transformer.text_embed.text_blocks.2.grn.gamma'}, {'transformer.text_embed.text_blocks.2.grn.beta', 'ema_model.transformer.text_embed.text_blocks.2.grn.beta'}, {'transformer.text_embed.text_blocks.2.pwconv2.weight', 'ema_model.transformer.text_embed.text_blocks.2.pwconv2.weight'}, {'transformer.text_embed.text_blocks.2.pwconv2.bias', 'ema_model.transformer.text_embed.text_blocks.2.pwconv2.bias'}, {'transformer.text_embed.text_blocks.3.dwconv.weight', 'ema_model.transformer.text_embed.text_blocks.3.dwconv.weight'}, {'ema_model.transformer.text_embed.text_blocks.3.dwconv.bias', 'transformer.text_embed.text_blocks.3.dwconv.bias'}, {'ema_model.transformer.text_embed.text_blocks.3.norm.weight', 'transformer.text_embed.text_blocks.3.norm.weight'}, {'ema_model.transformer.text_embed.text_blocks.3.norm.bias', 'transformer.text_embed.text_blocks.3.norm.bias'}, {'transformer.text_embed.text_blocks.3.pwconv1.weight', 'ema_model.transformer.text_embed.text_blocks.3.pwconv1.weight'}, {'ema_model.transformer.text_embed.text_blocks.3.pwconv1.bias', 'transformer.text_embed.text_blocks.3.pwconv1.bias'}, {'transformer.text_embed.text_blocks.3.grn.gamma', 'ema_model.transformer.text_embed.text_blocks.3.grn.gamma'}, {'transformer.text_embed.text_blocks.3.grn.beta', 'ema_model.transformer.text_embed.text_blocks.3.grn.beta'}, {'ema_model.transformer.text_embed.text_blocks.3.pwconv2.weight', 'transformer.text_embed.text_blocks.3.pwconv2.weight'}, {'ema_model.transformer.text_embed.text_blocks.3.pwconv2.bias', 'transformer.text_embed.text_blocks.3.pwconv2.bias'}, {'transformer.input_embed.proj.weight', 'ema_model.transformer.input_embed.proj.weight'}, {'ema_model.transformer.input_embed.proj.bias', 'transformer.input_embed.proj.bias'}, {'transformer.input_embed.conv_pos_embed.conv1d.0.weight', 'ema_model.transformer.input_embed.conv_pos_embed.conv1d.0.weight'}, {'ema_model.transformer.input_embed.conv_pos_embed.conv1d.0.bias', 'transformer.input_embed.conv_pos_embed.conv1d.0.bias'}, {'ema_model.transformer.input_embed.conv_pos_embed.conv1d.2.weight', 'transformer.input_embed.conv_pos_embed.conv1d.2.weight'}, {'ema_model.transformer.input_embed.conv_pos_embed.conv1d.2.bias', 'transformer.input_embed.conv_pos_embed.conv1d.2.bias'}, {'transformer.rotary_embed.inv_freq', 'ema_model.transformer.rotary_embed.inv_freq'}, {'transformer.transformer_blocks.0.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.0.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.0.attn_norm.linear.bias', 'transformer.transformer_blocks.0.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.0.attn.to_q.weight', 'transformer.transformer_blocks.0.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.0.attn.to_q.bias', 'transformer.transformer_blocks.0.attn.to_q.bias'}, {'transformer.transformer_blocks.0.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.0.attn.to_k.weight'}, {'transformer.transformer_blocks.0.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.0.attn.to_k.bias'}, {'transformer.transformer_blocks.0.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.0.attn.to_v.weight'}, {'transformer.transformer_blocks.0.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.0.attn.to_v.bias'}, {'transformer.transformer_blocks.0.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.0.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.0.attn.to_out.0.bias', 'transformer.transformer_blocks.0.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.0.ff.ff.0.0.weight', 'transformer.transformer_blocks.0.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.0.ff.ff.0.0.bias', 'transformer.transformer_blocks.0.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.0.ff.ff.2.weight', 'transformer.transformer_blocks.0.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.0.ff.ff.2.bias', 'transformer.transformer_blocks.0.ff.ff.2.bias'}, {'transformer.transformer_blocks.1.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.1.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.1.attn_norm.linear.bias', 'transformer.transformer_blocks.1.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.1.attn.to_q.weight', 'transformer.transformer_blocks.1.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.1.attn.to_q.bias', 'transformer.transformer_blocks.1.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.1.attn.to_k.weight', 'transformer.transformer_blocks.1.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.1.attn.to_k.bias', 'transformer.transformer_blocks.1.attn.to_k.bias'}, {'transformer.transformer_blocks.1.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.1.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.1.attn.to_v.bias', 'transformer.transformer_blocks.1.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.1.attn.to_out.0.weight', 'transformer.transformer_blocks.1.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.1.attn.to_out.0.bias', 'transformer.transformer_blocks.1.attn.to_out.0.bias'}, {'transformer.transformer_blocks.1.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.1.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.1.ff.ff.0.0.bias', 'transformer.transformer_blocks.1.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.1.ff.ff.2.weight', 'transformer.transformer_blocks.1.ff.ff.2.weight'}, {'transformer.transformer_blocks.1.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.1.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.2.attn_norm.linear.weight', 'transformer.transformer_blocks.2.attn_norm.linear.weight'}, {'transformer.transformer_blocks.2.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.2.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.2.attn.to_q.weight', 'transformer.transformer_blocks.2.attn.to_q.weight'}, {'transformer.transformer_blocks.2.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.2.attn.to_q.bias'}, {'transformer.transformer_blocks.2.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.2.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.2.attn.to_k.bias', 'transformer.transformer_blocks.2.attn.to_k.bias'}, {'transformer.transformer_blocks.2.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.2.attn.to_v.weight'}, {'transformer.transformer_blocks.2.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.2.attn.to_v.bias'}, {'transformer.transformer_blocks.2.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.2.attn.to_out.0.weight'}, {'transformer.transformer_blocks.2.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.2.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.2.ff.ff.0.0.weight', 'transformer.transformer_blocks.2.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.2.ff.ff.0.0.bias', 'transformer.transformer_blocks.2.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.2.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.2.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.2.ff.ff.2.bias', 'transformer.transformer_blocks.2.ff.ff.2.bias'}, {'transformer.transformer_blocks.3.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.3.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.3.attn_norm.linear.bias', 'transformer.transformer_blocks.3.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.3.attn.to_q.weight', 'transformer.transformer_blocks.3.attn.to_q.weight'}, {'transformer.transformer_blocks.3.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.3.attn.to_q.bias'}, {'transformer.transformer_blocks.3.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.3.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.3.attn.to_k.bias', 'transformer.transformer_blocks.3.attn.to_k.bias'}, {'transformer.transformer_blocks.3.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.3.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.3.attn.to_v.bias', 'transformer.transformer_blocks.3.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.3.attn.to_out.0.weight', 'transformer.transformer_blocks.3.attn.to_out.0.weight'}, {'transformer.transformer_blocks.3.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.3.attn.to_out.0.bias'}, {'transformer.transformer_blocks.3.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.3.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.3.ff.ff.0.0.bias', 'transformer.transformer_blocks.3.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.3.ff.ff.2.weight', 'transformer.transformer_blocks.3.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.3.ff.ff.2.bias', 'transformer.transformer_blocks.3.ff.ff.2.bias'}, {'transformer.transformer_blocks.4.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.4.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.4.attn_norm.linear.bias', 'transformer.transformer_blocks.4.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.4.attn.to_q.weight', 'transformer.transformer_blocks.4.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.4.attn.to_q.bias', 'transformer.transformer_blocks.4.attn.to_q.bias'}, {'transformer.transformer_blocks.4.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.4.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.4.attn.to_k.bias', 'transformer.transformer_blocks.4.attn.to_k.bias'}, {'transformer.transformer_blocks.4.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.4.attn.to_v.weight'}, {'transformer.transformer_blocks.4.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.4.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.4.attn.to_out.0.weight', 'transformer.transformer_blocks.4.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.4.attn.to_out.0.bias', 'transformer.transformer_blocks.4.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.4.ff.ff.0.0.weight', 'transformer.transformer_blocks.4.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.4.ff.ff.0.0.bias', 'transformer.transformer_blocks.4.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.4.ff.ff.2.weight', 'transformer.transformer_blocks.4.ff.ff.2.weight'}, {'transformer.transformer_blocks.4.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.4.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.5.attn_norm.linear.weight', 'transformer.transformer_blocks.5.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.5.attn_norm.linear.bias', 'transformer.transformer_blocks.5.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.5.attn.to_q.weight', 'transformer.transformer_blocks.5.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.5.attn.to_q.bias', 'transformer.transformer_blocks.5.attn.to_q.bias'}, {'transformer.transformer_blocks.5.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.5.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.5.attn.to_k.bias', 'transformer.transformer_blocks.5.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.5.attn.to_v.weight', 'transformer.transformer_blocks.5.attn.to_v.weight'}, {'transformer.transformer_blocks.5.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.5.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.5.attn.to_out.0.weight', 'transformer.transformer_blocks.5.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.5.attn.to_out.0.bias', 'transformer.transformer_blocks.5.attn.to_out.0.bias'}, {'transformer.transformer_blocks.5.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.5.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.5.ff.ff.0.0.bias', 'transformer.transformer_blocks.5.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.5.ff.ff.2.weight', 'transformer.transformer_blocks.5.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.5.ff.ff.2.bias', 'transformer.transformer_blocks.5.ff.ff.2.bias'}, {'transformer.transformer_blocks.6.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.6.attn_norm.linear.weight'}, {'transformer.transformer_blocks.6.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.6.attn_norm.linear.bias'}, {'transformer.transformer_blocks.6.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.6.attn.to_q.weight'}, {'transformer.transformer_blocks.6.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.6.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.6.attn.to_k.weight', 'transformer.transformer_blocks.6.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.6.attn.to_k.bias', 'transformer.transformer_blocks.6.attn.to_k.bias'}, {'transformer.transformer_blocks.6.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.6.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.6.attn.to_v.bias', 'transformer.transformer_blocks.6.attn.to_v.bias'}, {'transformer.transformer_blocks.6.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.6.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.6.attn.to_out.0.bias', 'transformer.transformer_blocks.6.attn.to_out.0.bias'}, {'transformer.transformer_blocks.6.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.6.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.6.ff.ff.0.0.bias', 'transformer.transformer_blocks.6.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.6.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.6.ff.ff.2.weight'}, {'transformer.transformer_blocks.6.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.6.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.7.attn_norm.linear.weight', 'transformer.transformer_blocks.7.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.7.attn_norm.linear.bias', 'transformer.transformer_blocks.7.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.7.attn.to_q.weight', 'transformer.transformer_blocks.7.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.7.attn.to_q.bias', 'transformer.transformer_blocks.7.attn.to_q.bias'}, {'transformer.transformer_blocks.7.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.7.attn.to_k.weight'}, {'transformer.transformer_blocks.7.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.7.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.7.attn.to_v.weight', 'transformer.transformer_blocks.7.attn.to_v.weight'}, {'transformer.transformer_blocks.7.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.7.attn.to_v.bias'}, {'transformer.transformer_blocks.7.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.7.attn.to_out.0.weight'}, {'transformer.transformer_blocks.7.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.7.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.7.ff.ff.0.0.weight', 'transformer.transformer_blocks.7.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.7.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.7.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.7.ff.ff.2.weight', 'transformer.transformer_blocks.7.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.7.ff.ff.2.bias', 'transformer.transformer_blocks.7.ff.ff.2.bias'}, {'transformer.transformer_blocks.8.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.8.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.8.attn_norm.linear.bias', 'transformer.transformer_blocks.8.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.8.attn.to_q.weight', 'transformer.transformer_blocks.8.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.8.attn.to_q.bias', 'transformer.transformer_blocks.8.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.8.attn.to_k.weight', 'transformer.transformer_blocks.8.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.8.attn.to_k.bias', 'transformer.transformer_blocks.8.attn.to_k.bias'}, {'transformer.transformer_blocks.8.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.8.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.8.attn.to_v.bias', 'transformer.transformer_blocks.8.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.8.attn.to_out.0.weight', 'transformer.transformer_blocks.8.attn.to_out.0.weight'}, {'transformer.transformer_blocks.8.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.8.attn.to_out.0.bias'}, {'transformer.transformer_blocks.8.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.8.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.8.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.8.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.8.ff.ff.2.weight', 'transformer.transformer_blocks.8.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.8.ff.ff.2.bias', 'transformer.transformer_blocks.8.ff.ff.2.bias'}, {'transformer.transformer_blocks.9.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.9.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.9.attn_norm.linear.bias', 'transformer.transformer_blocks.9.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.9.attn.to_q.weight', 'transformer.transformer_blocks.9.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.9.attn.to_q.bias', 'transformer.transformer_blocks.9.attn.to_q.bias'}, {'transformer.transformer_blocks.9.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.9.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.9.attn.to_k.bias', 'transformer.transformer_blocks.9.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.9.attn.to_v.weight', 'transformer.transformer_blocks.9.attn.to_v.weight'}, {'transformer.transformer_blocks.9.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.9.attn.to_v.bias'}, {'transformer.transformer_blocks.9.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.9.attn.to_out.0.weight'}, {'transformer.transformer_blocks.9.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.9.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.9.ff.ff.0.0.weight', 'transformer.transformer_blocks.9.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.9.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.9.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.9.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.9.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.9.ff.ff.2.bias', 'transformer.transformer_blocks.9.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.10.attn_norm.linear.weight', 'transformer.transformer_blocks.10.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.10.attn_norm.linear.bias', 'transformer.transformer_blocks.10.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.10.attn.to_q.weight', 'transformer.transformer_blocks.10.attn.to_q.weight'}, {'transformer.transformer_blocks.10.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.10.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.10.attn.to_k.weight', 'transformer.transformer_blocks.10.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.10.attn.to_k.bias', 'transformer.transformer_blocks.10.attn.to_k.bias'}, {'transformer.transformer_blocks.10.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.10.attn.to_v.weight'}, {'transformer.transformer_blocks.10.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.10.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.10.attn.to_out.0.weight', 'transformer.transformer_blocks.10.attn.to_out.0.weight'}, {'transformer.transformer_blocks.10.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.10.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.10.ff.ff.0.0.weight', 'transformer.transformer_blocks.10.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.10.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.10.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.10.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.10.ff.ff.2.weight'}, {'transformer.transformer_blocks.10.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.10.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.11.attn_norm.linear.weight', 'transformer.transformer_blocks.11.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.11.attn_norm.linear.bias', 'transformer.transformer_blocks.11.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.11.attn.to_q.weight', 'transformer.transformer_blocks.11.attn.to_q.weight'}, {'transformer.transformer_blocks.11.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.11.attn.to_q.bias'}, {'transformer.transformer_blocks.11.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.11.attn.to_k.weight'}, {'transformer.transformer_blocks.11.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.11.attn.to_k.bias'}, {'transformer.transformer_blocks.11.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.11.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.11.attn.to_v.bias', 'transformer.transformer_blocks.11.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.11.attn.to_out.0.weight', 'transformer.transformer_blocks.11.attn.to_out.0.weight'}, {'transformer.transformer_blocks.11.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.11.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.11.ff.ff.0.0.weight', 'transformer.transformer_blocks.11.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.11.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.11.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.11.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.11.ff.ff.2.weight'}, {'transformer.transformer_blocks.11.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.11.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.12.attn_norm.linear.weight', 'transformer.transformer_blocks.12.attn_norm.linear.weight'}, {'transformer.transformer_blocks.12.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.12.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.12.attn.to_q.weight', 'transformer.transformer_blocks.12.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.12.attn.to_q.bias', 'transformer.transformer_blocks.12.attn.to_q.bias'}, {'transformer.transformer_blocks.12.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.12.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.12.attn.to_k.bias', 'transformer.transformer_blocks.12.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.12.attn.to_v.weight', 'transformer.transformer_blocks.12.attn.to_v.weight'}, {'transformer.transformer_blocks.12.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.12.attn.to_v.bias'}, {'transformer.transformer_blocks.12.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.12.attn.to_out.0.weight'}, {'transformer.transformer_blocks.12.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.12.attn.to_out.0.bias'}, {'transformer.transformer_blocks.12.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.12.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.12.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.12.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.12.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.12.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.12.ff.ff.2.bias', 'transformer.transformer_blocks.12.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.13.attn_norm.linear.weight', 'transformer.transformer_blocks.13.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.13.attn_norm.linear.bias', 'transformer.transformer_blocks.13.attn_norm.linear.bias'}, {'transformer.transformer_blocks.13.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.13.attn.to_q.weight'}, {'transformer.transformer_blocks.13.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.13.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.13.attn.to_k.weight', 'transformer.transformer_blocks.13.attn.to_k.weight'}, {'transformer.transformer_blocks.13.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.13.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.13.attn.to_v.weight', 'transformer.transformer_blocks.13.attn.to_v.weight'}, {'transformer.transformer_blocks.13.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.13.attn.to_v.bias'}, {'transformer.transformer_blocks.13.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.13.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.13.attn.to_out.0.bias', 'transformer.transformer_blocks.13.attn.to_out.0.bias'}, {'transformer.transformer_blocks.13.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.13.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.13.ff.ff.0.0.bias', 'transformer.transformer_blocks.13.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.13.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.13.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.13.ff.ff.2.bias', 'transformer.transformer_blocks.13.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.14.attn_norm.linear.weight', 'transformer.transformer_blocks.14.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.14.attn_norm.linear.bias', 'transformer.transformer_blocks.14.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.14.attn.to_q.weight', 'transformer.transformer_blocks.14.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.14.attn.to_q.bias', 'transformer.transformer_blocks.14.attn.to_q.bias'}, {'transformer.transformer_blocks.14.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.14.attn.to_k.weight'}, {'transformer.transformer_blocks.14.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.14.attn.to_k.bias'}, {'transformer.transformer_blocks.14.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.14.attn.to_v.weight'}, {'transformer.transformer_blocks.14.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.14.attn.to_v.bias'}, {'transformer.transformer_blocks.14.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.14.attn.to_out.0.weight'}, {'transformer.transformer_blocks.14.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.14.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.14.ff.ff.0.0.weight', 'transformer.transformer_blocks.14.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.14.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.14.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.14.ff.ff.2.weight', 'transformer.transformer_blocks.14.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.14.ff.ff.2.bias', 'transformer.transformer_blocks.14.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.15.attn_norm.linear.weight', 'transformer.transformer_blocks.15.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.15.attn_norm.linear.bias', 'transformer.transformer_blocks.15.attn_norm.linear.bias'}, {'transformer.transformer_blocks.15.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.15.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.15.attn.to_q.bias', 'transformer.transformer_blocks.15.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.15.attn.to_k.weight', 'transformer.transformer_blocks.15.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.15.attn.to_k.bias', 'transformer.transformer_blocks.15.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.15.attn.to_v.weight', 'transformer.transformer_blocks.15.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.15.attn.to_v.bias', 'transformer.transformer_blocks.15.attn.to_v.bias'}, {'transformer.transformer_blocks.15.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.15.attn.to_out.0.weight'}, {'transformer.transformer_blocks.15.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.15.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.15.ff.ff.0.0.weight', 'transformer.transformer_blocks.15.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.15.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.15.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.15.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.15.ff.ff.2.weight'}, {'transformer.transformer_blocks.15.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.15.ff.ff.2.bias'}, {'transformer.transformer_blocks.16.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.16.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.16.attn_norm.linear.bias', 'transformer.transformer_blocks.16.attn_norm.linear.bias'}, {'transformer.transformer_blocks.16.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.16.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.16.attn.to_q.bias', 'transformer.transformer_blocks.16.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.16.attn.to_k.weight', 'transformer.transformer_blocks.16.attn.to_k.weight'}, {'transformer.transformer_blocks.16.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.16.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.16.attn.to_v.weight', 'transformer.transformer_blocks.16.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.16.attn.to_v.bias', 'transformer.transformer_blocks.16.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.16.attn.to_out.0.weight', 'transformer.transformer_blocks.16.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.16.attn.to_out.0.bias', 'transformer.transformer_blocks.16.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.16.ff.ff.0.0.weight', 'transformer.transformer_blocks.16.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.16.ff.ff.0.0.bias', 'transformer.transformer_blocks.16.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.16.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.16.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.16.ff.ff.2.bias', 'transformer.transformer_blocks.16.ff.ff.2.bias'}, {'transformer.transformer_blocks.17.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.17.attn_norm.linear.weight'}, {'transformer.transformer_blocks.17.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.17.attn_norm.linear.bias'}, {'transformer.transformer_blocks.17.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.17.attn.to_q.weight'}, {'transformer.transformer_blocks.17.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.17.attn.to_q.bias'}, {'transformer.transformer_blocks.17.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.17.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.17.attn.to_k.bias', 'transformer.transformer_blocks.17.attn.to_k.bias'}, {'transformer.transformer_blocks.17.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.17.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.17.attn.to_v.bias', 'transformer.transformer_blocks.17.attn.to_v.bias'}, {'transformer.transformer_blocks.17.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.17.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.17.attn.to_out.0.bias', 'transformer.transformer_blocks.17.attn.to_out.0.bias'}, {'transformer.transformer_blocks.17.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.17.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.17.ff.ff.0.0.bias', 'transformer.transformer_blocks.17.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.17.ff.ff.2.weight', 'transformer.transformer_blocks.17.ff.ff.2.weight'}, {'transformer.transformer_blocks.17.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.17.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.18.attn_norm.linear.weight', 'transformer.transformer_blocks.18.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.18.attn_norm.linear.bias', 'transformer.transformer_blocks.18.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.18.attn.to_q.weight', 'transformer.transformer_blocks.18.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.18.attn.to_q.bias', 'transformer.transformer_blocks.18.attn.to_q.bias'}, {'ema_model.transformer.transformer_blocks.18.attn.to_k.weight', 'transformer.transformer_blocks.18.attn.to_k.weight'}, {'transformer.transformer_blocks.18.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.18.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.18.attn.to_v.weight', 'transformer.transformer_blocks.18.attn.to_v.weight'}, {'transformer.transformer_blocks.18.attn.to_v.bias', 'ema_model.transformer.transformer_blocks.18.attn.to_v.bias'}, {'ema_model.transformer.transformer_blocks.18.attn.to_out.0.weight', 'transformer.transformer_blocks.18.attn.to_out.0.weight'}, {'transformer.transformer_blocks.18.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.18.attn.to_out.0.bias'}, {'transformer.transformer_blocks.18.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.18.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.18.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.18.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.18.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.18.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.18.ff.ff.2.bias', 'transformer.transformer_blocks.18.ff.ff.2.bias'}, {'transformer.transformer_blocks.19.attn_norm.linear.weight', 'ema_model.transformer.transformer_blocks.19.attn_norm.linear.weight'}, {'transformer.transformer_blocks.19.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.19.attn_norm.linear.bias'}, {'transformer.transformer_blocks.19.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.19.attn.to_q.weight'}, {'transformer.transformer_blocks.19.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.19.attn.to_q.bias'}, {'transformer.transformer_blocks.19.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.19.attn.to_k.weight'}, {'transformer.transformer_blocks.19.attn.to_k.bias', 'ema_model.transformer.transformer_blocks.19.attn.to_k.bias'}, {'transformer.transformer_blocks.19.attn.to_v.weight', 'ema_model.transformer.transformer_blocks.19.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.19.attn.to_v.bias', 'transformer.transformer_blocks.19.attn.to_v.bias'}, {'transformer.transformer_blocks.19.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.19.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.19.attn.to_out.0.bias', 'transformer.transformer_blocks.19.attn.to_out.0.bias'}, {'transformer.transformer_blocks.19.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.19.ff.ff.0.0.weight'}, {'transformer.transformer_blocks.19.ff.ff.0.0.bias', 'ema_model.transformer.transformer_blocks.19.ff.ff.0.0.bias'}, {'ema_model.transformer.transformer_blocks.19.ff.ff.2.weight', 'transformer.transformer_blocks.19.ff.ff.2.weight'}, {'ema_model.transformer.transformer_blocks.19.ff.ff.2.bias', 'transformer.transformer_blocks.19.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.20.attn_norm.linear.weight', 'transformer.transformer_blocks.20.attn_norm.linear.weight'}, {'ema_model.transformer.transformer_blocks.20.attn_norm.linear.bias', 'transformer.transformer_blocks.20.attn_norm.linear.bias'}, {'transformer.transformer_blocks.20.attn.to_q.weight', 'ema_model.transformer.transformer_blocks.20.attn.to_q.weight'}, {'transformer.transformer_blocks.20.attn.to_q.bias', 'ema_model.transformer.transformer_blocks.20.attn.to_q.bias'}, {'transformer.transformer_blocks.20.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.20.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.20.attn.to_k.bias', 'transformer.transformer_blocks.20.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.20.attn.to_v.weight', 'transformer.transformer_blocks.20.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.20.attn.to_v.bias', 'transformer.transformer_blocks.20.attn.to_v.bias'}, {'transformer.transformer_blocks.20.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.20.attn.to_out.0.weight'}, {'transformer.transformer_blocks.20.attn.to_out.0.bias', 'ema_model.transformer.transformer_blocks.20.attn.to_out.0.bias'}, {'ema_model.transformer.transformer_blocks.20.ff.ff.0.0.weight', 'transformer.transformer_blocks.20.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.20.ff.ff.0.0.bias', 'transformer.transformer_blocks.20.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.20.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.20.ff.ff.2.weight'}, {'transformer.transformer_blocks.20.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.20.ff.ff.2.bias'}, {'ema_model.transformer.transformer_blocks.21.attn_norm.linear.weight', 'transformer.transformer_blocks.21.attn_norm.linear.weight'}, {'transformer.transformer_blocks.21.attn_norm.linear.bias', 'ema_model.transformer.transformer_blocks.21.attn_norm.linear.bias'}, {'ema_model.transformer.transformer_blocks.21.attn.to_q.weight', 'transformer.transformer_blocks.21.attn.to_q.weight'}, {'ema_model.transformer.transformer_blocks.21.attn.to_q.bias', 'transformer.transformer_blocks.21.attn.to_q.bias'}, {'transformer.transformer_blocks.21.attn.to_k.weight', 'ema_model.transformer.transformer_blocks.21.attn.to_k.weight'}, {'ema_model.transformer.transformer_blocks.21.attn.to_k.bias', 'transformer.transformer_blocks.21.attn.to_k.bias'}, {'ema_model.transformer.transformer_blocks.21.attn.to_v.weight', 'transformer.transformer_blocks.21.attn.to_v.weight'}, {'ema_model.transformer.transformer_blocks.21.attn.to_v.bias', 'transformer.transformer_blocks.21.attn.to_v.bias'}, {'transformer.transformer_blocks.21.attn.to_out.0.weight', 'ema_model.transformer.transformer_blocks.21.attn.to_out.0.weight'}, {'ema_model.transformer.transformer_blocks.21.attn.to_out.0.bias', 'transformer.transformer_blocks.21.attn.to_out.0.bias'}, {'transformer.transformer_blocks.21.ff.ff.0.0.weight', 'ema_model.transformer.transformer_blocks.21.ff.ff.0.0.weight'}, {'ema_model.transformer.transformer_blocks.21.ff.ff.0.0.bias', 'transformer.transformer_blocks.21.ff.ff.0.0.bias'}, {'transformer.transformer_blocks.21.ff.ff.2.weight', 'ema_model.transformer.transformer_blocks.21.ff.ff.2.weight'}, {'transformer.transformer_blocks.21.ff.ff.2.bias', 'ema_model.transformer.transformer_blocks.21.ff.ff.2.bias'}, {'transformer.norm_out.linear.weight', 'ema_model.transformer.norm_out.linear.weight'}, {'ema_model.transformer.norm_out.linear.bias', 'transformer.norm_out.linear.bias'}, {'transformer.proj_out.weight', 'ema_model.transformer.proj_out.weight'}, {'ema_model.transformer.proj_out.bias', 'transformer.proj_out.bias'}]. A potential way to correctly save your model is to use `save_model`. More information at https://huggingface.co/docs/safetensors/torch_shared_tensors