About 5 results
Open links in new tab

mamba/mamba_ssm/modules/mamba2.py at main - GitHub
# TODO: What if batch size changes between generation, and we reuse the same states?
mamba/mamba_ssm/modules/mamba_simple.py at main · state ... - GitHub
# TODO: What if batch size changes between generation, and we reuse the same states?
Is there a small model trainer? · Issue #24 · state-spaces/mamba
Dec 6, 2023 · You will be able to reuse this secret in all of your notebooks. Please note that authentication is recommended but still optional to access public models or datasets. …
If batchsize is large (ex.4096), this mamba implementation ... - GitHub
Apr 9, 2024 · # TODO: What if batch size changes between generation, and we reuse the same states? if initialize_states: conv_state.zero_() ssm_state.zero_() return conv_state, ssm_state …