load balance bias gradient leak][bugf][act-cache-consistency][keep all loops with kv cache][bugf][lora-depth-extrapolation][clamp scale index beyond max loops][improvement][pyproject-version][bump version to 0 4 0] |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| main.py | ||
| moda.py | ||
| tokenizer.py | ||
| variants.py | ||