A independent contribution was noted where by a user created a fused GEMM for int4, which can be effective for training with preset sequence lengths, providing the fastest Remedy.LingOly Problem Introduces: A fresh LingOly benchmark is addressing the evaluation of LLMs in State-of-the-art reasoning involving linguistic puzzles. With above a thousan