newmindai/Llama-3.1-8B-Instruct-w16a16-tw
Text Generation
•
8B
•
Updated
•
57
FP8 Rowwise and BF16 tensorwise models with optimized recipes for large-scale training efficiency and convergence stability.