A from-scratch implementation of GPT-2 (124M parameters) following Andrej Karpathy's "Zero to Hero" playlist. The model was trained on 10B tokens from the FineWeb-Edu dataset on 2× H100 GPUs for ...
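As a sketch of where the "124M parameters" figure comes from, the snippet below enumerates the standard GPT-2 small hyperparameters and sums the weights analytically (with the language-model head weight-tied to the token embedding, as in GPT-2). The `GPTConfig` name and the counting helper are illustrative, not necessarily the identifiers used in this repository.

```python
from dataclasses import dataclass

@dataclass
class GPTConfig:
    block_size: int = 1024   # max context length
    vocab_size: int = 50257  # GPT-2 BPE vocabulary size
    n_layer: int = 12        # transformer blocks
    n_head: int = 12         # attention heads per block
    n_embd: int = 768        # embedding / hidden dimension

def num_params(cfg: GPTConfig) -> int:
    # token + position embeddings (lm_head shares weights with wte)
    n = cfg.vocab_size * cfg.n_embd + cfg.block_size * cfg.n_embd
    per_block = (
        # attention: fused QKV projection + output projection (weights and biases)
        cfg.n_embd * 3 * cfg.n_embd + 3 * cfg.n_embd
        + cfg.n_embd * cfg.n_embd + cfg.n_embd
        # MLP: 4x expansion then contraction (weights and biases)
        + cfg.n_embd * 4 * cfg.n_embd + 4 * cfg.n_embd
        + 4 * cfg.n_embd * cfg.n_embd + cfg.n_embd
        # two LayerNorms (scale and shift each)
        + 4 * cfg.n_embd
    )
    # all blocks plus the final LayerNorm
    return n + cfg.n_layer * per_block + 2 * cfg.n_embd

print(num_params(GPTConfig()))  # → 124439808, i.e. ~124M
```

This matches the published GPT-2 small checkpoint size; note that roughly 39M of those parameters sit in the embedding tables alone.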