Intro

A thread on Hacker News recently caught my eye, which then led me to Sebastian Raschka's "Build a Large Language Model (From Scratch)". GPU prices have been soaring, and I was lucky enough to snag an NVIDIA RTX 4080 Super not too long ago. Since I had already spent an egregious amount of money on parallel compute for gaming shenanigans, I decided I might as well use it to learn more about the hot new thing™, which at the moment is generative AI.

I'm fairly reticent and don't post many of my deeds on the internet, so this will be a first for me, both in documenting a project like this and in spewing info at you (the reader) in a manner that is digestible. Buckle up.
