AI Training

Deep dive into LLMs like ChatGPT by Andrej Karpathy (TL;DR)

Andrej Karpathy's deep dive into LLMs covers the complete lifecycle from pretraining to post-training, explaining tokenization, neural network architectures, and fine-tuning processes. The comprehensive guide explores how LLMs process information, handle hallucinations, and utilize reinforcement learning to improve performance and reasoning capabilities.

”Torrenting from a corporate laptop doesn’t feel right”: Meta emails unsealed

Meta faces serious allegations of copyright infringement after unsealed emails reveal the company torrented over 160 terabytes of pirated books from shadow libraries for AI training. Internal communications show Meta employees expressed concerns about the legal implications of torrenting and seeding copyrighted content using corporate resources.