Here's the final version of Chapter 7 of the neural net series, about the multilayer perception blocks in transformers, as motivated by the question of how LLMs may store facts. The plan is to make it public tomorrow, let me know if you catch any little errors in the meantime.
Steven Siddals
2024-09-01 16:27:53 +0000 UTCHolger Flier
2024-09-01 14:12:04 +0000 UTCMeghan
2024-09-01 10:59:55 +0000 UTCMeghan
2024-08-31 19:53:58 +0000 UTCWilliam Smith
2024-08-31 16:45:59 +0000 UTC3blue1brown
2024-08-31 04:18:03 +0000 UTCKyra
2024-08-31 03:15:37 +0000 UTCNeel Nanda
2024-08-31 01:29:38 +0000 UTCwye
2024-08-30 19:51:50 +0000 UTCwye
2024-08-30 19:38:07 +0000 UTCAlex Loftus
2024-08-30 19:08:03 +0000 UTCJesse Thompson
2024-08-30 18:35:22 +0000 UTCDaniel Armesto
2024-08-30 18:27:16 +0000 UTC