This is pretty cool. The full model runs (but comically slowly) on a 6 yr old ESP32-S2 board with external SPI PSRAM, though I did have to turn off the idle task watchdog.
==================== ATOME on SILICON ==================== chip : ESP32-S2 rev v0.0 cores=1 flash : 4 MB PSRAM : 2048 KB (detected) model : 276655 bytes embedded in flash config : d=256 layers=8 head=64 seq=128 state=811 KB --------------------------------------------------------- prompt: Once >>> upon a time, there was a little girl named Lily average: 0.1 tok/s | heap low-water: 243 KB internal