LLM
Can small language models speak coherent english
Language models (LMs) are powerful tools for natural language processing, but they often struggle to produce coherent and fluent text when they are small. Models with around 125M parameters such as GPT-Neo (small) or GPT-2 (small) can rarely generate coherent and consistent English text beyond a few words even after extensive training, Lets check this cool paper i came across: https://arxiv.org/pdf/2305.07759