A visualisation of major large language models (LLMs), ranked by performance on MMLU (Massive Multitask Language Understanding), a benchmark for evaluating the capabilities of LLMs.
» see the visualisation
» main source: Life Architect data
» see the data
» chart rendered with our lovely tool VizSweet