Public LLM Leaderboard Computed Using Vectara’s Hallucination Evaluation Model

OpenAI’s GPT-4 Turbo currently holds the crown, but Meta’s LLaMA is hot on its heels

Korkrid Kyle Akepanidtaworn
4 min read · Nov 29, 2023

Prelude

Large Language Models (LLMs) have taken the NLP community, the AI community, and indeed the whole world by storm! LLMs are black-box AI systems that use deep learning on extremely large datasets to understand and generate new text. Modern LLMs began taking shape in 2014, when the attention mechanism, a machine learning technique designed to mimic human cognitive attention, was introduced in a research paper titled “Neural Machine Translation by Jointly Learning to Align and Translate.” In 2017, that attention mechanism was honed with the introduction of the transformer model in another paper, “Attention Is All You Need.”

Emerging from OpenAI’s labs on November 30, 2022, ChatGPT revolutionized the chatbot landscape, harnessing the power of LLMs like GPT-3.5 and, later, GPT-4. Yet the race for AI supremacy is far from over, with a myriad of LLMs from rival players striving for dominance. Today, I want to share with you which LLMs are leading the charge and which are playing catch-up.

Korkrid Kyle Akepanidtaworn

AI Specialized CSA @ Microsoft | Enterprise AI, GenAI, LLM, LLamaIndex, ML | GenAITechLab Fellow, MScFE at WorldQuant, MSDS at CU Boulder