Recent publication
Comparing Large Language Models
How to decide which is the best state-of-the-art model?
We are excited to share with you our latest research Comparing Large Language Models where we delve into some of the methods used to compare state-of-the-art Large Language Models (LLMs).
Here are the key highlights:
- Evaluation Methods: In this research, we report some of the methods used to compare state-of-the-art Large Language Models (LLMs).
- Chatbot Arena: One of the most popular methods is the Chatbot Arena; a leaderboard for the best performing LLMs, obtained via a crowdsourced approach.
- Quality Trends: Analyzing the Chatbot Arena over time, we show there are indications that the difference in quality between new state-of-the-art LLMs is diminishing.
We believe this analysis provides valuable insights into the evolving landscape of LLMs and their performance metrics.
Useful links
Request product details
Call your local sales team
Americas
All countries (toll free): +1 800 427 7570
Brazil: +55 11 47009629
Argentina: +54 11 53546700
Chile: +56 2 24838932
Mexico: +52 55 80005740
Colombia: +57 1 4419404
Europe, Middle East, Africa
Europe: +442045302020
Africa: +27 11 775 3188
Middle East & North Africa: 800035704182
Asia Pacific (Sub-Regional)
Australia & Pacific Islands: +612 8066 2494
China mainland: +86 10 6627 1095
Hong Kong & Macau: +852 3077 5499
India, Bangladesh, Nepal, Maldives & Sri Lanka:
+91 22 6180 7525
Indonesia: +622150960350
Japan: +813 6743 6515
Korea: +822 3478 4303
Malaysia & Brunei: +603 7 724 0502
New Zealand: +64 9913 6203
Philippines: 180 089 094 050 (Globe) or
180 014 410 639 (PLDT)
Singapore and all non-listed ASEAN Countries:
+65 6415 5484
Taiwan: +886 2 7734 4677
Thailand & Laos: +662 844 9576