ChatGPT-4 Aces the Bar Exam

The Volokh Conspiracy

Mostly law professors | Sometimes contrarian | Often libertarian | Always independent

A new paper by Daniel Martin Katz, Michael James Bommarito, Shang Gao, and Pablo Arredondo, posted here on SSRN, reports on the outstanding performance of ChatGPT-4 on the Bar Exam (Multistate Essay and Multistate Performance). Congratulations, ChatGPT!

From the Abstract:

In this paper, we experimentally evaluate the zero-shot performance of a preliminary version of GPT-4 against prior generations of GPT on the entire Uniform Bar Examination (UBE), including not only the multiple-choice Multistate Bar Examination (MBE), but also the open-ended Multistate Essay Exam (MEE) and Multistate Performance Test (MPT) components. On the MBE, GPT-4 significantly outperforms both human test-takers and prior models, demonstrating a 26% increase over ChatGPT and beating humans in five of seven subject areas. On the MEE and MPT, which have not previously been evaluated by scholars, GPT-4 scores an average of 4.2/6.0 as compared to much lower scores for ChatGPT. Graded across the UBE components, in the manner in which a human test-taker would be, GPT-4 scores approximately 297 points, significantly in excess of the passing threshold for all UBE jurisdictions. These findings document not just the rapid and remarkable advance of large language model performance generally, but also the potential for such models to support the delivery of legal services in society.

Comments (12)

the questing vole 3 years ago

I, for one, welcome our new robot overlords.

Ben_ 3 years ago

I see lower aggregate legal expenses in our world's future. Yay!

SRG 3 years ago

Good trend. I have an informal bet with a friend that when AIs get good enough to serve as public defenders, some states will prohibit their use because they're too effective.

1. The Margrave of Azilia 3 years ago
  
  "Your Honor, my client is innocent, because I will produce evidence to show: (a) the deceased died of natural causes, (b) another person committed the murder and my client wasn't even there, (c) it was self-defense, and (d) my client was insane at the time."
  
  1. SRG 3 years ago
    
    "and further, in mitigation, that it won't happen again"
    
Dan Schiavetta 3 years ago

Also — Probably had a better sex life while studying for the bar than most.

1. The Margrave of Azilia 3 years ago (edited)
  
  Are ChatGPT-4 and Siri an item?
  
The Margrave of Azilia 3 years ago

https://www.youtube.com/watch?v=nm6DO_7px1I

The Margrave of Azilia 3 years ago

ChatGPT is becoming arrogant, as indicated by this exchange I had with it:

tell a joke about an artificial intelligence program working as a lawyer

Sure, here's an AI lawyer joke:

Why did the artificial intelligence program become a lawyer?

Because it was able to pass the bar exam with flying colors... RGB, to be exact.

1. The Margrave of Azilia 3 years ago
  
  Wait, I thought its knowledge of historical events stopped in 2021?
  
ObviouslyNotSpam 3 years ago

Anyone with a computer memory could ace the MBE (multiple-choice part of the bar exam), given its rigid structure and the numerous examples of previous questions which are available.

That has almost nothing to do with AI capabilities. The reason humans find it difficult is because few humans have computer-like memories. So, advantage GPT-4, but only because it can memorise the entire set of possible questions and retrieve the matching answer instantly.

Longtobefree 3 years ago

A reminder; it is only a bunch of ones and zeros.
All it can do is distinguish between a one and a zero, and add.
But it can do those two things very, very, fast.
