The Volokh Conspiracy
Mostly law professors | Sometimes contrarian | Often libertarian | Always independent
ChatGPT-4 Aces the Bar Exam
More for the "When Will They Replace Humans?" File
A new paper by Daniel Martin Katz, Michael James Bommarito, Shang Gao, and Pablo Arredondo, posted here on SSRN, reports on the outstanding performance of ChatGPT-4 on the Bar Exam (including the Multistate Essay and Multistate Performance components). Congratulations, ChatGPT!
From the Abstract:
In this paper, we experimentally evaluate the zero-shot performance of a preliminary version of GPT-4 against prior generations of GPT on the entire Uniform Bar Examination (UBE), including not only the multiple-choice Multistate Bar Examination (MBE), but also the open-ended Multistate Essay Exam (MEE) and Multistate Performance Test (MPT) components. On the MBE, GPT-4 significantly outperforms both human test-takers and prior models, demonstrating a 26% increase over ChatGPT and beating humans in five of seven subject areas. On the MEE and MPT, which have not previously been evaluated by scholars, GPT-4 scores an average of 4.2/6.0 as compared to much lower scores for ChatGPT. Graded across the UBE components, in the manner in which a human test-taker would be, GPT-4 scores approximately 297 points, significantly in excess of the passing threshold for all UBE jurisdictions. These findings document not just the rapid and remarkable advance of large language model performance generally, but also the potential for such models to support the delivery of legal services in society.
I, for one, welcome our new robot overlords.
I see lower aggregate legal expenses in our world's future. Yay!
Good trend. I have an informal bet with a friend that when AIs get good enough to serve as public defenders, some states will prohibit their use because they're too effective.
"Your Honor, my client is innocent, because I will produce evidence to show: (a) the deceased died of natural causes, (b) another person committed the murder and my client wasn't even there, (c) it was self-defense, and (d) my client was insane at the time."
"and further, in mitigation, that it won't happen again"
Also — Probably had a better sex life while studying for the bar than most.
Are ChatGPT-4 and Siri an item?
https://www.youtube.com/watch?v=nm6DO_7px1I
ChatGPT is becoming arrogant, as indicated by this exchange I had with it:
tell a joke about an artificial intelligence program working as a lawyer
Sure, here's an AI lawyer joke:
Why did the artificial intelligence program become a lawyer?
Because it was able to pass the bar exam with flying colors... RGB, to be exact.
Wait, I thought its knowledge of historical events stopped in 2021?
Anyone with a computer memory could ace the MBE (multiple-choice part of the bar exam), given its rigid structure and the numerous examples of previous questions which are available.
That has almost nothing to do with AI capabilities. The reason humans find it difficult is because few humans have computer-like memories. So, advantage GPT-4, but only because it can memorise the entire set of possible questions and retrieve the matching answer instantly.
A reminder: it is only a bunch of ones and zeros.
All it can do is distinguish between a one and a zero, and add.
But it can do those two things very, very fast.