Citations of:

Poor writing, not specialized concepts, drives processing difficulty in legal language

Eric Martínez, Francis Mollica & Edward Gibson

Cognition 224 (C):105070 (2022)

Re-evaluating GPT-4’s bar exam performance. Eric Martínez - forthcoming - Artificial Intelligence and Law:1-24. Perhaps the most widely touted of GPT-4’s at-launch, zero-shot capabilities has been its reported 90th-percentile performance on the Uniform Bar Exam. This paper begins by investigating the methodological challenges in documenting and verifying the 90th-percentile claim, presenting four sets of findings that indicate that OpenAI’s estimates of GPT-4’s UBE percentile are overinflated. First, although GPT-4’s UBE score nears the 90th percentile when examining approximate conversions from February administrations of the Illinois Bar Exam, these estimates are heavily skewed towards repeat test-takers (...)
