  1. Re-evaluating GPT-4’s bar exam performance. Eric Martínez - forthcoming - Artificial Intelligence and Law:1-24.
    Perhaps the most widely touted of GPT-4’s at-launch, zero-shot capabilities has been its reported 90th-percentile performance on the Uniform Bar Exam. This paper begins by investigating the methodological challenges in documenting and verifying the 90th-percentile claim, presenting four sets of findings that indicate that OpenAI’s estimates of GPT-4’s UBE percentile are overinflated. First, although GPT-4’s UBE score nears the 90th percentile when examining approximate conversions from February administrations of the Illinois Bar Exam, these estimates are heavily skewed towards repeat test-takers (...)