Doveryai, No Proveryai!

Image: Holley Robinson, EDRM with AI.

[EDRM Editor’s Note: The opinions and positions are those of Craig Ball. This article is republished with permission and was first published on August 7, 2024.]

I recently published an AI prompt to run against search terms then get the AI to propose improvements. Among the pitfalls I’d hoped to expose was the presence of “stop” or “noise” words; terms routinely excluded from search indices. Searches incorporating stop words fail because terms not in the index won’t be found. Ensuring your searches don’t include stop words is an essential step in framing effective queries.

To help the AI recognize stop words, the prompt included a list of default stop words for well-known eDiscovery tools. That is, I thought I’d done that, but what I included in error (and have now replaced) was ChatGPT’s rendition of stop words for the major tools. I’d made a mental note to check the lists supplied but—DOH!—I plugged it into the prompt and then forgot to do my due diligence.

I was feeling pretty good about the post and getting some nice feedback. Last night, my dear friend and e-discovery Empress Mary Mack commented on the novelty of seeing the various stop word lists broken out in a ready reference. I think echoes of Mary’s kind comment woke me at 4:00am, my subconscious screaming, “HEY DUMMY! Did you verify those stop words? Tell me you didn’t blindly trust an AI?!?”

…you just cannot trust an AI generative large language model to do your research without careful human assessment of the output. I know this and let it slip my mind. Last time for that.
Craig Ball.

So, long before sunrise, I was manually checking each stop word list against product websites and—lo and behold—every list was off: some merely incomplete but others not even close. ChatGPT hallucinated the lists, and I failed to do the crucial thing lawyers must do when using AI as a research assistant: Trust but verify.

No harm done, but I share my chagrin here to underscore that you just cannot trust an AI generative large language model to do your research without careful human assessment of the output. I know this and let it slip my mind. Last time for that. I’ve corrected the prompt on my blog and hope I’ve gotten it right. I post this to remind my readers that AI LLMs are great—USE THEM–but they are no substitute for you. Doveryai, no proveryai!

Read the original release here.

Assisted by GAI and LLM Technologies per EDRM GAI and LLM Policy.

Author

Craig Ball

Craig Ball is a Texas trial lawyer, computer forensic examiner, law professor and noted authority on electronic evidence. He limits his practice to serving as a court-appointed special master and consultant in computer forensics and electronic discovery and has served as the Special Master or testifying expert in computer forensics and electronic discovery in some of the most challenging and celebrated cases in the U.S. Craig is also EDRM’s General Counsel and a key contributor to many EDRM projects.
View all posts