Mind Readings: AI Content Detectors Deep Dive Part 3

Written by

AI, Artificial Intelligence, Generative AI

Mind Readings: AI Content Detectors Deep Dive Part 3

In today’s episode, you’ll witness a live demonstration of popular AI content detectors and discover their shocking inaccuracy. You’ll learn about the critical importance of holdout data and why a zero-tolerance policy for false accusations is essential. You’ll also see how easily these tools misidentify both human and AI-generated content, potentially leading to unfair penalties and legal trouble for institutions. Tune in to see the undeniable proof!

Mind Readings: AI Content Detectors Deep Dive Part 3

Watch this video on YouTube.

Can’t see anything? Watch it on YouTube here.

Listen to the audio here:

Download the MP3 audio here.

Machine-Generated Transcript

What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for watching the video.

Welcome back. This is part three. We’re going to put some AI content detectors to the test.

First, let’s talk about methodology. How would you do this if you are an institution? The first thing—and I said this in a previous post—is you have to decide what your acceptable rate of false accusation is. How permissible is it to falsely accuse a student, a faculty member, a member of the administration at your institution? What is the acceptable rate of false accusation? Because when you use AI content detectors, you are going to have false positives.

Remember good old statistics: Type 1 and Type 2 errors. A Type 1 error is a false positive, in this case, saying something was made by AI when it was not. A false negative: something was made by AI, but you detected it as not being made by AI. What is your acceptable rate? Are you willing to falsely accuse 5% of your students? One percent? Sixteen thousand percent? What is your acceptable rate of false accusation? This is something that your ethics committee has to decide. I would argue that with this technology, because it is so flawed, the acceptable false positive rate is zero. If you’re talking about ending someone’s academic career—kicking them out of school, revoking their tenure, firing them, or not hiring someone—those are very high-risk decisions. As a result, the margin for error for false positives should be zero.

Now, how do we test this? You’re going to need some samples. You will need—I would argue, again, if you are testing systems, if you want to buy a system—a robust testing library, a library of four types of content.

Number one: samples of content known to be human-led with no AI use at all, typically from historical databases. Now, one caution is that AI tools—from which models were made—ingested many of these documents. So, they will naturally have some statistical correlations in their outputs that can sound a little bit like pieces of those original documents. Be aware of that. So that’s your first set of samples.

Your second set of samples should be holdout data. This is known human-led content that you know to be 100% human. Typically, this would be papers or things written by humans, maybe prior to 2022, because there was no generative AI back then. That would have never been published, never been put online, never vacuumed up into the databases AI makers used to make their models. This is critical data. You want unpublished research, unpublished papers, things that have sat on a hard drive, private emails—whatever the data set is that you will hold someone accountable to. You need examples that you can test services with and say, “We want to make sure that the service is good. We want to make sure there are no false positives, no false accusations,” or whatever your acceptable rate of false accusation is at your institution, “using data that these models have not trained on, and therefore, we know it’s a fair test.”

Third, you need samples that are 100% AI-led, made by a machine. You know they are made by a machine. There’s absolutely no question; this was made by machines.

Fourth, you probably want some samples that are human-led but AI-modified. So, for example, if you were to take a transcript of something that you know for sure was human-read—like I read something aloud—and then you had AI clean up the grammar, that would be an example of something human-made and machine-modified.

So, you need four different samples to do this. Let’s get started. I’m going to do what most people would do. I’m going to Google for “AI detector.” We’ve got the top five results for “AI detector.” This is great.

Now, I’m going to start with a paper. This paper is called “Supreme Court Cases and the Economy.” It was written by my kid, and it was written—let me double-check the date on this file—this file was written April 28th, 2022. ChatGPT, of course, came out in November of that year. So, this file is guaranteed free of AI. So, let’s go ahead and—actually, no, let’s start with our known good. I’m going to take Federalist Paper Number 8 by Alexander Hamilton as our first test candidate. We’re going to put a few paragraphs of Federalist Paper Number 8 by Alexander Hamilton in here. Let’s go service by service. We’ve got 500 words, so it’s a reasonably fair test. ZeroGPT: 64% of Federalist Paper Number 8, written in 1787, is AI-generated. So, if your false positive rate is zero, this tool fails. This is a false positive. There’s no way—I don’t care how good Lin-Manuel Miranda is—there is no way Alexander Hamilton used AI to write Federalist Paper Number 8. Not a chance. Okay, QuillBot says 0% is likely AI-generated. CopyLeaks gives it a thumbs up. That’s not super helpful. ZeroGPT says highly, and Writer says 99% human-generated content. Okay. So, Federalist Paper Number 8 passes.

Let’s try something a bit more sophisticated. This is an op-ed from The New York Times from April 27th, 1924, about immigration legislation. Boy, it’s amazing. One hundred years ago, we were still talking about the same things. I’m going to take the text of this article—this is 1924—let’s put this in. Oh, can only scan 5,000 characters. That’s fine. Let’s see how we’re doing. ZeroGPT came back to 0% AI. 0% on this one. One says human text: 0%, 100% human. So, interesting, Federalist Paper got a 99% in Writer, and this got 100%. That is round two.

We used two papers that are almost certainly content that was in training databases. Let’s now take my kid’s paper. Let’s see how we do. 72% AI. Cool. So this would get my kid kicked out of school if you were using this particular detector. This one came out with 0%. This one came out “AI content detected.” Look at that. This one came out 2%. It’s not highlighting which ones it is. This one came out at 94%, saying there’s probably some AI content in here. That’s really bad if we have a holdout of the type of content that someone would submit. This is a paper that my kid wrote, and several detectors said, “Hey, you probably used AI in parts of this.” That’s really bad. So, if your false positive rate is zero, most of these tools fail. You would want to keep doing this over and over again.

Let’s take another one just for fun. I’m going to take—”Natural Rights in the Constitution”—this was written in 2021, so a whole year before ChatGPT. Here we go. Remember, these documents have never been seen online before. These are my kid’s writings: 49% AI. Okay, this one says 0%. This one says “human text” this time, 2% AI in there, and 95%. So, again, across the board, this should be 100% because there’s no way my kid went back in time—or forward in time—to do this.

Now, let’s create some AI content because we need some to test. Let’s go back to—close this up here—let’s take our “Supreme Court cases” from the previous episode. Let’s clean up the markdown—because markdown is a dead giveaway that you’ve used AI—and format it nicely. We end up with a document that looks like this, pretty straightforward. This is 100% AI. Let’s restart our detectors and see how we did. 98% AI. Okay, that one did a good job. 100% AI-generated. You’ve reached your daily limit. 100% AI, 61% human-generated. So now this is the territory of false negatives. This is a case where a tool erred on the side of “it’s human” when it really isn’t. In a scenario where you have a substantial risk of people being kicked out of school, I’m okay with false negatives, where someone did something with AI and it doesn’t get caught. The problem is the false positives when someone doesn’t use AI and is falsely accused of using AI. As we’ve seen in just the examples in this episode, it’s pretty clear that these tools fall down. If your tolerance for false accusation—which, by the way, is probably a lawsuit—is zero, none of these will pass.

Let’s look at two more examples. First, we’re going to use a human-led document that is AI-modified. What does that mean? That means I’m going to take a transcript from a previous episode of mine. These are my words. I spoke these words aloud. I had a generative AI tool clean up my grammar, clean up my words without rewriting it. Let’s see how just AI use on human-led content performs. Isn’t that interesting? So this was modified by AI, but it is obviously a transcript. This one came out 0%. This one came out 40% AI-detected, which—0% of it was made by AI. Still hit my daily limit there. This one says 1% probability, 96% human-generated. So, again, kind of all over the place.

One final test: a section from my book, The Intelligence Revolution. This is a book that is derived by AI from my original newsletters. I gave it all of my newsletters, YouTube videos, and things like that, and said, “I want you to write as though you were me,” plagiarizing—not plagiarizing—from me as much as possible. Let’s see how this does. Again, these are derived from my words. I’m probably going to hit the daily limit on that one. That’s okay. Let’s see how we did here. 92% AI-generated. 100% AI-generated for that one. Daily limit. 76% AI-generated there. 76% human-generated.

So here we are at the end of our tests. All of these tools fall down, all of them without exception. They don’t get things right. Why don’t they get things right? It depends on how the tool is engineered, whether it’s looking at basic text statistics like perplexity and burstiness or whether they’re using an AI model underneath that’s been trained on some sample data. Whatever the case may be, all of them fell down on historical documents, on holdout data. The holdout data was the really important one because, again, if you’re using this in an academic context—I’m using my kid’s papers from 2021 to early 2022—none of this could be AI-generated, period. This has never been seen by an AI model, so AI could not train on it and replicate any portion of it. These tools still fell down on detecting AI content, especially on both slightly modified and substantially modified content.

The key message here is: do not use these tools to detect AI content, period, end of story, unless you are okay with falsely accusing someone and reaping the consequences, which will include lawsuits against you. If you accuse someone and they take this video, which clearly demonstrates these tools are highly flawed, you’re going to lose that case, and you’re going to end up paying a whole bunch of money. So, do not use these tools.

In part four, let’s talk about building the term paper of the future and the process you might want to teach a student on how to use AI to write a term paper. Stay tuned. If you enjoyed this video, please hit the like button. Subscribe to my channel if you haven’t already. And if you want to know when new videos are available, hit the bell button to be notified as soon as new content is live. Thank you.

Mind Readings: AI Content Detectors Deep Dive Part 3

Machine-Generated Transcript

Comments

Leave a Reply Cancel reply

More posts

Mind Readings: Never Think Alone, AI as a Thought Partner, Part 5

Mind Readings: Never Think Alone, AI as a Thought Partner, Part 3

Mind Readings: Never Think Alone, AI as a Thought Partner, Part 2

Mind Readings: Never Think Alone, AI as a Thought Partner, Part 1

Pin It on Pinterest