ChatGPT or human author?

Fraunhofer SIT Researches Automated Recognition Capabilities

Whether application letters, school essays, exams or program codes - with the text AI ChatGPT, texts of all kinds can be generated automatically within seconds. This works sometimes more, sometimes less well - and humans often fail to reliably classify the result as AI-generated or human-written.

ChatGPT text recognition with authorship verification

Fraunhofer SIT is researching ways to help recognize texts created with ChatGPT. Among other things, our text forensic experts work with a self-developed method for authorship verification, COAV: Originally, it was used to detect plagiarism in scientific papers, for example. Since COAV compares texts on a stylistic basis, this method can also be used to identify a specific "author," namely ChatGPT. This is used to calculate the distances between texts using similarities of text modules and typical consecutive letter strings: Is the text closer to GPT or closer to a human?

Our IT forensics expert Lukas Graner has demonstrated this using news texts as an example: He had existing online messages written by (human) journalists reworded via ChatGPT and tested them with the tool. Text passages marked in blue indicate that they are more likely to have been written by a human, while red markings stand for ChatGPT.

The graphics clearly show the result: In the first case, you see an excerpt from a news text from the Süddeutsche Zeitung with markings - added up in the result chart, you see that the blue portion predominates. The result: A human being probably wrote this text.

For the second example, the message text from ChatGPT was rephrased with our own stylistic preferences and then tested with COAV. The result shows: This text was created by machine.

Other methods of ChatGPT text detection

There are other research approaches in text forensics that are looking at how to get to the bottom of ChatGPT texts - again using automated AI tools. For example, there are