I Asked ChatGPT: Is there a Python library that's able to tell if text is generated by LLMs or if it's written by humans? What are my options?
Answer below in the comments.
Yes, there are Python libraries and tools designed to detect if text was generated by large language models (LLMs) or written by humans. Some notable options include:
GPTZero: Specifically built to detect AI-generated text from models like GPT, commonly used in educational settings.
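OpenAI's Text Classifier: OpenAI released a classifier for distinguishing between human-written and AI-generated text in January 2023, then withdrew it in July 2023, citing its low rate of accuracy, so it is no longer an available option.
DetectGPT: A research method for zero-shot detection of machine-generated text. It scores a passage with a language model and checks how the log-probability changes when the passage is slightly rewritten (probability curvature), rather than relying on perplexity alone.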
Hugging Face Transformers: Using pre-trained transformers to evaluate text perplexity can help identify AI-generated patterns, though it's not dedicated to detection.
AI Text Classifier: Several third-party tools and APIs use statistical methods to determine the likelihood that text was machine-generated.
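To make the perplexity idea from the Hugging Face option concrete, here is a minimal sketch, assuming `torch` and `transformers` are installed and that GPT-2 is used as the scoring model. The function names `perplexity_from_nll` and `gpt2_perplexity` are made up for this post, and a low score is only a weak hint that text may be machine-generated, not a verdict.

```python
import math


def perplexity_from_nll(mean_nll: float) -> float:
    """Perplexity is exp() of the mean per-token negative log-likelihood."""
    return math.exp(mean_nll)


def gpt2_perplexity(text: str, model_name: str = "gpt2") -> float:
    """Score `text` with a causal LM and return its perplexity.

    Assumption: GPT-2 is the scoring model; any causal LM on the Hub
    could be substituted via `model_name`.
    """
    # Imported lazily so the helper above works without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    model.eval()

    # GPT-2 has a 1024-token context window, so longer inputs are truncated.
    enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
    with torch.no_grad():
        # Passing labels makes the model return the mean cross-entropy loss.
        out = model(**enc, labels=enc["input_ids"])
    return perplexity_from_nll(out.loss.item())


# Example (downloads the GPT-2 weights on first run):
# print(gpt2_perplexity("The quick brown fox jumps over the lazy dog."))
```

Any threshold for "looks AI-generated" would have to be tuned on your own data; this sketch only produces the raw score.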
I Followed Up: How big is the download for each of these libraries?
Here's an estimate of the download sizes for the libraries and tools mentioned:
GPTZero:
OpenAI's Text Classifier:
DetectGPT:
Hugging Face Transformers: The download is around 400-500 MB, depending on the models you fetch; most of that is model weights (GPT-2, for example, is around 500 MB) rather than the transformers library itself.
AI Text Classifier (Third-Party APIs):
If you're looking for a lightweight option, a hosted third-party API is likely your best bet, since no model is downloaded locally. For local use, Hugging Face and DetectGPT involve bigger downloads because they need model weights on disk.
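Rather than trusting estimates, you can measure what a local option actually took up on disk. This is a rough sketch using only the standard library. The helper name `dir_size_mb` is made up for this post, and the path in the usage comment assumes the default Hugging Face cache location, which changes if `HF_HOME` is set.

```python
import os


def dir_size_mb(root: str) -> float:
    """Total size of regular files under `root`, in MB (10**6 bytes).

    Symlinks are skipped so files that are linked from several places
    (as in the Hugging Face cache layout) are only counted once.
    """
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total / 1e6


# Example, assuming the default cache location:
# print(dir_size_mb(os.path.expanduser("~/.cache/huggingface/hub")))
```

Run it before and after downloading a model to see the real difference on your machine.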