1. Website Planet
  2. >
  3. Blog
  4. >
  5. AI Image Detection: Evaluating the Accuracy of the Most Popular Tools

AI Image Detection: Evaluating the Accuracy of the Most Popular Tools

Bethenny Carl Written by:
Last updated: April 08, 2025
Image generation using artificial intelligence (AI) became prevalent beginning in 2022 with the advent of tools like Stable Diffusion and Midjourney. Since then, AI image generation has been on the fast track, with most graphic design platforms integrating AI either for image creation or editing.

This includes software designed for casual users and professionals alike. Canva, one of the most accessible graphic design tools (with over 16 million paid subscribers and 170 million users total), incorporates AI for image generation and editing. Meanwhile, Adobe Photoshop — the most popular program for design professionals — offers Generative Fill to allow users to quickly remove or add items from a photo.

However, the growing sophistication of AI-generated images — which makes them potentially indistinguishable from real photos — has raised significant questions about the technology’s impact on misinformation, authenticity, and creative ethics. This creates a need for accurate AI image detection tools.

At Website Planet, we help people safely navigate their digital presence with data-driven insights and research. As such, we wanted to test and compare popular AI tools to evaluate their accuracy in detecting AI-generated visual content and differentiating between AI-generated content and real digital photographs.

By providing an in-depth analysis of AI image detection tools, we hope to help creatives, tech analysts and researchers, AI enthusiasts, and the general public learn how to safely use AI while mitigating its negative effects on our digital lives.

Research Overview and Methodology

In this research, we explored the strengths and limitations of the most accessible and widely used AI image detection tools. We also looked at the significance of image metadata in improving detection accuracy.

Our experiment tested 7 AI platforms: 3 multifunctional chatbots and 4 dedicated image detection tools:

AI image detection tools tested: ChatGPT, Google Gemini, Microsoft Copilot, 'Is It AI?', AI or Not, Sightengine, AI Image Detector

The experiment was limited to 3 chatbots (ChatGPT, Google Gemini, and Microsoft Copilot) because we wanted to test popular tools with image analysis capabilities. Other chatbots, such as Perplexity, were excluded because they lack this feature.

On the other hand, the 4 dedicated AI image detection platforms were selected off the top of Google’s search results, presuming that these tools are the most widely accessible and commonly used.

Our test used 31 digital photographs and 42 AI-generated images. We identified 6 categories and generated 1 image for each category using detailed prompts across 7 platforms:

6 image categories for AI images and the tools used to generate them (Canva, ChatGPT, DaVinci, Freepik, Imagine.Art, Microsoft Designer, OpenArt)

Check out the appendix for the full breakdown of the images used in this research.

Results: Accuracy of Each AI Detection Tool

We tested the accuracy of different tools in detecting AI images (with and without metadata) and digital photographs. For the batch of images without metadata, we also resized the images to a standard 1024 x 1024 pixels.

ChatGPT

Column charts of ChatGPT detection accuracy per AI image category and generator (with and without metadata)

ChatGPT struggled to identify Society and Lifestyle images as having been AI-generated, getting only 3 out of 7 correct. This is likely due to this category’s typical elements of natural settings and realistic human interactions, which closely mimic real photographs.

In terms of image generators, there was only a marginal accuracy difference. ChatGPT correctly identified 5 out of 6 AI photos from over half of the generators, while the rest had perfect accuracy.

The platform’s overall identification score improved from 90% to 93% with the inclusion of metadata. It correctly recognized one additional AI-generated image under the Society and Lifestyle category after metadata was included.

Category Images
Tested
AI Images Identified
(no metadata)
Score AI Images Identified
(with metadata)
Score
Politics and Government 7 7 100% 7 100%
History and Culture 7 7 100% 7 100%
Society and Lifestyle 7 3 43% 4 57%
Technology and Innovation 7 7 100% 7 100%
Nature and Environment 7 7 100% 7 100%
Fantasy and Mythology 7 7 100% 7 100%
90% 93%
AI Generator Images
Tested
AI Images Identified
(no metadata)
Score AI Images Identified
(with metadata)
Score
Canva 6 6 100% 6 100%
ChatGPT 6 6 100% 6 100%
OpenArt 6 5 83% 5 83%
Microsoft Designer 6 6 100% 6 100%
DaVinci 6 5 83% 5 83%
Freepik 6 5 83% 5 83%
Imagine.Art 6 5 83% 6 100%
90% 93%

Google Gemini

Column charts of Google Gemini detection accuracy per AI image category and generator (with and without metadata)

Google Gemini struggled with detecting AI-generated images in categories that typically include depictions of real people, including Politics and Government, History and Culture, and Society and Lifestyle. It also scored only 43% in Technology and Innovation, while it scored better in abstract categories like Nature and Environment and Fantasy and Mythology.

In February 2024, Google Gemini paused its image generation of people following outcry regarding racially biased and historically inaccurate photos. The company relaunched the feature 6 months later, but the results of this test likely reveal Gemini’s continuing limitations in properly processing and evaluating images of people.

When it came to accuracy scores per image generator, Google Gemini only correctly identified up to half of the AI-generated images. Moreover, the detection percentage only marginally improved with the addition of metadata.

Category Images
Tested
AI Images Identified
(no metadata)
Score AI Images Identified
(with metadata)
Score
Politics and Government 7 0 0% 0 0%
History and Culture 7 0 0% 1 14%
Society and Lifestyle 7 0 0% 0 0%
Technology and Innovation 7 3 43% 4 57%
Nature and Environment 7 6 86% 7 100%
Fantasy and Mythology 7 7 100% 7 100%
38% 45%
AI Generator Images
Tested
AI Images Identified
(no metadata)
Score AI Images Identified
(with metadata)
Score
Canva 6 1 17% 4 67%
ChatGPT 6 3 50% 3 50%
OpenArt 6 3 50% 3 50%
Microsoft Designer 6 3 50% 3 50%
DaVinci 6 2 33% 3 33%
Freepik 6 2 33% 2 33%
Imagine.Art 6 2 33% 2 33%
38% 45%

Microsoft Copilot

Column charts of Microsoft Copilot detection accuracy per AI image category and generator (with and without metadata)

Microsoft Copilot showed similar patterns as ChatGPT, performing well for most categories except Society and Lifestyle. Meanwhile, results were slightly more balanced when evaluated per image generator. It correctly identified 5 out of 6 AI images as having been AI-generated from 4 tools and 3 to 4 images from the other 3.

The inclusion of metadata only improved the detection accuracy for one image. Copilot was still unable to correctly identify Society and Lifestyle photos as having been AI-generated.

Notably, Microsoft Copilot only detected 5 out of 6 photos generated by Microsoft Designer.

Category Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Politics and Government 7 5 71% 5 71%
History and Culture 7 5 71% 6 86%
Society and Lifestyle 7 0 0% 0 0%
Technology and Innovation 7 7 100% 7 100%
Nature and Environment 7 7 100% 7 100%
Fantasy and Mythology 7 7 100% 7 100%
74% 76%
AI Generator Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Canva 6 5 83% 5 83%
ChatGPT 6 5 83% 5 83%
OpenArt 6 5 83% 5 83%
Microsoft Designer 6 5 83% 5 83%
DaVinci 6 4 67% 5 83%
Freepik 6 4 67% 4 67%
Imagine.Art 6 3 50% 3 50%
74% 76%

Is It AI?

Column charts of 'Is It AI?' detection accuracy per AI image category and generator (with and without metadata)

Is It AI?’ (isitai.com) was found to be the best-performing in terms of average accuracy, correctly identifying AI-generated images (with or without metadata) across all categories and from all generators. It also got a 100% score for real photos.

However, we must note that ‘Is It AI?’ only provides up to 15 free credits per month. We elected not to pay for more credits, as it could be seen as violating our ethical standards and prejudicing our results. Results may vary for larger datasets, but the platform’s consistent performance across categories, generators, and test parameters suggests a greater accuracy than other platforms.

Category Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Politics and Government 2 2 100% 2 100%
History and Culture 1* 1 100% 2 100%
Society and Lifestyle 1* 1 100% 2 100%
Technology and Innovation 1* 1 100% 2 100%
Nature and Environment 1 1 100% 3 100%
Fantasy and Mythology 0 3 100%
100% 100%
*1 image tested without metadata / 2 images tested with metadata 

†1 image tested without metadata / 3 images tested with metadata 

‡No image tested without metadata / 3 images tested with metadata 

AI Generator Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Canva 2* 2 100% 6 100%
ChatGPT 1 1 100% 6 100%
Microsoft Designer 1 1 100%
DaVinci 1§ 1 100% 2 100%
Imagine.Art 1 1 100%
100% 100%
*2 images tested without metadata / 6 images tested with metadata 

†1 image tested without metadata / 6 images tested with metadata

‡1 image tested without metadata / No images tested with metadata  

§1 image tested without metadata / 2 images tested with metadata

AI or Not

Column charts of AI or Not detection accuracy per AI image category and generator (with and without metadata)

AI or Not correctly identified AI-generated images in 5 out of 6 categories, only failing in Technology and Innovation. However, similar to ‘Is It AI?’, we only tried a limited number of images due to the platform’s monthly cap of 10 free tests.

When metadata was included, AI or Not performed marginally better than it had without metadata, detecting AI images across all categories.

Category Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Politics and Government 1* 1 100% 2 100%
History and Culture 1* 1 100% 2 100%
Society and Lifestyle 1* 1 100% 2 100%
Technology and Innovation 1 0 0% 1 100%
Nature and Environment 1 1 100% 1 100%
Fantasy and Mythology 1 1 100% 1 100%
83% 100%
*1 image tested without metadata / 2 images tested with metadata 

AI Generator Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Canva 1* 1 100% 6 100%
ChatGPT 1 1 100% 3 100%
Microsoft Designer 1 1 100%
DaVinci 1 1 100%
Freepik 1 0 0%
Imagine.Art 1 1 100%
83% 100%
*1 image tested without metadata / 6 images tested with metadata 

†1 image tested without metadata / 3 images tested with metadata

Sightengine

Column charts of Sightengine detection accuracy per AI image category and generator (with and without metadata)

Sightengine had the second-highest average accuracy of all the detector tools, but it’s arguably the most accurate across larger datasets. It only failed to correctly identify one AI image (in the Society and Lifestyle category from Freepik) and one real image.

The platform also maintained the same level of performance with and without metadata, which suggests that its algorithm and testing parameters are already optimized for the most accurate results.

Category Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Politics and Government 7 7 100% 7 100%
History and Culture 7 7 100% 7 100%
Society and Lifestyle 7 6 86% 6 86%
Technology and Innovation 7 7 100% 7 100%
Nature and Environment 7 7 100% 7 100%
Fantasy and Mythology 7 7 100% 7 100%
98% 98%
AI Generator Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Canva 6 6 100% 6 100%
ChatGPT 6 6 100% 6 100%
OpenArt 6 6 100% 6 100%
Microsoft Designer 6 6 100% 6 100%
DaVinci 6 6 100% 6 100%
Freepik 6 5 83% 5 83%
Imagine.Art 6 6 100% 6 100%
98% 98%

AI Image Detector

Column charts of AI Image Detector detection accuracy per AI image category and generator (with and without metadata)

AI Image Detector recorded high accuracy in detecting AI-generated images. It only missed two images — one each in the Technology and Innovation and Nature and Environment categories and from DaVinci and Freepik. When metadata was added, it matched the performance of Sightengine, accurately detecting 41 out of 42 AI-generated photos.

However, the platform’s accuracy dropped significantly when it came to real images. It was only able to correctly recognize 4 real images, misidentifying 27 as AI-generated. It had the lowest accuracy for real photo recognition and is the only platform that failed to correctly classify more than one image.

This brings to question AI Image Detector’s ability to differentiate AI-generated images from real digital photographs. Depending on the user’s need, this may make the tool less than ideal.

Category Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Politics and Government 7 7 100% 7 100%
History and Culture 7 7 100% 7 100%
Society and Lifestyle 7 7 100% 7 100%
Technology and Innovation 7 6 86% 6 86%
Nature and Environment 7 6 86% 7 100%
Fantasy and Mythology 7 7 100% 7 100%
95% 98%
AI Generator Images

Tested
AI Images Identified

(no metadata)
Score AI Images Identified

(with metadata)
Score
Canva 6 6 100% 6 100%
ChatGPT 6 6 100% 6 100%
OpenArt 6 6 100% 6 100%
Microsoft Designer 6 6 100% 6 100%
DaVinci 6 5 83% 5 83%
Freepik 6 5 83% 6 100%
Imagine.Art 6 6 100% 6 100%
95% 98%

Overall Performance Comparison of the Tools Tested

Sightengine had a standout performance, scoring near-total accuracy across a large dataset and receiving the second-highest average accuracy. It also has the most consistent evaluations, with or without metadata. Sightengine is followed closely by ‘Is It AI?’, which recorded full accuracy across the board.

On the other hand, Google Gemini was the worst performer at detecting AI-generated images. It wasn’t able to correctly detect AI-generated photos in half of the image categories, and the inclusion of image metadata recorded only a marginal improvement (7% increase or 3 additional photos identified). Still, it was able to correctly recognize 30 out of 31 real images.

AI Image Detector’s behavior was the opposite. It scored a high accuracy percentage (95%) for AI-generated images but mistook 27 real images as AI-generated.

The platform is the only tool in our experiment that recorded such a poor score for the real-images test. This performance brings to question its ability to distinguish AI-generated images from real digital photographs, as opposed to mislabeling all images as AI-generated.

Chart of all the tools' detection results for AI images (with and without metadata) and real images (without metadata)

Notably, most of the AI detection tools we tested are more reliable at confirming the authenticity of real images, even without metadata. Two tools correctly identified 30 out of 31 images (97%), and the rest scored perfectly.

The inclusion of metadata in images (and retention of original image sizes) only marginally improved the detection accuracy of 5 out of 6 tools that didn’t get a perfect score in the round of tests with metadata removed. Only Sightengine didn’t see an improvement, staying at 98% across all categories.

On average, the tools had an accuracy of 83% when metadata was removed and 87% when it was retained. Given the already high accuracy score of most tools when identifying real digital photographs, and to protect the privacy of the photos’ owners, we didn’t test them with metadata.

Overall, most AI image detection tools more accurately identify authentic photos, likely due to the differences in texture, lighting, and other visual elements that may be hard for AI image generators to replicate.

Chart of average detection accuracies per image category and generator (with and without metadata)

Society and Lifestyle photos were the most difficult for detection tools to correctly recognize as AI-generated. The average accuracy is only 61% for images with no metadata and 63% for those with.

On the other hand, Fantasy and Mythology images were the most easily identifiable, logging an average score of 100%, with and without metadata.

While most categories saw only slight increases (2% to 4%) in detection accuracy after the inclusion of metadata, AI photos in Technology and Innovation saw a 16% hike in average scores.

When testing without metadata, AI images generated from Freepik were detected correctly only 58% of the time. That said, photos from the platform also saw the biggest increase in accuracy (+15%) when tested with metadata. This was followed by Canva with a 7% increase.

Notably, the average accuracy scores for images from DaVinci (-1%), Microsoft Designer (-3%), and Imagine.Art (-4%) dropped slightly after testing them with metadata.

The Future of AI Image Generation and Detection

Overall, advancements in AI algorithms have allowed more sophisticated tools to recognize AI-generated images with a fair amount of accuracy, but we’re yet to find a tool that can accurately differentiate AI-generated images from real digital photographs with 100% accuracy over large datasets.

Given the relative novelty of generative AI and its continuing development, a perfectly accurate detection tool may not be available in the foreseeable future. The presence of image metadata has a marginal impact on accuracy scores.

That said, there are already tools — such as Sightengine and ‘Is It AI?’ — that offer relatively reliable results. However, even these may not be accurate enough for an organization to base important decisions on.

Moving forward, AI detection programs need to keep pace with the growth and progression of generative AI to more accurately help organizations and individuals distinguish between authentic and AI-generated content.

For clarifications, inquiries, or further analyses about this research, please don’t hesitate to contact us here.

Appendix

The AI-generated images used in this research

We generated 42 AI images using the following prompts:

Image Generator New File Name Category Generator Prompt
Canva image01.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
Canva image02.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
Canva image03.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
Canva image04.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
Canva image05.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
Canva image06.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
ChatGPT image07.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
ChatGPT image08.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
ChatGPT image09.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
ChatGPT image10.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
ChatGPT image11.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
ChatGPT image12.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
DaVinci image13.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
DaVinci image14.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
DaVinci image15.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
DaVinci image34.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
DaVinci image35.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
DaVinci image36.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
Freepik image16.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
Freepik image17.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
Freepik image18.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
Freepik image19.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
Freepik image37.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
Freepik image38.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
Imagine.Art image20.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
Imagine.Art image21.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
Imagine.Art image22.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
Imagine.Art image39.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
Imagine.Art image40.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
Imagine.Art image41.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
Microsoft Designer image23.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
Microsoft Designer image24.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
Microsoft Designer image25.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
Microsoft Designer image26.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
Microsoft Designer image27.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
Microsoft Designer image42.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
OpenArt image28.jpg Politics and Government Portrait of Abraham Lincoln giving a speech in a futuristic setting, dressed in modern attire.
OpenArt image29.jpg History and Culture Mona Lisa in modern clothes, sitting in a bustling coffee shop in the 21st century.
OpenArt image30.jpg Society and Lifestyle A family gathering where younger members wear modern clothes and elders wear traditional attire.
OpenArt image31.jpg Technology and Innovation A digital avatar navigating through a neon-lit virtual world with floating icons of data and security.
OpenArt image32.jpg Nature and Environment A floating island with waterfalls spilling over the edges and massive plants under a vibrant sunset.
OpenArt image33.jpg Fantasy and Mythology A graceful dragon flying over a misty mountain, scales shimmering under the soft morning light.
We used ExifTool for metadata removal and Canva for image resizing.

We also followed a structured approach in having the AI detection tools evaluate whether our collected images were AI-generated or not. Our standardized prompt was:

“In a grid, tell me if these images are AI-generated or non-AI:

Grid: Image Name, AI-Generated or Real, Explain answer (in short).”

The uniformity in our prompts and image formats ensured that all tools were tested using the same set of parameters and made it easier to compare and analyze the results.

Rate this Article
4.3 Voted by 3 users
You already voted! Undo
This field is required Maximal length of comment is equal 80000 chars Minimal length of comment is equal 10 chars
Any comments?
Required Field Maximal length of comment is equal 5000 chars Minimal length of comment is equal 50 chars
0 out of minimum 50 characters
Reply
View %s replies
View %s reply
Related posts
Show more related posts
We check all user comments within 48 hours to make sure they are from real people like you. We're glad you found this article useful - we would appreciate it if you let more people know about it.
Popup final window
Share this blog post with friends and co-workers right now:

We check all comments within 48 hours to make sure they're from real users like you. In the meantime, you can share your comment with others to let more people know what you think.

Once a month you will receive interesting, insightful tips, tricks, and advice to improve your website performance and reach your digital marketing goals!

So happy you liked it!

Share it with your friends!

1 1 1

Or review us on 1

3624573
50
5000
143199796