Question 1

Can you detect custom Stable Diffusion models?

Accepted Answer

Yes. Our engine detects the underlying latent diffusion process artifacts - VAE decoder grid patterns and frequency-domain signatures - shared by all SD variants, including custom fine-tuned models, LoRAs, and community checkpoints. Detection accuracy is highest for unmodified txt2img outputs and remains strong even with post-processing.

Question 2

What about img2img or inpainted images?

Accepted Answer

FauxLens detects AI involvement across the generation spectrum. Pure txt2img outputs are detected with highest confidence. Img2img outputs show mixed signals - the AI-modified regions carry SD artifacts while the original real photo regions show authentic camera noise. Inpainted images are flagged with confidence levels reflecting the degree of AI modification and the proportion of the image that was AI-generated.

Question 3

Can ControlNet images be detected?

Accepted Answer

Yes. ControlNet is a conditioning mechanism that guides Stable Diffusion generation using reference images (depth maps, edge maps, pose skeletons). The output is still generated through the full SD latent diffusion pipeline and inherits the same VAE decoder artifacts and frequency signatures. ControlNet does not mask or remove the generation fingerprint.

Question 4

Does accuracy differ between AUTOMATIC1111 and ComfyUI?

Accepted Answer

Slightly. Different frontends apply different default settings, samplers, and optional post-processing steps. AUTOMATIC1111 and ComfyUI both use the same base SD models, so the core forensic fingerprint is the same. However, AUTOMATIC1111 applies a face restoration pass by default for portrait images (GFPGAN/CodeFormer) which can introduce additional artifacts. ComfyUI typically outputs more raw images. FauxLens is trained on outputs from both frontends.

Question 5

Can anime-style Stable Diffusion fine-tunes be detected?

Accepted Answer

Yes. Anime fine-tunes (Anything V5, Counterfeit, AbyssOrangeMix) use the SD latent diffusion backbone and VAE decoder, so they carry the same core forensic fingerprint. Their visual style is highly distinctive, but the mathematical detection relies on pixel-level statistics rather than visual appearance. Detection accuracy for anime fine-tunes is comparable to the base SD 1.5 detection rate.

Question 6

What is the accuracy for detecting SD used in face swaps?

Accepted Answer

SD-based face swap techniques - particularly those using inpainting to replace faces in real photos - are detected with moderate-to-high confidence. The face region shows SD generation artifacts while the surrounding body and background show real camera noise. FauxLens flags the image as AI-involved and the per-layer analysis shows which forensic signals fired in which regions.

Question 7

Can locally run Stable Diffusion images be detected?

Accepted Answer

Yes. There is no forensic difference between images generated by the SD model running locally versus via an API. The mathematical artifacts are embedded by the model itself, not by the serving infrastructure. Running SD locally without network connectivity does not affect the forensic signature in any way.

Question 8

Does SD3 leave different fingerprints than SD 1.5?

Accepted Answer

Yes. SD3 uses a Multimodal Diffusion Transformer (MMDiT) architecture rather than the U-Net backbone used in SD 1.x and SDXL. This architectural difference produces different artifact patterns - particularly in how attention heads generate fine-scale texture. FauxLens maintains separate detection models for each SD architecture rather than a single cross-version classifier.

Question 9

Can FauxLens detect NSFW Stable Diffusion content?

Accepted Answer

FauxLens detects AI-generated images regardless of their content. The forensic analysis examines pixel-level mathematical patterns, not the semantic content of the image. Detection accuracy is the same for NSFW and SFW SD outputs - the generation artifacts are content-independent.

Question 10

Can I submit multiple Stable Diffusion images for batch processing?

Accepted Answer

The web tool processes one image per submission. For batch processing of multiple images, the FauxLens API supports multiple images per request with optimized throughput. Contact support@fauxlens.com for API access details.

Privacy & Transparency

Detect Stable Diffusion Images

How to Detect Stable Diffusion Images

The Stable Diffusion Ecosystem: Base Models and Fine-Tunes

SDXL, SD3, and Stability AI's Newer Models

Why Stable Diffusion Is the Most Commonly Misused Generator

Getting the Best Detection Results for Stable Diffusion Images

Frequently Asked Questions

Learn More

How to Detect Stable Diffusion Images: A Forensic Deep Dive

The Science of Deception: How AI Detection Works

Every AI Image Has a Hidden Fingerprint - Here's How Forensics Finds It

Can You Tell the Difference? Midjourney vs DALL-E vs Real Photos [Forensic Test]

More Tools