Question 1

What are the three attack surfaces of an AI system?

Accepted Answer

An attack surface is any digital location where an unauthorized person might try to break in or cause damage. The three surfaces are development-time threats that occur while the system is being built, runtime security threats that target the conventional servers and infrastructure hosting the model, and threats through use that happen during routine daily operations involving what users type in and what the system answers back.

Question 2

What is the difference between data poisoning and model poisoning?

Accepted Answer

Data poisoning means deliberately injecting malicious or corrupted information into the training dataset to manipulate how the system behaves, which can also affect retrieval augmented generation when an attacker sneaks false information into documents the AI reads. Model poisoning is different because the attacker directly tampers with the mathematical parameters, core architecture, or software libraries of the model itself, which can also happen if you buy a pre-trained system that was tampered with before delivery.

Question 3

How can an AI model be stolen without breaking into the server?

Accepted Answer

Beyond directly downloading the model files, an attacker can send thousands of carefully designed questions to the system and analyze the exact answers it gives to reverse engineer the mathematical logic and build an exact digital clone. It is like a rival chef tasting your signature dish every day until they figure out your secret recipe without ever stepping into your kitchen.

Question 4

What is a prompt injection attack?

Accepted Answer

A prompt injection targets generative systems with specifically crafted text commands that manipulate the system into ignoring its original safety rules, like hypnotizing a loyal guard into opening a vault. An indirect prompt injection happens when the system reads a seemingly normal file containing hidden malicious instructions. Developers defend with structural templates that sanitize inputs and filters that monitor and block inappropriate outputs.

AI Threats: Data Poisoning, Prompt Injection, and Model Theft

What this episode covers

Frequently Asked Questions

What are the three attack surfaces of an AI system?

What is the difference between data poisoning and model poisoning?

How can an AI model be stolen without breaking into the server?

What is a prompt injection attack?

📚 Master the ISACA AAIA Exam!