Question 1

Why is tokenized and vector data sensitive in AI?

Accepted Answer

Before deep learning can process text or images, the raw material is tokenized into smaller standardized chunks saved as binary files. Although these binary files, embeddings, and vector representations look like gibberish to a human, they contain the organization's core data and require strict access restrictions and encryption just like any traditional document.

Question 2

What is homomorphic encryption and why is it not widely used?

Accepted Answer

Homomorphic encryption is an advanced cryptographic technique that allows a machine to perform calculations and learn from data while the data remains securely locked in an encrypted state, like a scientist handling chemicals through gloves in a sealed glass box. It requires an enormous amount of computational power, making it too slow and expensive for everyday use right now, so organizations rely on layered defenses known as defense in depth.

Question 3

What should you back up in an AI system?

Accepted Answer

Because the original source files are likely already backed up elsewhere, the audit focus is on the unique artifacts generated during development: the cleaned, post-processed data, the binary files containing the tokens, the model weights, and the architecture parameters. This archiving is critical because generative systems are nondeterministic, so you must be able to reconstruct the environment to explain how a decision was made.

Question 4

What integrity attacks threaten AI systems?

Accepted Answer

The main attacks are data poisoning, where an attacker contaminates the training material so the machine learns the wrong lessons; model tampering, where a hacker alters the structural blueprints or learned weights; and embedding tampering, where the mathematical representations of concepts are corrupted. Integrity can also be destroyed accidentally by flawed extract, transform, and load logic that drops or corrupts data.

AI Data Security: Encoding, Access, Backup, and Integrity

What this episode covers

Frequently Asked Questions

Why is tokenized and vector data sensitive in AI?

What is homomorphic encryption and why is it not widely used?

What should you back up in an AI system?

What integrity attacks threaten AI systems?

📚 Master the ISACA AAIA Exam!