Question 1

What are the Five Vs of big data in AI?

Accepted Answer

The Five Vs are velocity, volume, value, variety, and veracity. Velocity measures how fast new information is generated and moved, volume is the sheer physical size of the stored information, value is the practical benefit a business can extract, variety is the diversity of formats such as text, audio, photos, and video, and veracity is whether the information is accurate, credible, and free from tampering.

Question 2

Why does consent matter when collecting data for AI training?

Accepted Answer

Using personal details for algorithmic training requires explicit permission based on the exact terms agreed during the initial collection. Frameworks like GDPR make this mandatory, and the EU AI Act requires explicit informed consent before real world testing of high risk tools. Organizations must track who opted in or out and be able to remove a user's data from the training pipeline if consent is revoked.

Question 3

What does fit for purpose mean for an AI project?

Accepted Answer

Fit for purpose means the tool is genuinely capable of achieving the specific business goal it was designed for. The three warning signs that a project is not fit for purpose are accessibility problems where the team cannot easily reach the needed data, quality problems where the data lacks the granularity, depth, volume, or veracity required, and regulatory problems where the intended use case is restricted or prohibited by regional law.

Question 4

What is data lag and how do you fix it?

Accepted Answer

Data lag is the gap that opens because training a model can take weeks or months, so by the time the system is deployed the historical data it memorized is already outdated, causing model drift and a drop in real time accuracy. It is solved either by periodically pausing and retraining the model on fresh datasets, or by using Retrieval Augmented Generation (RAG), which looks up up-to-date facts from an external database before answering.

What this episode covers

Frequently Asked Questions

What are the Five Vs of big data in AI?

What does fit for purpose mean for an AI project?

What is data lag and how do you fix it?

📚 Master the ISACA AAIA Exam!

AI Data Collection: Consent, Fit for Purpose, and Data Lag

What this episode covers

Frequently Asked Questions

What are the Five Vs of big data in AI?

Why does consent matter when collecting data for AI training?

What does fit for purpose mean for an AI project?

What is data lag and how do you fix it?

📚 Master the ISACA AAIA Exam!