Domain 3: AI Life Cycle Stages 1-4 Flashcards

Question

What are the **3 levels of probability** in the **3x3 matrix**?

Answer 1

1. Improbable 2. Occasional 3. Probable

Answer 2

1. Marginal 2. Moderate 3. Critical
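
The probability levels (Answer 1) and severity levels (Answer 2) can be combined into a 3x3 lookup. The sketch below is only an illustration: scoring by multiplying the two ranks and the low/medium/high cut-offs are assumptions, not something the cards or the IAPP define.

```python
# Levels taken from the cards, ordered from least to most serious.
PROBABILITY = ["improbable", "occasional", "probable"]
SEVERITY = ["marginal", "moderate", "critical"]


def risk_score(probability: str, severity: str) -> int:
    """Return a 1-9 score: the product of the two 1-based level ranks.

    Multiplying ranks is an assumed scoring rule used for illustration.
    """
    p = PROBABILITY.index(probability) + 1
    s = SEVERITY.index(severity) + 1
    return p * s


def risk_band(score: int) -> str:
    """Map a score to a band; the cut-offs (3 and 6) are assumptions."""
    if score >= 6:
        return "high"
    if score >= 3:
        return "medium"
    return "low"


print(risk_band(risk_score("occasional", "moderate")))
```

An organization would normally set its own cut-offs to match its risk appetite.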

Answer 3

A tool to **evaluate predictive performance of classification models** and where they get confused.

Answer 4

1. True positive 2. True negative 3. False positive 4. False negative
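
As a minimal sketch, the four outcomes can be tallied from paired actual and predicted labels for a binary classifier. The function name `confusion_counts` and the sample labels are hypothetical.

```python
from collections import Counter


def confusion_counts(actual, predicted):
    """Count the four confusion-matrix outcomes for binary labels (1/0 or True/False)."""
    counts = Counter()
    for a, p in zip(actual, predicted):
        if a and p:
            counts["tp"] += 1  # predicted positive, actually positive
        elif not a and not p:
            counts["tn"] += 1  # predicted negative, actually negative
        elif p:
            counts["fp"] += 1  # predicted positive, actually negative
        else:
            counts["fn"] += 1  # predicted negative, actually positive
    return {key: counts[key] for key in ("tp", "tn", "fp", "fn")}


actual = [1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0]
print(confusion_counts(actual, predicted))
```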

Answer 5

**Unauthorized** person **granted access**.

Answer 6

**Authorized** person **denied access**.

Answer 7

A **framework** to **categorize mitigation actions** early in AI system design and impact assessment.

Answer 8

Systematic measure focused on **day-to-day management and oversight** of AI systems.

## Footnote

Examples: assign system responsibility, conduct audits and reviews, establish feedback mechanisms, respond to feedback and appeals, elevate issues, assign kill switch responsibility.

Answer 9

**Standardized evaluation method** to assess and **compare AI system performance** using specific criteria and metrics.

## Footnote

Example: Stanford Holistic Evaluation of Language Models (HELM).

Answer 10

1. Plan and design. 2. Data collection and preparation. 3. Build and/or select model.* 4. Test, evaluate, verify, validate. 5. Deploy/implement. 6. Ongoing monitoring and maintenance. 7. Decommission/retire.

## Footnote

\* On the exam, IAPP may refer to step 3 as "develop".

Answer 11

1. Define the business problem and objectives. 2. Identify use cases. 3. Determine scope. 4. Evaluate data and data availability. 5. Establish governance structure.

Answer 12

* Establishing metrics to measure system success * Interviewing target users * Conducting market research.

Answer 13

* Impact. * Effort * Fit.

Answer 14

* How will the solution affect the organization? * Is the solution solving a big or small problem?

Answer 15

* What resources will be required to achieve the objective? * What's the timeline?

Answer 16

How well does the proposed solution suit the problem?

Answer 17

* Stakeholder engagement. * Establishing operational controls. * Performing impact assessments * Performing risk assessments.

Answer 18

1. Use case evaluation. 2. Stakeholder mapping. 3. Probability and severity of harms matrix. 4. Risk mitigation hierarchy. 5. Benchmarking. 6. Pre-deployment pilot.

Answer 19

* Probability and severity harms matrix * Risk mitigation hierarchy.

Answer 20

Pre-deployment pilot.

Answer 21

Risk Mitigation Hierarchy

Answer 22

A project management process that maps stakeholder interests to their appropriate functional areas.

Answer 23

Determining the appropriateness of the AI solution for the organization's specific business problem

Answer 24

The **correctness**, **completeness**, and **currency** of data

Answer 25

1. Cleansing 2. Labeling 3. Privacy

Answer 26

* When a model learns the training data too precisely * The model "memorizes" quirks in the training data

Answer 27

* Poor performance on new data sets. * Limited real-world applicability. * Reduced prediction accuracy.

## Footnote

These symptoms will present despite fantastic training performance.

Answer 28

When a model **fails to capture** data complexity

Answer 29

1. Too few parameters 2. Excessive regularization 3. Insufficient features.

Answer 30

* Poor predictions * Low accuracy * Weak performance on all data

Answer 31

Known verified facts that serve as reference data.

Answer 32

* Primary indicator of model performance. * Measures correctness of system outputs.

Answer 33

* Precision * Recall * F1 score.
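
A minimal sketch of how these three metrics follow from true positive, false positive, and false negative counts. The function name and the sample counts are hypothetical; returning 0.0 when a denominator is zero is a convention chosen here, not a universal rule.

```python
def precision_recall_f1(tp: int, fp: int, fn: int):
    """Compute precision, recall, and F1 from confusion-matrix counts."""
    # Precision: of everything predicted positive, how much was correct?
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything actually positive, how much was found?
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


precision, recall, f1 = precision_recall_f1(tp=8, fp=2, fn=8)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```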

Answer 34

Altering the data's format to make it compatible with model training.

Answer 35

* Preparation of data for a machine learning model. * Includes cleaning data.

Answer 36

* Steps taken to adjust a model's output * Done to improve fairness or meet business requirements

Answer 37

* Accuracy and consistency * That the data has not been altered in an unauthorized manner

Answer 38

The **monitoring** of the overall health of the **data ecosystem/pipeline**.

Answer 39

1. Collect 2. Process/use 3. Disclose/share. 4. Store/retain. 5. Destroy.

Answer 40

* Confirm lawful basis for processing. * Monitor data quality and representativeness. * Assess for bias. * Maintain reproducibility logs.

Answer 41

Comprehensive record of everything required to recreate a specific model version.

Answer 42

* Fairness metrics * Drift testing. * Edge case analysis. * Model explainability

Answer 43

* Monitoring real-time data inputs * Human in the loop. * Retraining trigger. * Enforcing access controls and logging

Answer 44

An automated alarm that starts the process of training a new model version.

Answer 45

* Performance-based * Drift-based * Time-based
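
The three trigger types could be checked together roughly as follows. This is a sketch under stated assumptions: `TriggerConfig`, `retraining_reasons`, and all threshold values are hypothetical, and the drift score is assumed to come from some separate drift test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TriggerConfig:
    """Illustrative thresholds; real values depend on the system."""
    min_accuracy: float = 0.90  # performance-based threshold
    max_drift: float = 0.20     # drift-based threshold
    max_age_days: int = 30      # time-based threshold


def retraining_reasons(accuracy, drift_score, model_age_days, cfg=TriggerConfig()):
    """Return the list of trigger types that fired (empty means no retraining)."""
    reasons = []
    if accuracy < cfg.min_accuracy:
        reasons.append("performance")
    if drift_score > cfg.max_drift:
        reasons.append("drift")
    if model_age_days >= cfg.max_age_days:
        reasons.append("time")
    return reasons


print(retraining_reasons(accuracy=0.85, drift_score=0.05, model_age_days=12))
```

Returning the reasons rather than a bare boolean keeps a record of why a new model version was trained, which can feed reproducibility logs.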

Answer 46

Changes to the input data.

Answer 47

Changes to the **relationship** between the **input and output** data.

Answer 48

Secure **archival and deletion** of data, training artifacts, and logs in accordance with legal and regulatory requirements.

Answer 49

* Identify **information overlap**. * **Optimization** * **Removal** of unnecessary features. * Feature **regeneration** to accommodate concept drift

Answer 50

Tool that allows turning specific code **on or off** while the system is running.

## Footnote

AKA: feature toggle
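
A minimal in-memory sketch of the idea. `FeatureFlags`, `recommend`, and the flag name `ai_recommendations` are hypothetical; production systems typically read flags from a configuration service rather than a local dictionary.

```python
class FeatureFlags:
    """Tiny in-memory flag store that can be changed at runtime."""

    def __init__(self, **flags: bool):
        self._flags = dict(flags)

    def is_enabled(self, name: str) -> bool:
        # Unknown flags default to off, the safer choice for new features.
        return self._flags.get(name, False)

    def set(self, name: str, enabled: bool) -> None:
        self._flags[name] = bool(enabled)


def recommend(flags: FeatureFlags, user_id: str) -> str:
    """Route to the AI path only while its flag is on."""
    if flags.is_enabled("ai_recommendations"):
        return f"model-ranked items for {user_id}"
    return f"default items for {user_id}"


flags = FeatureFlags(ai_recommendations=True)
print(recommend(flags, "u1"))
flags.set("ai_recommendations", False)  # switch the feature off without redeploying
print(recommend(flags, "u1"))
```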

Answer 51

1. Improve model performance. 2. Improve effectiveness and reduce cost. 3. Boost model explainability.

Answer 52

Managing an organization's data assets throughout the life cycle

Answer 53

* Data origin and creation details. * Who created and modified the data.

Answer 54

* How data flows through a system * Data transformations and movements * Data dependencies

Answer 55

Legal requirement: the data must be stored and processed **within a jurisdiction's geographical borders**.

Answer 56

Requirement for financial institutions to **verify customer identity**.

Answer 57

* Between structured and unstructured * Has organized properties without a rigid structure

Answer 58

Replacing personal identifiers

Answer 59

Removal of some personal identifiers.

Answer 60

Removal of all personal identifiers so that data subjects cannot be re-identified.

Answer 61

**Modification of sensitive data** so that it has little or no value to unauthorized users.

## Footnote

E.g., masking all but the last 4 digits of an SSN or account number
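
The masking example can be sketched as a small helper. The function name `mask_all_but_last4` and the sample value are hypothetical; keeping separators such as hyphens visible is a design choice made here for readability.

```python
def mask_all_but_last4(value: str, mask_char: str = "*") -> str:
    """Mask every letter or digit except the last four; keep separators as-is."""
    total = sum(ch.isalnum() for ch in value)
    seen = 0
    masked = []
    for ch in value:
        if ch.isalnum():
            seen += 1
            # Only the final four alphanumeric characters stay visible.
            masked.append(ch if seen > total - 4 else mask_char)
        else:
            masked.append(ch)
    return "".join(masked)


print(mask_all_but_last4("123-45-6789"))  # ***-**-6789
```

Values with four or fewer alphanumeric characters are returned unchanged, so very short identifiers would need a stricter rule.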

Answer 62

* Homomorphic encryption. * Secure multi-party computation * Differential privacy. * Federated Learning

Answer 63

Mathematical process to encode data.

Answer 64

Computation/training on encrypted data.

Answer 65

Computation on combined data without revealing any information about the input data.

Answer 66

* Model trains on edge devices. * Model updates from edge devices sent to a single **global model**.
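
One common way to combine device updates into the global model is a weighted average of the device weights (often called federated averaging). The sketch below assumes each device reports a plain list of weights plus the number of local examples it trained on; the function name and the numbers are hypothetical.

```python
def federated_average(client_updates):
    """Combine per-device weights into global weights.

    client_updates: list of (weights, num_examples) pairs, where every
    weights list has the same length. Devices with more local examples
    contribute proportionally more to the global model.
    """
    total_examples = sum(n for _, n in client_updates)
    dim = len(client_updates[0][0])
    return [
        sum(weights[i] * n for weights, n in client_updates) / total_examples
        for i in range(dim)
    ]


# Only weights and counts leave each device; the raw data never does.
updates = [([1.0, 2.0], 1), ([3.0, 4.0], 3)]
print(federated_average(updates))  # [2.5, 3.5]
```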

Answer 67

Private data **remains on the edge device**

Answer 68

* Algorithm **injects noise** into the data set. * Reverse-engineered data is noisy, not original personal data.
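
The card describes noise injected into the data set. The sketch below applies the same idea to a query result instead, using the Laplace mechanism, a standard differential privacy technique. The function names, the sample ages, and the choice of `epsilon` are assumptions for illustration.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    """Draw Laplace(0, scale) noise as the difference of two exponential draws."""
    return rng.expovariate(1 / scale) - rng.expovariate(1 / scale)


def private_count(values, predicate, epsilon: float, rng=None) -> float:
    """Return a noisy count of the values matching predicate.

    A counting query changes by at most 1 when one person is added or
    removed (sensitivity 1), so the noise scale is 1 / epsilon.
    Smaller epsilon means more noise and stronger privacy.
    """
    rng = rng or random.Random()
    true_count = sum(1 for v in values if predicate(v))
    return true_count + laplace_noise(1.0 / epsilon, rng)


ages = [25, 31, 47, 52, 38]
print(private_count(ages, lambda age: age > 30, epsilon=1.0))
```

Each call returns a different noisy value near the true count of 4, so a single answer reveals little about any one individual.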

Answer 69

* Choose system architecture. * Train, validate, test model * Determine appropriate metrics, thresholds

Answer 70

Monitoring of AI systems.

Answer 71

* Minimize risk. * Regulatory compliance. * Implement Responsible AI.

Answer 72

Frameworks, policies, processes, and controls to measure, evaluate, and promote trustworthy AI

Answer 73

Assessment of an AI system to ensure operational compliance with laws, regulations, standards, and policies.

Answer 74

* Human in the loop. * Human out of the loop. * Human on the loop.

Answer 75

* First line * Second line. * Third line.

Answer 76

Business and functional area that **owns and manages the risk**

Answer 77

Individuals that **identify and mitigate** risk on a daily basis

Answer 78

Internal audit team

Answer 79

* Bias. * Accuracy * Reliability. * Robustness * Privacy. * Interpretability * Safety.

Answer 80

* Edge cases * Unseen data * Malicious data * Data to assess system biases

Answer 81

A hypothetical reality that contradicts observed facts

Answer 82

* Model cards. * System cards. * Benchmarks. * Data provenance documentation / datasheets.

Answer 83

A **transparency document** that provides a **high-level overview** of the model(s), its training, and data

Answer 84

* A **framework** of technical and non-technical assessments and documentation * Demonstrates compliance with the **EU AI Act**.

Answer 85

Pre-market deployment.

Answer 86

* AI system provider. * Notified body.

Answer 87

An evaluation that determines whether the **same team** can obtain the **same results** multiple times under **identical conditions**.

Answer 88

Assessment of a model using **malicious inputs**.

## Footnote

AKA red teaming

Answer 89

Process by which threats are identified, listed, and countermeasures prioritized.

Answer 90

Describes a model that fails when minor tweaks are made to input data

Answer 91

When new data **overwrites or weakens** weights in LLMs

Answer 92

* Business use case. * Timeline. * Transparency documentation for regulators and consumers. * User interface copy. * Acceptable use policy. * Frequently asked questions.

Answer 93

* Test * Evaluate * Verify * Validate.

Answer 94

* Assess the model's performance across different dimensions. * Ensure model meets business requirements.

Answer 95

Does the model work as intended?

Answer 96

How well does the system perform overall?

Answer 97

Was the system built correctly?

Answer 98

Does the system meet stakeholder requirements?

Answer 99

Sequentially builds simple models where **each improves on the previous one**.

Answer 100

The structure, components, and organization of an AI model.

Answer 101

* Straightforward data processing * Data travels in one direction, from input to output.

Answer 102

Use of multiple layers to filter and extract distinctive features from input data.

## Footnote

Excels in classification and visual tasks

Answer 103

Process data bi-directionally.

Answer 104

* Process data represented in graph structures. * Understand and analyze how data points are connected in a **social network**.

Answer 105

Use attention to learn relationships between components of the input.

## Footnote

E.g., words in a sentence or sentences in a paragraph

Answer 106

* Training multiple models. * Synthesizing models' outputs.

Answer 107

* Training the **same model** on **different subsets** of data. * Aggregating outputs of each model.

Examine responsible AI practices in system design, development, and testing. (131 cards)