Guidelines for Responsible AI Flashcards

Recognize how to design and evaluate AI systems that are fair, explainable, and aligned with ethical and legal standards. (20 cards)

1
Q

An AI researcher is developing a model to generate synthetic faces for a facial recognition application. During training, they realize that the dataset contains significantly fewer samples of certain ethnic groups, leading to biased model outputs.

Which technique can the researcher use to address this bias?

  1. Data augmentation for imbalanced classes
  2. Model monitoring for accuracy drift
  3. Retrieval Augmented Generation (RAG)
  4. Edge detection for image processing
A

1. Data augmentation for imbalanced classes

Data augmentation helps balance the dataset by creating new samples from underrepresented groups. By applying transformations such as rotation, cropping, or flipping to the existing images, the researcher can reduce the bias in the dataset and improve the fairness of the model’s outputs.

  • Model monitoring for accuracy drift is incorrect because monitoring for accuracy drift helps track model performance over time but does not correct bias in the training data. It identifies performance issues but doesn’t resolve them.
  • Retrieval Augmented Generation (RAG) is incorrect because RAG is a method for improving generative models by retrieving relevant information from external sources, which doesn’t help balance the data or correct the underlying bias in this scenario.
  • Edge detection for image processing is incorrect because edge detection is a technique used to identify boundaries in images, which is unrelated to addressing dataset bias. It focuses on feature extraction, not on balancing data or mitigating bias.

Reference:
Class Imbalance (CI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A law firm is deploying a large language model (LLM) to automate the drafting of legal documents. The firm wants to ensure the model is developed responsibly to minimize risks, such as biased outputs.

Which two actions should the firm take?

(Select TWO.)

  1. Conduct fairness evaluations on the model’s outputs.
  2. Adjust the model’s temperature to increase output variety.
  3. Retrain the model with diverse datasets to reduce bias.
  4. Limit training epochs to prevent overfitting.
  5. Apply regularization techniques to tune hyperparameters.
A

1. Conduct fairness evaluations on the model’s outputs.
3. Retrain the model with diverse datasets to reduce bias.

Ensuring the model generates unbiased outputs is crucial for responsible AI deployment, especially in sensitive tasks like legal drafting. Using diverse data helps the model generalize better and reduces bias in its predictions.

  • Adjust the model’s temperature to increase output variety: Adjusting the temperature controls output randomness but does not address bias or fairness.
  • Limit training epochs to prevent overfitting: While useful for preventing overfitting, this doesn’t address bias or fairness concerns in model outputs.
  • Apply regularization techniques to tune hyperparameters: Regularization can improve model performance but does not directly address fairness or bias.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which functionality is provided by Amazon SageMaker Clarify?

  1. Implements a Retrieval Augmented Generation (RAG) pipeline.
  2. Tracks the performance of machine learning models in production.
  3. Captures key metadata about machine learning models.
  4. Detects possible bias during the data preparation phase.
A

4. Detects possible bias during the data preparation phase.

Amazon SageMaker Clarify helps identify and mitigate bias in machine learning datasets and models, ensuring fairness during both data preparation and model training.

  • Implements a Retrieval Augmented Generation (RAG) pipeline is incorrect because RAG is a method used to improve generative models by retrieving relevant information, and SageMaker Clarify does not handle RAG workflows.
  • Tracks the performance of machine learning models in production is incorrect because tracking model performance in production is handled by tools like SageMaker Model Monitor, not SageMaker Clarify.
  • Captures key metadata about machine learning models is incorrect because documenting metadata about models is a feature of SageMaker Model Cards, not SageMaker Clarify.

Reference:
Amazon SageMaker Clarify

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A company is using a large language model (LLM) on Amazon Bedrock for sentiment analysis. The company wants to classify text passages as positive or negative.

Which prompt engineering strategy should the company use?

  1. Provide examples of text passages with their corresponding positive or negative labels, followed by the new passage to classify.
  2. Include a thorough explanation of sentiment analysis techniques and how LLMs work in the prompt.
  3. Input the new text passage without any examples or context and ask the model to classify it.
  4. Include the new text passage along with examples of other tasks, like text summarization or translation, in the prompt.
A

1. Provide examples of text passages with their corresponding positive or negative labels, followed by the new passage to classify.

Including labeled examples in the prompt helps guide the LLM by demonstrating how to classify sentiments, improving the model’s accuracy in identifying whether a new passage is positive or negative.

  • Include a thorough explanation of sentiment analysis techniques and how LLMs work in the prompt is incorrect because providing an explanation of the underlying techniques does not help the LLM in performing the specific task of sentiment classification. The model responds better to concrete examples rather than theoretical explanations.
  • Input the new text passage without any examples or context and ask the model to classify it is incorrect because not providing examples or context decreases the model’s ability to accurately perform sentiment classification, especially if it lacks prior instructions on how to approach the task.
  • Include the new text passage along with examples of other tasks, like text summarization or translation, in the prompt is incorrect because mixing different tasks like summarization or translation dilutes the model’s focus on sentiment analysis and can lead to incorrect results.

Reference:
Amazon Bedrock

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A university student is copying content from a generative AI system to write essays without proper attribution.

Which challenge of responsible generative AI does this scenario represent?

  1. Misinformation
  2. Hallucinations
  3. Plagiarism
  4. Bias
A

3. Plagiarism

Plagiarism occurs when content generated by AI is copied without proper attribution, leading to ethical and academic concerns. In this scenario, the student is using AI-generated text without crediting the source, which constitutes plagiarism.

  • Misinformation is incorrect because misinformation refers to false or inaccurate information being generated, not copying content without proper credit.
  • Hallucinations is incorrect because hallucinations refer to AI generating false or misleading information that wasn’t part of the original input.
  • Bias is incorrect because bias refers to unfair or unbalanced outputs from AI systems, not the unauthorized copying of content.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

An AI-driven marketing agency uses machine learning models to predict consumer trends each season. The company’s AI practitioner is preparing a report to explain the models’ behavior and predictions to stakeholders, ensuring transparency and trust in the process.

What should the AI practitioner include in the report to meet these requirements?

  1. The raw data that was used to test the model
  2. A summary of the model’s memory usage during training
  3. Partial dependence plots (PDPs) to show how features affect predictions
  4. The dataset schema
A

3. Partial dependence plots (PDPs) to show how features affect predictions

Partial dependence plots (PDPs) help stakeholders understand how specific features influence the model’s predictions. By showing the relationship between inputs and outputs, PDPs provide insights into the model’s decision-making process, enhancing transparency and explainability.

  • The raw data that was used to test the model is incorrect because providing raw data doesn’t explain how the model makes predictions. Raw data alone does not enhance the interpretability of the model.
  • A summary of the model’s memory usage during training is incorrect because memory usage is related to model efficiency, not its transparency or explainability.
  • The dataset schema is incorrect because while the schema shows the structure of the dataset, it does not help explain how the model makes decisions.

Reference:
Model Explainability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A company built a deep learning model for object detection and deployed the model to production.

Which AI process occurs when the model analyzes a new image to identify objects?

  1. Training
  2. Inference
  3. Model deployment
  4. Bias correction
A

2. Inference

Inference is the process where a trained model is used to analyze new data and make predictions. In this case, when the deployed object detection model analyzes a new image to identify objects, it is performing inference. The model is applying what it learned during the training phase to make predictions on unseen data.

  • Training is incorrect because training refers to the process of teaching the model using a labeled dataset. The model has already been trained and is now being used to make predictions.
  • Model deployment is incorrect because model deployment refers to the process of moving the trained model into a production environment. The question focuses on what happens after the model has been deployed.
  • Bias correction is incorrect because bias correction involves adjusting the model to prevent biased outputs. The question does not mention any bias-related issues, and it focuses on object detection in a production environment.

Reference:
Deploy Models for Inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

An AI practitioner is using a large language model (LLM) to generate content for marketing campaigns. While the content sounds plausible and fact-based, some of the information is actually incorrect.

Which problem is the LLM experiencing?

  1. Data leakage
  2. Hallucination
  3. Feature selection
  4. Gradient explosion
A

2. Hallucination

Hallucination refers to the problem where a model generates content that appears factual but is not based on accurate or relevant data. In this case, the LLM is generating plausible-sounding but incorrect marketing content, which is a hallmark of hallucination in language models.

  • Data leakage is incorrect because data leakage happens when the model has access to information during training that it wouldn’t have during real-world use, which leads to overly optimistic results. It does not explain why the generated content is incorrect.
  • Feature selection is incorrect because feature selection refers to choosing the most relevant input variables in traditional machine learning models. It does not apply to the generation of incorrect text by LLMs.
  • Gradient explosion is incorrect because gradient explosion refers to issues in the training process of deep learning models, where gradients grow uncontrollably, but it is unrelated to generating incorrect but plausible text.

Reference:
Improve LLM performance with Human and AI Feedback on Amazon SageMaker for Amazon Engineering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

An educational platform is using a large language model (LLM) to grade student essays automatically. The platform wants to evaluate whether the LLM’s grading process shows any bias toward specific demographics or writing styles.

Which data source should the platform use to evaluate the LLM outputs with the least administrative effort?

  1. Historical grading records
  2. Standardized benchmark datasets
  3. Grading rubrics
  4. Peer-reviewed journal articles
A

2. Standardized benchmark datasets

Standardized benchmark datasets are specifically designed to evaluate models for bias and fairness. These datasets are pre-built and provide a low-effort way to assess how the LLM behaves across different demographics or writing styles, without the need for significant manual data collection or preparation.

  • Historical grading records is incorrect because using historical grading records requires significant manual work to analyze for bias, making it less efficient than using benchmark datasets.
  • Grading rubrics is incorrect because rubrics provide grading criteria but do not serve as a data source for evaluating the model’s fairness or bias.
  • Peer-reviewed journal articles is incorrect because journal articles can provide theoretical insights but are not practical data sources for evaluating model outputs for bias.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A healthcare organization is building a generative AI-based solution to recommend treatment plans based on patient data. The organization wants to ensure the AI model operates responsibly and minimizes biases that could negatively impact patient outcomes.

Which actions should the organization take to meet these requirements?

(Select TWO.)

  1. Identify potential biases or discrepancies in the patient data.
  2. Run the model continuously to improve prediction consistency.
  3. Regularly assess the model’s outputs for fairness and share the results with healthcare regulators.
  4. Use F1-score evaluations to ensure the model provides the highest accuracy in all cases.
  5. Ensure that the model processes data at a sufficient speed for real-time recommendations.
A

1. Identify potential biases or discrepancies in the patient data.
3. Regularly assess the model’s outputs for fairness and share the results with healthcare regulators.

Identify potential biases or discrepancies in the patient data: By addressing biases or discrepancies in the patient data, the organization ensures that the model doesn’t make biased treatment recommendations that favor or disadvantage certain patient groups.

Regularly assess the model’s outputs for fairness and share the results with healthcare regulators: Ongoing evaluation of the model’s outputs for fairness helps the organization remain compliant with healthcare regulations and ensures the model is used responsibly.

  • Run the model continuously to improve prediction consistency is incorrect because continuously running the model does not directly address the issue of bias or fairness in its outputs.
  • Use F1-score evaluations to ensure the model provides the highest accuracy in all cases is incorrect because the F1 score measures model performance, but it does not address fairness, transparency, or bias in healthcare recommendations.
  • Ensure that the model processes data at a sufficient speed for real-time recommendations is incorrect because processing speed does not impact responsible AI usage or the reduction of bias in patient treatment recommendations.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A company is using a large language model (LLM) to build a chatbot. The company wants to prevent the chatbot from being tricked into giving harmful answers or exposing sensitive information through clever prompts.

Which action will help reduce these risks?

  1. Add safeguards in the prompts to help the LLM detect and block tricky inputs.
  2. Increase the number of tokens the model can process to handle more context.
  3. Use a model that updates frequently for better accuracy.
  4. Lower the temperature setting to make the model’s responses shorter.
A

1. Add safeguards in the prompts to help the LLM detect and block tricky inputs.

Adding safeguards in the prompts teaches the LLM to detect suspicious or harmful inputs and avoid giving unsafe responses or revealing sensitive information. This helps protect the chatbot from being manipulated by malicious prompts.

  • Increase the number of tokens the model can process to handle more context is incorrect because increasing the token limit affects how much text the model can handle but doesn’t prevent manipulation.
  • Use a model that updates frequently for better accuracy is incorrect because frequent updates improve accuracy but don’t directly prevent the chatbot from being tricked by malicious prompts.
  • Lower the temperature setting to make the model’s responses shorter is incorrect because lowering the temperature makes responses more predictable, but it doesn’t protect against prompt manipulation.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A company uses a machine learning model to analyze footage from a security camera for potential thefts. The company has found that the model disproportionately flags individuals from a particular ethnic group.

Which type of bias is affecting the model output?

  1. Measurement bias
  2. Sampling bias
  3. Cognitive bias
  4. Algorithmic bias
A

2. Sampling bias

Sampling bias occurs when the data used to train the model is not representative of the entire population, leading to biased predictions. In this case, the model might have been trained on data that over-represents or under-represents certain ethnic groups, resulting in disproportionate flagging of individuals from that group.

  • Measurement bias is incorrect because measurement bias occurs when data is inaccurately measured or recorded, not when specific groups are disproportionately flagged.
  • Cognitive bias is incorrect because cognitive bias refers to the biases in human decision-making, not the biases that occur in training data for machine learning models.
  • Algorithmic bias is incorrect because while algorithmic bias can happen, the issue here is rooted in the data sampling process rather than the algorithm itself.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A healthcare company is customizing a foundation model (FM) for diagnostic purposes. The company needs the model to be transparent and explainable in order to comply with healthcare regulations.

Which solution will meet these requirements?

  1. Generate metrics, reports, and explanations using Amazon SageMaker Clarify.
  2. Configure security and compliance settings using AWS Shield.
  3. Encrypt and secure the data using AWS Key Management Service (KMS).
  4. Use Amazon Transcribe to convert medical reports into text for further analysis.
A

1. Generate metrics, reports, and explanations using Amazon SageMaker Clarify.

Amazon SageMaker Clarify helps in improving transparency and explainability by generating metrics, reports, and explanations about how the model makes its predictions. This is essential for meeting regulatory requirements in healthcare.

  • Configure security and compliance settings using AWS Shield is incorrect because AWS Shield is focused on protecting against DDoS attacks and is unrelated to model transparency and explainability.
  • Encrypt and secure the data using AWS Key Management Service (KMS) is incorrect because while securing data is important, encryption does not address transparency or explainability requirements for the model’s predictions.
  • Use Amazon Transcribe to convert medical reports into text for further analysis is incorrect because Amazon Transcribe is for speech-to-text conversion, which is unrelated to ensuring model transparency or explainability.

Reference:
Amazon SageMaker Clarify

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A pharmaceutical company is evaluating its security responsibilities while developing AI-driven drug discovery solutions. The company is using the Generative AI Security Scoping Matrix to assess different approaches.

Which solution scope gives the company the MOST ownership of security responsibilities?

  1. Using a pre-built AI model from a third-party platform to analyze clinical trial data.
  2. Developing a healthcare chatbot using an existing generative AI service hosted by a cloud provider.
  3. Fine-tuning a third-party generative AI model with patient records and research data.
  4. Designing and training a new AI model from scratch using proprietary biomedical datasets.
A

4. Designing and training a new AI model from scratch using proprietary biomedical datasets.

Designing and training a model from scratch gives the company full control and responsibility over the entire lifecycle, including data security, model training, and infrastructure. The company must manage all aspects of compliance and security, giving it the most ownership of the process.

  • Using a pre-built AI model from a third-party platform to analyze clinical trial data is incorrect because the security and maintenance of the model are primarily managed by the third-party platform, leaving the company with minimal control.
  • Developing a healthcare chatbot using an existing generative AI service hosted by a cloud provider is incorrect because the cloud provider handles much of the foundational security for the AI model, leaving the company with less responsibility.
  • Fine-tuning a third-party generative AI model with patient records and research data is incorrect because, while the company is responsible for the fine-tuning process, the core model’s security and maintenance remain the responsibility of the third-party provider.

Reference:
Transform Responsible AI from Theory into Practice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

An e-commerce company is using a large language model (LLM) to generate product descriptions. The company wants to ensure that the LLM-generated content does not unintentionally promote harmful stereotypes or offensive language. The company needs to review and evaluate the content for such issues with minimal manual intervention.

Which solution should the company use to meet these requirements?

  1. User feedback reports
  2. Internal moderation guidelines
  3. Pre-built bias detection tools
  4. Historical product reviews
A

3. Pre-built bias detection tools

Pre-built bias detection tools can automatically scan the LLM-generated content for biases, harmful language, or stereotypes, allowing the company to ensure compliance and maintain quality with minimal manual effort.

  • User feedback reports is incorrect because relying on user feedback introduces delays and requires more manual review after the content has been generated and published.
  • Internal moderation guidelines is incorrect because guidelines alone do not automatically review or detect bias. They require manual enforcement, which increases administrative overhead.
  • Historical product reviews is incorrect because product reviews are user-generated content and do not help directly in evaluating the bias of newly generated product descriptions.

Reference:
Pre-training Data Bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A financial institution is deploying a large language model (LLM) to generate customer financial reports. The institution wants to ensure the model operates responsibly, minimizing risks like generating incorrect or misleading information.

Which two actions should the institution take?

(Select TWO.)

  1. Implement human-in-the-loop (HITL) reviews for high-risk outputs.
  2. Adjust the learning rate to optimize the model’s performance.
  3. Use explainability tools to make the model’s decision-making process transparent.
  4. Limit the token length to reduce the complexity of the generated reports.
  5. Increase the model’s batch size to handle large datasets efficiently.
A

1. Implement human-in-the-loop (HITL) reviews for high-risk outputs.
3. Use explainability tools to make the model’s decision-making process transparent.

Human-in-the-loop (HITL) reviews help ensure that sensitive or high-risk outputs, such as financial reports, are reviewed by human experts to verify accuracy and prevent potential errors.

Explainability tools provide transparency into how the model makes decisions, helping to reduce risks associated with generating misleading information by ensuring stakeholders can understand the model’s reasoning.

  • Adjust the learning rate to optimize the model’s performance is incorrect because adjusting the learning rate is related to model optimization, not directly addressing the risks of generating incorrect or biased outputs.
  • Limit the token length to reduce the complexity of the generated reports is incorrect because limiting token length doesn’t necessarily improve the model’s accuracy or responsibility in generating financial reports.
  • Increase the model’s batch size to handle large datasets efficiently is incorrect because increasing batch size focuses on processing efficiency and does not address minimizing risks like generating incorrect or misleading information.

Reference:
Transform Responsible AI from Theory into Practice

17
Q

A company is using a generative AI model to draft business reports. However, the model occasionally generates factual errors or outputs that seem plausible but are incorrect.

Which disadvantage of generative AI is the company facing?

  1. Nondeterminism
  2. Hallucinations
  3. Interpretability
  4. Inaccuracy
A

2. Hallucinations

Hallucinations occur when generative AI produces outputs that are factually incorrect or seem plausible but are not based on the input data. This is a common challenge in generative AI models.

  • Nondeterminism is incorrect because nondeterminism refers to variability in the model’s outputs when given the same input, not producing incorrect information.
  • Interpretability is incorrect because interpretability refers to how easily the model’s decisions can be understood, not the generation of incorrect outputs.
  • Inaccuracy is incorrect because while hallucinations lead to inaccuracy, the specific issue here is that the model generates information that was never part of the input data.

Reference:
Overseeing AI Risk in a Rapidly Changing Landscape

18
Q

A financial company is using a generative AI model to summarize market trends. The model produces slightly different summaries each time, even when given the same input data.

Which disadvantage of generative AI is the company experiencing?

  1. Hallucinations
  2. Interpretability
  3. Inaccuracy
  4. Nondeterminism
A

4. Nondeterminism

Nondeterminism refers to the variability in the model’s outputs, meaning the model can generate different responses even when given the same input. This can be problematic in scenarios where consistency is required.

  • Hallucinations is incorrect because hallucinations refer to generating factually incorrect outputs, which is not the issue described here.
  • Interpretability is incorrect because interpretability refers to how easily the model’s reasoning can be understood, not the variability of its output.
  • Inaccuracy is incorrect because inaccuracy refers to incorrect predictions, but the issue here is that the model is inconsistent rather than outright incorrect.

Reference:
What Is Generative AI?

19
Q

A financial services company is using a large language model (LLM) on Amazon Bedrock to generate personalized financial advice for clients. The company wants to ensure that the model’s outputs are safe, free of harmful content, and compliant with industry regulations.

Which solution will help the company meet these requirements?

  1. Implement Guardrails for Amazon Bedrock to filter and monitor the model’s outputs.
  2. Use Amazon SageMaker to train the model on additional financial datasets.
  3. Enable multi-language support to improve response diversity.
  4. Use Amazon Comprehend to analyze the sentiment of the model’s outputs.
A

1. Implement Guardrails for Amazon Bedrock to filter and monitor the model’s outputs.

Guardrails for Amazon Bedrock help ensure that the model’s outputs are safe and aligned with company policies or regulatory requirements. They enable the company to filter out harmful or non-compliant content, making the use of LLMs in sensitive industries, like financial services, more secure and reliable.

  • Use Amazon SageMaker to train the model on additional financial datasets is incorrect because training the model on new data does not ensure that the model will avoid generating harmful or non-compliant content in real time.
  • Enable multi-language support to improve response diversity is incorrect because multi-language support improves language coverage but does not address concerns around safety or compliance of model outputs.
  • Use Amazon Comprehend to analyze the sentiment of the model’s outputs is incorrect because sentiment analysis focuses on determining the tone of the text and does not provide filtering or compliance checks.

Reference:
Amazon Bedrock Guardrails

20
Q

A company is using prompt engineering to generate responses from a large language model (LLM). However, they are concerned that attackers might manipulate the prompts to make the model generate inappropriate or harmful content.

Which risk of prompt engineering is the company trying to mitigate?

  1. Jailbreaking
  2. Poisoning
  3. Model hijacking
  4. Data exposure
A

1. Jailbreaking

Jailbreaking occurs when attackers or users manipulate prompts to bypass restrictions, causing the model to generate inappropriate or harmful content that it is not supposed to produce.

  • Poisoning is incorrect because poisoning refers to maliciously altering the training data to negatively affect model behavior, not prompt manipulation.
  • Model hijacking is incorrect because model hijacking refers to an attacker taking control of the model’s behavior, not manipulating prompts to bypass restrictions.
  • Data exposure is incorrect because data exposure refers to the unintended leakage of sensitive information, not the manipulation of model prompts.

Reference:
Secure RAG Applications Using Prompt Engineering on Amazon Bedrock