An AI researcher is developing a model to generate synthetic faces for a facial recognition application. During training, they realize that the dataset contains significantly fewer samples of certain ethnic groups, leading to biased model outputs.
Which technique can the researcher use to address this bias?
1. Data augmentation for imbalanced classes
Data augmentation helps balance the dataset by creating new samples from underrepresented groups. By applying transformations such as rotation, cropping, or flipping to the existing images, the researcher can reduce the bias in the dataset and improve the fairness of the model’s outputs.
Reference:
Class Imbalance (CI)
A law firm is deploying a large language model (LLM) to automate the drafting of legal documents. The firm wants to ensure the model is developed responsibly to minimize risks, such as biased outputs.
Which two actions should the firm take?
(Select TWO.)
1. Conduct fairness evaluations on the model’s outputs.
3. Retrain the model with diverse datasets to reduce bias.
Ensuring the model generates unbiased outputs is crucial for responsible AI deployment, especially in sensitive tasks like legal drafting. Using diverse data helps the model generalize better and reduces bias in its predictions.
Reference:
Transform Responsible AI from Theory into Practice
Which functionality is provided by Amazon SageMaker Clarify?
4. Detects possible bias during the data preparation phase.
Amazon SageMaker Clarify helps identify and mitigate bias in machine learning datasets and models, ensuring fairness during both data preparation and model training.
Reference:
Amazon SageMaker Clarify
A company is using a large language model (LLM) on Amazon Bedrock for sentiment analysis. The company wants to classify text passages as positive or negative.
Which prompt engineering strategy should the company use?
1. Provide examples of text passages with their corresponding positive or negative labels, followed by the new passage to classify.
Including labeled examples in the prompt helps guide the LLM by demonstrating how to classify sentiments, improving the model’s accuracy in identifying whether a new passage is positive or negative.
Reference:
Amazon Bedrock
A university student is copying content from a generative AI system to write essays without proper attribution.
Which challenge of responsible generative AI does this scenario represent?
3. Plagiarism
Plagiarism occurs when content generated by AI is copied without proper attribution, leading to ethical and academic concerns. In this scenario, the student is using AI-generated text without crediting the source, which constitutes plagiarism.
Reference:
Transform Responsible AI from Theory into Practice
An AI-driven marketing agency uses machine learning models to predict consumer trends each season. The company’s AI practitioner is preparing a report to explain the models’ behavior and predictions to stakeholders, ensuring transparency and trust in the process.
What should the AI practitioner include in the report to meet these requirements?
3. Partial dependence plots (PDPs) to show how features affect predictions
Partial dependence plots (PDPs) help stakeholders understand how specific features influence the model’s predictions. By showing the relationship between inputs and outputs, PDPs provide insights into the model’s decision-making process, enhancing transparency and explainability.
Reference:
Model Explainability
A company built a deep learning model for object detection and deployed the model to production.
Which AI process occurs when the model analyzes a new image to identify objects?
2. Inference
Inference is the process where a trained model is used to analyze new data and make predictions. In this case, when the deployed object detection model analyzes a new image to identify objects, it is performing inference. The model is applying what it learned during the training phase to make predictions on unseen data.
Reference:
Deploy Models for Inference
An AI practitioner is using a large language model (LLM) to generate content for marketing campaigns. While the content sounds plausible and fact-based, some of the information is actually incorrect.
Which problem is the LLM experiencing?
2. Hallucination
Hallucination refers to the problem where a model generates content that appears factual but is not based on accurate or relevant data. In this case, the LLM is generating plausible-sounding but incorrect marketing content, which is a hallmark of hallucination in language models.
Reference:
Improve LLM performance with Human and AI Feedback on Amazon SageMaker for Amazon Engineering
An educational platform is using a large language model (LLM) to grade student essays automatically. The platform wants to evaluate whether the LLM’s grading process shows any bias toward specific demographics or writing styles.
Which data source should the platform use to evaluate the LLM outputs with the least administrative effort?
2. Standardized benchmark datasets
Standardized benchmark datasets are specifically designed to evaluate models for bias and fairness. These datasets are pre-built and provide a low-effort way to assess how the LLM behaves across different demographics or writing styles, without the need for significant manual data collection or preparation.
Reference:
Transform Responsible AI from Theory into Practice
A healthcare organization is building a generative AI-based solution to recommend treatment plans based on patient data. The organization wants to ensure the AI model operates responsibly and minimizes biases that could negatively impact patient outcomes.
Which actions should the organization take to meet these requirements?
(Select TWO.)
1. Identify potential biases or discrepancies in the patient data.
3. Regularly assess the model’s outputs for fairness and share the results with healthcare regulators.
Identify potential biases or discrepancies in the patient data: By addressing biases or discrepancies in the patient data, the organization ensures that the model doesn’t make biased treatment recommendations that favor or disadvantage certain patient groups.
Regularly assess the model’s outputs for fairness and share the results with healthcare regulators: Ongoing evaluation of the model’s outputs for fairness helps the organization remain compliant with healthcare regulations and ensures the model is used responsibly.
Reference:
Transform Responsible AI from Theory into Practice
A company is using a large language model (LLM) to build a chatbot. The company wants to prevent the chatbot from being tricked into giving harmful answers or exposing sensitive information through clever prompts.
Which action will help reduce these risks?
1. Add safeguards in the prompts to help the LLM detect and block tricky inputs.
Adding safeguards in the prompts teaches the LLM to detect suspicious or harmful inputs and avoid giving unsafe responses or revealing sensitive information. This helps protect the chatbot from being manipulated by malicious prompts.
Reference:
Transform Responsible AI from Theory into Practice
A company uses a machine learning model to analyze footage from a security camera for potential thefts. The company has found that the model disproportionately flags individuals from a particular ethnic group.
Which type of bias is affecting the model output?
2. Sampling bias
Sampling bias occurs when the data used to train the model is not representative of the entire population, leading to biased predictions. In this case, the model might have been trained on data that over-represents or under-represents certain ethnic groups, resulting in disproportionate flagging of individuals from that group.
Reference:
Transform Responsible AI from Theory into Practice
A healthcare company is customizing a foundation model (FM) for diagnostic purposes. The company needs the model to be transparent and explainable in order to comply with healthcare regulations.
Which solution will meet these requirements?
1. Generate metrics, reports, and explanations using Amazon SageMaker Clarify.
Amazon SageMaker Clarify helps in improving transparency and explainability by generating metrics, reports, and explanations about how the model makes its predictions. This is essential for meeting regulatory requirements in healthcare.
Reference:
Amazon SageMaker Clarify
A pharmaceutical company is evaluating its security responsibilities while developing AI-driven drug discovery solutions. The company is using the Generative AI Security Scoping Matrix to assess different approaches.
Which solution scope gives the company the MOST ownership of security responsibilities?
4. Designing and training a new AI model from scratch using proprietary biomedical datasets.
Designing and training a model from scratch gives the company full control and responsibility over the entire lifecycle, including data security, model training, and infrastructure. The company must manage all aspects of compliance and security, giving it the most ownership of the process.
Reference:
Transform Responsible AI from Theory into Practice
An e-commerce company is using a large language model (LLM) to generate product descriptions. The company wants to ensure that the LLM-generated content does not unintentionally promote harmful stereotypes or offensive language. The company needs to review and evaluate the content for such issues with minimal manual intervention.
Which solution should the company use to meet these requirements?
3. Pre-built bias detection tools
Pre-built bias detection tools can automatically scan the LLM-generated content for biases, harmful language, or stereotypes, allowing the company to ensure compliance and maintain quality with minimal manual effort.
Reference:
Pre-training Data Bias
A financial institution is deploying a large language model (LLM) to generate customer financial reports. The institution wants to ensure the model operates responsibly, minimizing risks like generating incorrect or misleading information.
Which two actions should the institution take?
(Select TWO.)
1. Implement human-in-the-loop (HITL) reviews for high-risk outputs.
3. Use explainability tools to make the model’s decision-making process transparent.
Human-in-the-loop (HITL) reviews help ensure that sensitive or high-risk outputs, such as financial reports, are reviewed by human experts to verify accuracy and prevent potential errors.
Explainability tools provide transparency into how the model makes decisions, helping to reduce risks associated with generating misleading information by ensuring stakeholders can understand the model’s reasoning.
Reference:
Transform Responsible AI from Theory into Practice
A company is using a generative AI model to draft business reports. However, the model occasionally generates factual errors or outputs that seem plausible but are incorrect.
Which disadvantage of generative AI is the company facing?
2. Hallucinations
Hallucinations occur when generative AI produces outputs that are factually incorrect or seem plausible but are not based on the input data. This is a common challenge in generative AI models.
Reference:
Overseeing AI Risk in a Rapidly Changing Landscape
A financial company is using a generative AI model to summarize market trends. The model produces slightly different summaries each time, even when given the same input data.
Which disadvantage of generative AI is the company experiencing?
4. Nondeterminism
Nondeterminism refers to the variability in the model’s outputs, meaning the model can generate different responses even when given the same input. This can be problematic in scenarios where consistency is required.
Reference:
What Is Generative AI?
A financial services company is using a large language model (LLM) on Amazon Bedrock to generate personalized financial advice for clients. The company wants to ensure that the model’s outputs are safe, free of harmful content, and compliant with industry regulations.
Which solution will help the company meet these requirements?
1. Implement Guardrails for Amazon Bedrock to filter and monitor the model’s outputs.
Guardrails for Amazon Bedrock help ensure that the model’s outputs are safe and aligned with company policies or regulatory requirements. They enable the company to filter out harmful or non-compliant content, making the use of LLMs in sensitive industries, like financial services, more secure and reliable.
Reference:
Amazon Bedrock Guardrails
A company is using prompt engineering to generate responses from a large language model (LLM). However, they are concerned that attackers might manipulate the prompts to make the model generate inappropriate or harmful content.
Which risk of prompt engineering is the company trying to mitigate?
1. Jailbreaking
Jailbreaking occurs when attackers or users manipulate prompts to bypass restrictions, causing the model to generate inappropriate or harmful content that it is not supposed to produce.
Reference:
Secure RAG Applications Using Prompt Engineering on Amazon Bedrock