Security, Compliance, and Governance for AI Solutions Flashcards

Apply knowledge of AI-specific security concerns such as prompt injection and content moderation to ensure compliant and secure systems. (20 cards)

1
Q

A company is utilizing the Amazon Titan foundation model (FM) via Amazon Bedrock. To improve the model’s output, the company needs to incorporate data from its own private data sources.

What solution should the company implement to meet this requirement?

  1. Use a different foundation model.
  2. Choose a lower temperature value.
  3. Create an Amazon Bedrock knowledge base.
  4. Enable model invocation logging.
A

3. Create an Amazon Bedrock knowledge base.

Create an Amazon Bedrock knowledge base is correct because creating a knowledge base in Amazon Bedrock allows the model to integrate private data sources. The knowledge base enables the foundation model to access relevant internal data during inference, which can improve the quality and relevance of the model’s responses.

  • Use a different foundation model is incorrect because the need here is to incorporate private data, not to switch models. Using a different foundation model will not address the requirement to supplement the model with the company’s internal data.
  • Choose a lower temperature value is incorrect because the temperature parameter affects the randomness of the model’s output, not its data sources. Lowering the temperature can make responses more focused, but it does not enable the use of private data in the model’s processing.
  • Enable model invocation logging is incorrect because model invocation logging helps in tracking and auditing model usage, but it does not help incorporate private data into the model’s output. Logging does not address the requirement for using internal data sources during inference.

Reference:
Amazon Bedrock

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which option is a use case for generative AI models?

  1. Improving network security by using intrusion detection systems.
  2. Creating photorealistic images from text descriptions for digital marketing.
  3. Enhancing database performance by using optimized indexing.
  4. Analyzing financial data to forecast stock market trends.
A

2. Creating photorealistic images from text descriptions for digital marketing.

Generative AI models excel at creating new content, such as images, text, or audio. One of the primary applications of these models is generating visual content from text prompts, which is highly valuable in fields like digital marketing for producing creative, unique visuals.

  • Improving network security by using intrusion detection systems is incorrect because intrusion detection systems rely on monitoring patterns and anomalies, which is more suited for traditional machine learning models, not generative AI. Generative AI is typically not used for security applications like detecting network intrusions.
  • Enhancing database performance by using optimized indexing is incorrect because improving database performance involves algorithms and techniques focused on query optimization and indexing, which are not tasks suited for generative AI models. Generative AI is used for creating new data, not for database optimization.
  • Analyzing financial data to forecast stock market trends is incorrect because predicting stock market trends typically involves statistical models or machine learning algorithms that analyze historical data patterns. Generative AI models are not designed for this kind of data forecasting, but for creating new content like images, text, or even audio.

Reference:
Transform your business with generative AI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A company wants to train a large language model (LLM) using only its private data. In addition to performance, the company is focused on minimizing the environmental footprint during training.

Which Amazon EC2 instance type should the company choose to achieve this?

  1. Amazon EC2 M series
  2. Amazon EC2 Inf series
  3. Amazon EC2 P series
  4. Amazon EC2 Trn series
A

4. Amazon EC2 Trn series

The EC2 Trn series (using AWS Trainium processors) is designed to optimize the energy efficiency of large-scale machine learning training tasks. It consumes less power while delivering high-performance training for LLMs, reducing the environmental impact.

  • Amazon EC2 M series is incorrect because the M series is a general-purpose instance type and is not optimized for energy efficiency or machine learning model training, making it less suitable for minimizing environmental impact.
  • Amazon EC2 Inf series is incorrect because the Inf series is optimized for inference tasks rather than training. While it can provide efficient inference, it is not designed for the training phase, especially for LLMs.
  • Amazon EC2 P series is incorrect because the P series is optimized for high-performance GPU workloads, but it is less efficient in terms of energy consumption compared to the Trn series, which is built specifically for efficient ML training.

Reference:
AWS Trainium

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

An e-commerce company is using Amazon Bedrock to power a product recommendation system. The company wants to ensure that the system does not generate recommendations based on customers’ sensitive personal information, such as payment details or personal addresses. Additionally, the company requires notifications when any policy violations occur.

Which solution meets these requirements?

  1. Use Amazon Macie to scan the recommendation system’s output for sensitive data and configure alerts for policy violations.
  2. Set up AWS CloudTrail to monitor the system’s output and notify the company when sensitive data is detected.
  3. Implement Guardrails for Amazon Bedrock to prevent sensitive content from being included in recommendations. Configure Amazon CloudWatch alarms for policy violation notifications.
  4. Enable Amazon SageMaker Model Monitor to track data quality and notify the company if sensitive data is found in the training data.
A

3. Implement Guardrails for Amazon Bedrock to prevent sensitive content from being included in recommendations. Configure Amazon CloudWatch alarms for policy violation notifications.

Guardrails can help filter out sensitive data from being used or included in the system’s responses. Combined with CloudWatch alarms, the company can receive alerts whenever violations occur.

  • Use Amazon Macie to scan the recommendation system’s output for sensitive data and configure alerts for policy violations is incorrect because Amazon Macie is primarily used for scanning data in S3 buckets for sensitive information, not for monitoring model outputs in real-time.
  • Set up AWS CloudTrail to monitor the system’s output and notify the company when sensitive data is detected is incorrect because CloudTrail is used for logging API calls and activities in AWS, not for detecting sensitive data in real-time model outputs.
  • Enable Amazon SageMaker Model Monitor to track data quality and notify the company if sensitive data is found in the training data is incorrect because SageMaker Model Monitor is designed to track model performance and data drift, not for filtering sensitive information in real-time outputs or ensuring compliance.

Reference:
Amazon Bedrock Guardrails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A financial institution has trained a large language model (LLM) on Amazon Bedrock using a dataset that contains sensitive financial records. The institution needs to ensure the model does not generate responses that reveal or are influenced by the confidential financial data.

What action should the institution take to prevent this?

  1. Apply dynamic data masking to hide sensitive data in inference responses.
  2. Encrypt the sensitive data in inference outputs using Amazon SageMaker.
  3. Use AWS Key Management Service (AWS KMS) to encrypt the sensitive data within the model.
  4. Delete the trained model, remove the sensitive financial data from the dataset, and retrain the model.
A

4. Delete the trained model, remove the sensitive financial data from the dataset, and retrain the model.

Once a model is trained on sensitive data, the only way to ensure it doesn’t generate responses based on that data is to remove the sensitive data from the training set and retrain the model. This guarantees the model won’t use confidential information in its responses.

  • Apply dynamic data masking to hide sensitive data in inference responses is incorrect because masking hides specific fields in outputs but doesn’t prevent the model from generating content influenced by the sensitive data.
  • Encrypt the sensitive data in inference outputs using Amazon SageMaker is incorrect because encryption protects data in transit or at rest but doesn’t stop the model from referencing the sensitive data during inference.
  • Use AWS Key Management Service (AWS KMS) to encrypt the sensitive data within the model is incorrect because encryption ensures data protection but doesn’t prevent the model from generating outputs that may still be influenced by the sensitive data.

Reference:
Data Protection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

An AI researcher is using an Amazon Bedrock base model to generate product descriptions for an e-commerce platform. The researcher needs to store logs of each model invocation, including input and output data, for later review and analysis.

What is the best strategy to meet this requirement?

  1. Configure AWS CloudTrail to log the model’s input and output data.
  2. Enable invocation logging in Amazon Bedrock to track inputs and outputs.
  3. Use AWS Audit Manager to log the input and output data of the model.
  4. Set up logging through Amazon EventBridge for capturing model responses.
A

2. Enable invocation logging in Amazon Bedrock to track inputs and outputs.

Amazon Bedrock provides native invocation logging, which allows users to store input and output data for each invocation. This logging is essential for tracking model performance and ensuring data integrity during operations.

  • Configure AWS CloudTrail to log the model’s input and output data is incorrect because AWS CloudTrail logs API activity and service-level actions, but it doesn’t log the specific inputs and outputs of model invocations.
  • Use AWS Audit Manager to log the input and output data of the model is incorrect because AWS Audit Manager is used for auditing and compliance checks, not for capturing real-time invocation logs of AI models.
  • Set up logging through Amazon EventBridge for capturing model responses is incorrect because Amazon EventBridge routes events between AWS services but is not intended for storing or managing detailed model invocation logs.

Reference:
Monitor model invocation using CloudWatch Logs and Amazon S3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A company wants to develop a large language model (LLM) application using Amazon Bedrock with customer data stored in Amazon S3. The company’s security policy mandates that each team can only access data for their own customers.

Which solution will meet these requirements?

  1. Create an Amazon Bedrock custom service role for each team that has access to only the team’s customer data.
  2. Assign one service role to all teams and use Amazon S3 event triggers to restrict data access based on customer information.
  3. Use AWS Secrets Manager to encrypt customer data and allow each team to decrypt only their specific customer information.
  4. Set up a shared Bedrock role and log data access with Amazon CloudWatch to monitor unauthorized access attempts.
A

1. Create an Amazon Bedrock custom service role for each team that has access to only the team’s customer data.

Creating a separate custom service role for each team ensures that access to customer data is restricted in accordance with the company’s security policies. By using custom roles, each team only has permissions to access the specific data associated with their customers in Amazon S3.

This strategy aligns with the principle of least privilege, providing granular control over data access and ensuring that teams do not have unauthorized access to another team’s data. It also simplifies management, as each team can only perform operations within the bounds of their assigned role, preventing accidental or malicious access to restricted information.

  • Assign one service role to all teams and use Amazon S3 event triggers to restrict data access based on customer information is incorrect because event triggers are not designed to control real-time access to specific data. A single role for all teams does not provide the necessary access control, and event triggers are more suited for initiating workflows, not for restricting data access dynamically.
  • Use AWS Secrets Manager to encrypt customer data and allow each team to decrypt only their specific customer information is incorrect because AWS Secrets Manager is used for managing secrets, such as API keys or database credentials, rather than controlling access to large datasets. While encryption ensures data protection, it does not solve the problem of restricting access to customer-specific data in S3.
  • Set up a shared Bedrock role and log data access with Amazon CloudWatch to monitor unauthorized access attempts is incorrect because logging access with CloudWatch provides visibility into unauthorized actions but does not prevent unauthorized access from happening in the first place. A shared role would still allow teams to access data they shouldn’t, violating the security policy.

Reference:
Policies and Permissions in AWS Identity and Access Management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A financial services company is deploying a chatbot using a fine-tuned Amazon SageMaker JumpStart model to handle customer queries about loans. The company must ensure that the chatbot complies with various financial regulatory frameworks for secure data handling.

Which two capabilities can the company demonstrate to meet these compliance requirements?

  1. Automated scaling of model inference
  2. Intrusion detection and monitoring
  3. Encryption of sensitive data
  4. Optimizing server uptime
  5. Using containerized microservices for scaling
A

2. Intrusion detection and monitoring
3. Encryption of sensitive data

Intrusion detection and monitoring is correct because compliance in the financial sector often requires monitoring for suspicious or unauthorized activity, ensuring the system is protected from potential breaches.

Encryption of sensitive data is correct because encrypting customer and financial data is critical to comply with regulatory standards such as PCI-DSS and GDPR. Proper encryption ensures that sensitive information is protected both in transit and at rest.

  • Automated scaling of model inference is incorrect because auto-scaling improves resource efficiency but does not directly address regulatory compliance.
  • Optimizing server uptime is incorrect because ensuring uptime is important for reliability but does not fulfill compliance requirements related to security and data protection.
  • Using containerized microservices for scaling is incorrect because microservices are an architectural choice for building scalable applications but do not directly relate to compliance with security and data protection regulations.

Reference:
AWS Compliance Programs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A cybersecurity company regularly assesses its internal processes with assistance from independent software vendors (ISVs). The company requires email notifications when compliance reports from the ISVs are available for review.

Which AWS service can the company use to meet this requirement?

  1. AWS Audit Manager
  2. AWS Artifact
  3. AWS Security Hub
  4. Amazon SNS (Simple Notification Service)
A

2. AWS Artifact

AWS Artifact provides access to compliance-related documents, such as security and compliance reports from third-party ISVs. The company can use AWS Artifact to download and monitor these reports and configure notifications when new reports are available.

  • AWS Audit Manager is incorrect because AWS Audit Manager helps automate the process of auditing by collecting evidence, but it does not provide access to compliance reports from ISVs.
  • AWS Security Hub is incorrect because AWS Security Hub is focused on centralizing security findings from AWS services, not on handling compliance reports from ISVs.
  • Amazon SNS (Simple Notification Service) is incorrect because while SNS handles notifications, it is not a service that directly manages or provides compliance reports.

Reference:
AWS Artifact

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A financial institution is using Amazon Bedrock to build an AI application hosted in a VPC. Due to regulatory compliance standards, the VPC must not have any internet access.

Which AWS service or feature will help meet these requirements?

  1. Amazon Macie
  2. Amazon Route 53
  3. NAT gateway
  4. AWS PrivateLink
A

4. AWS PrivateLink

AWS PrivateLink enables the financial institution to securely access Amazon Bedrock services from within a VPC without exposing traffic to the public internet. This ensures compliance with regulations that restrict internet access.

  • Amazon Macie is incorrect because Amazon Macie is used for identifying sensitive data and monitoring security risks, not for managing VPC internet access.
  • Amazon Route 53 is incorrect because Amazon Route 53 is a DNS service, and it doesn’t provide the private connectivity needed to avoid internet exposure.
  • NAT gateway is incorrect because a NAT gateway allows private instances to access the internet, which contradicts the requirement of preventing internet access.

Reference:
AWS PrivateLink

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

A cybersecurity firm wants to use AI to enhance the protection of its web application from potential threats. The AI solution must be able to identify whether an IP address originates from a suspicious source.

Which solution will meet these requirements?

  1. Build a chatbot for user support.
  2. Develop an anomaly detection system.
  3. Create a system for translating IP logs into multiple languages.
  4. Implement a text summarization system.
A

2. Develop an anomaly detection system.

Anomaly detection systems are designed to identify unusual patterns or deviations from normal behavior. In this case, it can be used to detect suspicious IP addresses by identifying traffic patterns or access behaviors that do not match normal activity, providing an extra layer of security.

  • Build a chatbot for user support is incorrect because chatbots are designed for customer interaction, not for identifying or analyzing security threats.
  • Create a system for translating IP logs into multiple languages is incorrect because translating logs does not help in detecting suspicious IP addresses or enhancing security.
  • Implement a text summarization system is incorrect because text summarization condenses large pieces of text and is unrelated to identifying security threats from IP addresses.

Reference:
What is Anomaly Detection?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A security company is using Amazon Bedrock to run foundation models (FMs). The company wants to ensure that only authorized users can invoke the models and needs to detect any unauthorized access attempts to refine AWS Identity and Access Management (IAM) policies.

Which AWS service should the company use to identify unauthorized users trying to access Amazon Bedrock?

  1. AWS Shield
  2. AWS CloudTrail
  3. Amazon Macie
  4. Amazon Inspector
A

2. AWS CloudTrail

AWS CloudTrail records all API calls and actions across AWS services, including attempts to invoke Amazon Bedrock models. By reviewing these logs, the company can identify unauthorized access attempts and set appropriate IAM policies for future model use.

  • AWS Shield is incorrect because AWS Shield provides protection against distributed denial of service (DDoS) attacks, not for identifying unauthorized access attempts within AWS services.
  • Amazon Macie is incorrect because Macie is designed to detect sensitive data in Amazon S3, not for monitoring access attempts to AWS services.
  • Amazon Inspector is incorrect because Amazon Inspector is used to assess security vulnerabilities in EC2 instances, not to track or identify unauthorized users attempting to access AWS services.

Reference:
AWS CloudTrail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A legal firm is using a foundation model (FM) from Amazon Bedrock to power its AI legal search tool. The firm wants to fine-tune the model using its own proprietary legal documents to improve the tool’s accuracy.

Which strategy will successfully fine-tune the model?

  1. Organize the dataset into a JSON file and upload it directly to the model for fine-tuning.
  2. Purchase additional storage on Amazon S3 to store the training data.
  3. Integrate AWS CloudWatch for real-time model monitoring and adjustments.
  4. Provide labeled data with the prompt field and the completion field.
A

4. Provide labeled data with the prompt field and the completion field.

Providing labeled data with specific inputs (prompts) and expected outputs (completions) allows the foundation model to learn from the company’s proprietary documents. Fine-tuning requires structured data with clear prompts and corresponding completions to improve the model’s performance in generating accurate responses for legal searches.

  • Organize the dataset into a JSON file and upload it directly to the model for fine-tuning is incorrect because while data format is important, uploading a simple JSON file without labeled prompts and completions does not ensure effective fine-tuning.
  • Purchase additional storage on Amazon S3 to store the training data is incorrect because buying storage does not directly help in fine-tuning the model. The focus should be on preparing the data for model training, not storage.
  • Integrate AWS CloudWatch for real-time model monitoring and adjustments is incorrect because AWS CloudWatch is used for monitoring and logging, but it does not contribute to the actual fine-tuning process of the model.

Reference:
Prompt engineering concepts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A financial services company wants to use large language models (LLMs) securely on Amazon Bedrock for processing sensitive financial data. The company needs to ensure secure access and prevent unauthorized users from interacting with the models.

How can the company securely use LLMs on Amazon Bedrock?

  1. Use prompt design to minimize errors and set up strict access controls with AWS Identity and Access Management (IAM) roles.
  2. Enable Amazon GuardDuty to monitor for suspicious activity on the models.
  3. Use AWS WAF to protect the LLMs from unauthorized traffic and requests.
  4. Configure AWS Direct Connect to provide low-latency access to the models.
A

1. Use prompt design to minimize errors and set up strict access controls with AWS Identity and Access Management (IAM) roles.

By designing prompts carefully to reduce potential errors and configuring IAM roles with least privilege access, the company can ensure that only authorized users can interact with the models. This is critical for securing sensitive financial data and maintaining a controlled environment.

  • Enable Amazon GuardDuty to monitor for suspicious activity on the models is incorrect because GuardDuty monitors for security threats and vulnerabilities but does not directly control access or manage the secure use of LLMs on Amazon Bedrock.
  • Use AWS WAF to protect the LLMs from unauthorized traffic and requests is incorrect because AWS WAF focuses on protecting web applications from threats like SQL injection and does not handle the access control needs for using LLMs on Amazon Bedrock.
  • Configure AWS Direct Connect to provide low-latency access to the models is incorrect because Direct Connect provides dedicated network connections, but it doesn’t address security concerns for controlling access to the LLMs.

Reference:
AWS Identity and Access Management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

A financial services company is using a foundation model (FM) on Amazon Bedrock to power an AI assistant for customer queries. The FM needs access to encrypted transaction data stored in an Amazon S3 bucket. The data is encrypted using Amazon S3 managed keys (SSE-S3), and the FM encounters a failure when attempting to retrieve the data.

Which solution will resolve the issue?

  1. Ensure the IAM role assumed by Amazon Bedrock has permissions to decrypt the S3 data with the SSE-S3 encryption key.
  2. Disable encryption on the S3 bucket to allow the FM unrestricted access to the data.
  3. Use a Lambda function to decrypt the data before providing it to the FM.
  4. Adjust the model’s prompt to specify that it should access encrypted data from the S3 bucket.
A

1. Ensure the IAM role assumed by Amazon Bedrock has permissions to decrypt the S3 data with the SSE-S3 encryption key.

To access the encrypted transaction data in the S3 bucket, Amazon Bedrock needs the proper IAM role with permissions to decrypt the data using the SSE-S3 encryption key. Without these permissions, the FM cannot access the data, resulting in the failure. Adding decryption permissions resolves this issue securely.

  • Disable encryption on the S3 bucket to allow the FM unrestricted access to the data is incorrect because disabling encryption compromises data security, which is not advisable for sensitive financial information.
  • Use a Lambda function to decrypt the data before providing it to the FM is incorrect because a more direct solution is to give the FM appropriate permissions, rather than introducing additional complexity with Lambda functions.
  • Adjust the model’s prompt to specify that it should access encrypted data from the S3 bucket is incorrect because the issue is related to IAM permissions and encryption, not the content of the prompt.

Reference:
IAM Roles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A company is developing an AI-based document processing system. While the system automates most tasks, the company wants to ensure that sensitive or uncertain AI-generated results are reviewed by humans for accuracy and compliance.

Which feature of Amazon Augmented AI (A2I) helps meet this requirement?

  1. Automatically generates human-in-the-loop workflows for manual review of sensitive data.
  2. Offers real-time model retraining based on human feedback to improve AI accuracy.
  3. Enables human reviewers to validate AI-generated predictions through pre-built workflows.
  4. Automatically labels training data without requiring human intervention.
A

3. Enables human reviewers to validate AI-generated predictions through pre-built workflows.

Amazon Augmented AI (A2I) allows companies to integrate human-in-the-loop workflows, ensuring that human reviewers can validate AI-generated results when accuracy or compliance is critical. This feature helps improve the quality of AI predictions by incorporating human oversight into specific tasks.

  • Automatically generates human-in-the-loop workflows for manual review of sensitive data is incorrect because while A2I enables human-in-the-loop workflows, these workflows need to be configured by the user and are not automatically generated.
  • Offers real-time model retraining based on human feedback to improve AI accuracy is incorrect because A2I focuses on human validation rather than automatically retraining the model in real-time based on feedback.
  • Automatically labels training data without requiring human intervention is incorrect because A2I is specifically designed to involve human intervention where necessary, not for automatically labeling data.

Reference:
Amazon Augmented AI

17
Q

A healthcare company is deploying a chatbot using a fine-tuned Amazon SageMaker JumpStart model to assist patients with medical inquiries. The company must ensure that the chatbot complies with healthcare regulations regarding data privacy and security.

Which two capabilities can the company demonstrate to meet these compliance requirements?

(Select TWO.)

  1. Encryption of sensitive data
  2. Monitoring and logging access to patient information
  3. Automated scaling of model inference
  4. Using multi-language support to comply with regional regulations
  5. Optimizing server response times to reduce latency
A

1. Encryption of sensitive data
2. Monitoring and logging access to patient information

Encryption of sensitive data ensures that patient information is securely protected both in transit and at rest, which is a critical requirement for healthcare regulations like HIPAA.

Monitoring and logging access to patient information allows the company to track who accesses sensitive data, which is essential for compliance with healthcare regulations and maintaining data security.

  • Automated scaling of model inference is incorrect because while automated scaling improves performance, it doesn’t directly address compliance requirements for secure data handling.
  • Using multi-language support to comply with regional regulations is incorrect because multi-language support is not a core compliance requirement for data privacy and security.
  • Optimizing server response times to reduce latency is incorrect because reducing latency improves user experience but doesn’t address compliance with regulations regarding secure data handling.

Reference:
AWS Compliance

18
Q

A retail company needs to ensure its customer data is stored and processed in compliance with various regulatory standards. The company wants to automate the process of accessing security and compliance reports from third-party vendors to review their adherence to these standards.

Which AWS service can the company use to meet this requirement?

  1. AWS Security Hub
  2. AWS Artifact
  3. Amazon Macie
  4. Amazon CloudWatch
A

2. AWS Artifact

AWS Artifact provides access to compliance-related reports and security documentation from third-party vendors. It allows companies to download reports that demonstrate adherence to regulatory standards and review the compliance of their vendors.

  • AWS Security Hub is incorrect because AWS Security Hub aggregates security findings from multiple AWS accounts and services but does not provide access to compliance reports from third-party vendors.
  • Amazon Macie is incorrect because Amazon Macie is used for data security, specifically detecting sensitive data and potential data breaches, not for accessing compliance reports.
  • Amazon CloudWatch is incorrect because CloudWatch is used for monitoring and logging performance metrics of AWS resources, not for accessing security or compliance reports.

Reference:
AWS Artifact

19
Q

A company is developing a machine learning model and needs to ensure that their ML pipelines are repeatable and scalable as the project grows. They want to automate tasks like data preprocessing, model training, and deployment while ensuring consistency across experiments.

Which MLOps practice should the company implement?

  1. Experimentation
  2. Data augmentation
  3. Model monitoring
  4. Automation of repeatable processes
A

4. Automation of repeatable processes

Automation of repeatable processes is essential for ensuring that tasks like data preprocessing, model training, and deployment can be consistently reproduced at scale. Automating these tasks helps ensure scalability, repeatability, and consistency across multiple experiments and model iterations.

  • Experimentation is incorrect because experimentation involves trying different models or techniques, but doesn’t specifically address the need for automation and repeatability.
  • Data augmentation is incorrect because data augmentation refers to increasing the dataset size by creating variations of the data, not automating ML processes.
  • Model monitoring is incorrect because model monitoring focuses on tracking model performance in production, not automating tasks during development and deployment.

Reference:
MLOps Checklist Components

20
Q

A media company is developing an AI system to analyze large amounts of user-generated content, such as videos and images. The company needs to ensure that any sensitive information, such as email addresses or phone numbers, is detected and protected before using the data in its AI pipelines.

Which AWS service should the company use to meet this requirement?

  1. Amazon Macie
  2. Amazon Rekognition
  3. AWS Shield
  4. Amazon Inspector
A

1. Amazon Macie

Amazon Macie automatically detects and helps secure sensitive information, such as email addresses and phone numbers, within datasets. It is ideal for ensuring that sensitive data is identified and protected before being processed in AI systems.

  • Amazon Rekognition is incorrect because Rekognition is used for analyzing and identifying objects, faces, and scenes in images and videos, but it doesn’t focus on detecting sensitive data like PII.
  • AWS Shield is incorrect because AWS Shield provides DDoS protection for web applications but does not help with detecting or securing sensitive data in datasets.
  • Amazon Inspector is incorrect because Inspector is used for security vulnerability assessments and not for detecting or securing sensitive information in content.

Reference:
What is Amazon Macie?