Amazon (MLS-C01) Exam Questions And Answers page 2
A company is converting a large number of unstructured paper receipts into images. The company wants to create a model based on natural language processing (NLP) to find relevant entities such as date, location, and notes, as well as some custom entities such as receipt numbers.
The company is using optical character recognition (OCR) to extract text for data labeling. However, documents are in different structures and formats, and the company is facing challenges with setting up the manual workflows for each document type. Additionally, the company trained a named entity recognition (NER) model for custom entity detection using a small sample size. This model has a very low confidence score and will require retraining with a large dataset.
Which solution for text extraction and entity detection will require the LEAST amount of effort?
The company is using optical character recognition (OCR) to extract text for data labeling. However, documents are in different structures and formats, and the company is facing challenges with setting up the manual workflows for each document type. Additionally, the company trained a named entity recognition (NER) model for custom entity detection using a small sample size. This model has a very low confidence score and will require retraining with a large dataset.
Which solution for text extraction and entity detection will require the LEAST amount of effort?
Extract text from receipt images by using a deep learning OCR model from the AWS Marketplace. Use the NER deep learning model to extract entities.
Extract text from receipt images by using Amazon Textract. Use Amazon Comprehend for entity detection, and use Amazon Comprehend custom entity recognition for custom entity detection.
Extract text from receipt images by using a deep learning OCR model from the AWS Marketplace. Use Amazon Comprehend for entity detection, and use Amazon Comprehend custom entity recognition for custom entity detection.
Exploratory Data Analysis
Model Development
A company is launching a new product and needs to build a mechanism to monitor comments about the company and its new product on social media. The company needs to be able to evaluate the sentiment expressed in social media posts, and visualize trends and configure alarms based on various thresholds.
The company needs to implement this solution quickly, and wants to minimize the infrastructure and data science resources needed to evaluate the messages. The company already has a solution in place to collect posts and store them within an Amazon S3 bucket.
What services should the data science team use to deliver this solution?
The company needs to implement this solution quickly, and wants to minimize the infrastructure and data science resources needed to evaluate the messages. The company already has a solution in place to collect posts and store them within an Amazon S3 bucket.
What services should the data science team use to deliver this solution?
Train a model in Amazon SageMaker by using the BlazingText algorithm to detect sentiment in the corpus of social media posts. Expose an endpoint that can be called by AWS Lambda. Trigger a Lambda function when posts are added to the S3 bucket to invoke the endpoint and record the sentiment in an Amazon DynamoDB table and in a custom Amazon CloudWatch metric. Use CloudWatch alarms to notify analysts of trends.
Train a model in Amazon SageMaker by using the semantic segmentation algorithm to model the semantic content in the corpus of social media posts. Expose an endpoint that can be called by AWS Lambda. Trigger a Lambda function when objects are added to the S3 bucket to invoke the endpoint and record the sentiment in an Amazon DynamoDB table. Schedule a second Lambda function to query recently added records and send an Amazon Simple Notification Service (Amazon SNS) notification to notify analysts of trends.
Trigger an AWS Lambda function when social media posts are added to the S3 bucket. Call Amazon Comprehend for each post to capture the sentiment in the message and record the sentiment in an Amazon DynamoDB table. Schedule a second Lambda function to query recently added records and send an Amazon Simple Notification Service (Amazon SNS) notification to notify analysts of trends.
Trigger an AWS Lambda function when social media posts are added to the S3 bucket. Call Amazon Comprehend for each post to capture the sentiment in the message and record the sentiment in a custom Amazon CloudWatch metric and in S3. Use CloudWatch alarms to notify analysts of trends.
Exploratory Data Analysis
Model Development
A company is observing low accuracy while training on the default built-in image classification algorithm in Amazon SageMaker. The Data Science team wants to use an Inception neural network architecture instead of a ResNet architecture.
Which of the following will accomplish this? (Choose two.)
Which of the following will accomplish this? (Choose two.)
Customize the built-in image classification algorithm to use Inception and use this for model training.
Create a support case with the SageMaker team to change the default image classification algorithm to Inception.
Bundle a Docker container with TensorFlow Estimator loaded with an Inception network and use this for model training.
Use custom code in Amazon SageMaker with TensorFlow Estimator to load the model with an Inception network, and use this for model training.
Download and apt-get install the inception network code into an Amazon EC2 instance and use this instance as a Jupyter notebook in Amazon SageMaker.
Machine Learning Implementation and Operations
AWS Machine Learning Services
A company is running a machine learning prediction service that generates 100 TB of predictions every day. A Machine Learning Specialist must generate a visualization of the daily precision-recall curve from the predictions, and forward a read-only version to the Business team.
Which solution requires the LEAST coding effort?
Which solution requires the LEAST coding effort?
Run a daily Amazon EMR workflow to generate precision-recall data, and save the results in Amazon S3. Give the Business team read-only access to S3.
Generate daily precision-recall data in Amazon QuickSight, and publish the results in a dashboard shared with the Business team.
Run a daily Amazon EMR workflow to generate precision-recall data, and save the results in Amazon S3. Visualize the arrays in Amazon QuickSight, and publish them in a dashboard shared with the Business team.
Generate daily precision-recall data in Amazon ES, and publish the results in a dashboard shared with the Business team.
Exploratory Data Analysis
A company is setting up an Amazon SageMaker environment. The corporate data security policy does not allow communication over the internet.
How can the company enable the Amazon SageMaker service without enabling direct internet access to Amazon SageMaker notebook instances?
How can the company enable the Amazon SageMaker service without enabling direct internet access to Amazon SageMaker notebook instances?
Create Amazon SageMaker VPC interface endpoints within the corporate VPC.
Create VPC peering with Amazon VPC hosting Amazon SageMaker.
Create a NAT gateway within the corporate VPC.
Route Amazon SageMaker traffic through an on-premises network.
Exploratory Data Analysis
Model Development
A company is using Amazon Polly to translate plaintext documents to speech for automated company announcements. However, company acronyms are being mispronounced in the current documents.
How should a Machine Learning Specialist address this issue for future documents?
How should a Machine Learning Specialist address this issue for future documents?
Convert current documents to SSML with pronunciation tags.
Create an appropriate pronunciation lexicon.
Output speech marks to guide in pronunciation.
Use Amazon Lex to preprocess the text files for pronunciation
Model Development
Machine Learning Implementation and Operations
A company is using Amazon Textract to extract textual data from thousands of scanned text-heavy legal documents daily. The company uses this information to process loan applications automatically. Some of the documents fail business validation and are returned to human reviewers, who investigate the errors. This activity increases the time to process the loan applications.
What should the company do to reduce the processing time of loan applications?
What should the company do to reduce the processing time of loan applications?
Configure Amazon Textract to route low-confidence predictions to Amazon SageMaker Ground Truth. Perform a manual review on those words before performing a business validation.
Use an Amazon Textract synchronous operation instead of an asynchronous operation.
Configure Amazon Textract to route low-confidence predictions to Amazon Augmented AI (Amazon A2I). Perform a manual review on those words before performing a business validation.
Use Amazon Rekognition's feature to detect text in an image to extract the data from scanned images. Use this information to process the loan applications.
Data Engineering
A company needs to quickly make sense of a large amount of data and gain insight from it. The data is in different formats, the schemas change frequently, and new data sources are added regularly. The company wants to use AWS services to explore multiple data sources, suggest schemas, and enrich and transform the data. The solution should require the least possible coding effort for the data flows and the least possible infrastructure management.
Which combination of AWS services will meet these requirements?
Which combination of AWS services will meet these requirements?
• Amazon EMR for data discovery, enrichment, and transformation
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
• Amazon Kinesis Data Analytics for data ingestion
• Amazon EMR for data discovery, enrichment, and transformation
• Amazon Redshift for querying and analyzing the results in Amazon S3
• Amazon EMR for data discovery, enrichment, and transformation
• Amazon Redshift for querying and analyzing the results in Amazon S3
• AWS Glue for data discovery, enrichment, and transformation
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
• AWS Data Pipeline for data transfer
• AWS Step Functions for orchestrating AWS Lambda jobs for data discovery, enrichment, and transformation
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
• AWS Step Functions for orchestrating AWS Lambda jobs for data discovery, enrichment, and transformation
• Amazon Athena for querying and analyzing the results in Amazon S3 using standard SQL
• Amazon QuickSight for reporting and getting insights
Exploratory Data Analysis
Machine Learning Implementation and Operations
A company offers an online shopping service to its customers. The company wants to enhance the site s security by requesting additional information when customers access the site from locations that are different from their normal location. The company wants to update the process to call a machine learning (ML) model to determine when additional information should be requested.
The company has several terabytes of data from its existing ecommerce web servers containing the source IP addresses for each request made to the web server. For authenticated requests, the records also contain the login name of the requesting user.
Which approach should an ML specialist take to implement the new security feature in the web application?
The company has several terabytes of data from its existing ecommerce web servers containing the source IP addresses for each request made to the web server. For authenticated requests, the records also contain the login name of the requesting user.
Which approach should an ML specialist take to implement the new security feature in the web application?
Use Amazon SageMaker Ground Truth to label each record as either a successful or failed access attempt. Use Amazon SageMaker to train a binary classification model using the factorization machines (FM) algorithm.
Use Amazon SageMaker to train a model using the IP Insights algorithm. Schedule updates and retraining of the model using new log data nightly.
Use Amazon SageMaker Ground Truth to label each record as either a successful or failed access attempt. Use Amazon SageMaker to train a binary classification model using the IP Insights algorithm.
Use Amazon SageMaker to train a model using the Object2Vec algorithm. Schedule updates and retraining of the model using new log data nightly.
Model Development
Machine Learning Implementation and Operations
A company provisions Amazon SageMaker notebook instances for its data science team and creates Amazon VPC interface endpoints to ensure communication between the VPC and the notebook instances. All connections to the Amazon SageMaker API are contained entirely and securely using the AWS network. However, the data science team realizes that individuals outside the VPC can still connect to the notebook instances across the internet.
Which set of actions should the data science team take to fix the issue?
Which set of actions should the data science team take to fix the issue?
Modify the notebook instances' security group to allow traffic only from the CIDR ranges of the VPC. Apply this security group to all of the notebook instances' VPC interfaces.
Create an IAM policy that allows the sagemaker:CreatePresignedNotebooklnstanceUrl and sagemaker:DescribeNotebooklnstance actions from only the VPC endpoints. Apply this policy to all IAM users, groups, and roles used to access the notebook instances.
Add a NAT gateway to the VPC. Convert all of the subnets where the Amazon SageMaker notebook instances are hosted to private subnets. Stop and start all of the notebook instances to reassign only private IP addresses.
Change the network ACL of the subnet the notebook is hosted in to restrict access to anyone outside the VPC.
Exploratory Data Analysis
Model Development
Comments