Instant Access Google.Professional-Machine-Learning-Engineer.v2023-02-17.q116 Actual Practice Test Engine for Free (Page 15)

Question 66

Your team needs to build a model that predicts whether images contain a driver's license, passport, or credit card. The data engineering team already built the pipeline and generated a dataset composed of 10,000 images with driver's licenses, 1,000 images with passports, and 1,000 images with credit cards. You now have to train a model with the following label map: ['driversjicense', 'passport', 'credit_card']. Which loss function should you use?

A.Categorical hinge
B.Binary cross-entropy
C.Categorical cross-entropy
D.Sparse categorical cross-entropy

Question 67

You work for an advertising company and want to understand the effectiveness of your company's latest advertising campaign. You have streamed 500 MB of campaign data into BigQuery. You want to query the table, and then manipulate the results of that query with a pandas dataframe in an Al Platform notebook. What should you do?

A.Use Al Platform Notebooks' BigQuery cell magic to query the data, and ingest the results as a pandas dataframe
B.Download your table from BigQuery as a local CSV file, and upload it to your Al Platform notebook instance Use pandas. read_csv to ingest the file as a pandas dataframe
C.From a bash cell in your Al Platform notebook, use the bq extract command to export the table as a CSV file to Cloud Storage, and then use gsutii cp to copy the data into the notebook Use pandas. read_csv to ingest the file as a pandas dataframe
D.Export your table as a CSV file from BigQuery to Google Drive, and use the Google Drive API to ingest the file into your notebook instance

Question 68

An online reseller has a large, multi-column dataset with one column missing 30% of its data. A Machine Learning Specialist believes that certain columns in the dataset could be used to reconstruct the missing data.
Which reconstruction approach should the Specialist use to preserve the integrity of the dataset?

A.Listwise deletion
B.Last observation carried forward
C.Multiple imputation
D.Mean substitution

Question 69

A data scientist has developed a machine learning translation model for English to Japanese by using Amazon SageMaker's built-in seq2seq algorithm with 500,000 aligned sentence pairs. While testing with sample sentences, the data scientist finds that the translation quality is reasonable for an example as short as five words. However, the quality becomes unacceptable if the sentence is 100 words long.
Which action will resolve the problem?

A.Choose a different weight initialization type.
B.Adjust hyperparameters related to the attention mechanism.
C.Add more nodes to the recurrent neural network (RNN) than the largest sentence's word count.
D.Change preprocessing to use n-grams.

Question 70

You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent. Which data transformation strategy would likely improve the performance of your classifier?

A.Z-normalize all the numeric features.
B.Write your data in TFRecords.
C.Oversample the fraudulent transaction 10 times.
D.Use one-hot encoding on all categorical features.

Question 66

Question 67

Question 68

Question 69

Question 70

Download PDF File