Question 1:
Given a feature set with rows that contain missing continuous values, and assuming the data is normally distributed, what is the best way to fill in these missing features?
A. Fill in missing features with the average of observed values for that feature in the entire dataset.
B. Fill in missing features with random values for that feature in the training set.
C. Delete entire rows that contain any missing features.
D. Delete entire columns that contain any missing features.
Correct answer: A
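Mean imputation, as described in option A, can be sketched in a few lines. This is a minimal illustration using only the Python standard library; the function name `impute_with_mean` and the sample `heights` column are hypothetical and not part of the exam material.

```python
import math
from statistics import fmean

def impute_with_mean(values):
    """Replace NaN entries with the mean of the observed (non-NaN) entries."""
    observed = [v for v in values if not math.isnan(v)]
    if not observed:
        raise ValueError("no observed values to average")
    mean = fmean(observed)
    return [mean if math.isnan(v) else v for v in values]

# Hypothetical feature column with two missing entries.
heights = [170.0, float("nan"), 180.0, 175.0, float("nan")]
filled = impute_with_mean(heights)
print(filled)  # [170.0, 175.0, 180.0, 175.0, 175.0]
```

For roughly symmetric data such as a normal distribution, the mean is a reasonable central estimate, and this approach keeps every row and column, unlike options C and D.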
Question 2:
Which of the following describes a neural network without an activation function?
A. An unsupervised learning technique
B. A form of a linear regression
C. A form of a quantile regression
D. A radial basis function kernel
Correct answer: B
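The reason a network without activation functions reduces to a linear model is that stacked affine layers compose into a single affine layer. The sketch below checks this numerically for a small two-layer case; the weights, biases, and helper names (`affine`, `matmul`) are made up for illustration.

```python
def affine(W, b, x):
    """Compute W @ x + b for a list-of-rows matrix W and vectors b, x."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

# Hypothetical two-layer network with no activation between the layers.
W1, b1 = [[1.0, 2.0], [0.0, -1.0]], [0.5, 1.0]
W2, b2 = [[3.0, 1.0]], [2.0]

x = [4.0, -2.0]
two_layer = affine(W2, b2, affine(W1, b1, x))

# The same mapping collapsed into one layer: W = W2 W1, b = W2 b1 + b2.
W = matmul(W2, W1)
b = affine(W2, b2, b1)
one_layer = affine(W, b, x)

print(two_layer, one_layer)  # [6.5] [6.5]
```

Because the collapsed form is just a weight matrix times the input plus a bias, fitting it is equivalent in expressive power to a linear regression.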
Question 3:
Which of the following principles supports building an ML system with a Privacy by Design methodology?
A. Avoiding mechanisms to explain and justify automated decisions.
B. Utilizing quasi-identifiers and non-unique identifiers, alone or in combination.
C. Understanding, documenting, and displaying data lineage.
D. Collecting and processing the largest amount of data possible.
Correct answer: C
Question 4:
Your dependent variable Y is a count, ranging from 0 to infinity. Because Y is approximately log-normally distributed, you decide to log-transform the data prior to performing a linear regression.
What should you do before log-transforming Y?
A. Divide all the Y values by the standard deviation of Y.
B. Add 1 to all of the Y values.
C. Subtract the mean of Y from all the Y values.
D. Explore the data for outliers.
Correct answer: B
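Adding 1 is needed because a count can be 0 and the logarithm of 0 is undefined. A minimal sketch with made-up counts, using the standard library's `math.log1p` (which computes log(1 + y)) and its inverse `math.expm1`:

```python
import math

# Hypothetical count data that includes a zero.
counts = [0, 1, 9, 99]

# math.log(0) raises ValueError, so shift by 1 before taking the log.
log_y = [math.log1p(y) for y in counts]

# expm1 undoes the transform: exp(v) - 1.
recovered = [math.expm1(v) for v in log_y]
```

When predictions are made on the log scale, the same inverse transform maps them back to the original count scale.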
Question 5:
Which of the following approaches is best if a limited portion of your training data is labeled?
A. Dimensionality reduction
B. Probabilistic clustering
C. Reinforcement learning
D. Semi-supervised learning
Correct answer: D
Question 6:
Which of the following text vectorization methods is appropriate and correctly defined for an English-to-Spanish translation machine?
A. Using TF-IDF because in translation machines, we do not care about the order of the words.
B. Using Word2vec because in translation machines, we need to consider the order of the words.
C. Using Word2vec because in translation machines, we do not care about the order of the words.
D. Using TF-IDF because in translation machines, we need to consider the order of the words.
Correct answer: B
Question 7:
Which two of the following statements about the beta value in an A/B test are accurate? (Select two.)
A. The Beta value is the rate of type I errors for the test.
B. The statistical power of a test is the inverse of the Beta value, or 1 - Beta.
C. The Beta in an Alpha/Beta test represents one of the two variants of the A/B test.
D. The Beta value is the rate of type II errors for the test.
Correct answer: B, D
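The link between the Beta value (type II error rate) and statistical power can be illustrated with a one-sided z-test. This is a sketch under assumed inputs: the function name `z_test_power` and the effect size and standard error are hypothetical, and a real A/B test may use a different test statistic.

```python
from statistics import NormalDist

def z_test_power(effect, std_error, alpha=0.05):
    """Return (power, beta) for a one-sided z-test given a true shift `effect`."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)  # rejection threshold under H0
    # beta: probability of failing to reject H0 when the effect is real.
    beta = std_normal.cdf(z_crit - effect / std_error)
    return 1 - beta, beta

# Hypothetical uplift of 0.02 measured with a standard error of 0.008.
power, beta = z_test_power(effect=0.02, std_error=0.008)
```

With no true effect, the rejection rate falls back to alpha (the type I error rate), which is why option A confuses the two error types.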
Question 8:
Which of the following algorithms is an example of unsupervised learning?
A. Random forest
B. Neural networks
C. Ridge regression
D. Principal components analysis
Correct answer: D
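Principal components analysis is unsupervised because it uses only the features, never a target label. The sketch below finds the first principal direction of 2-D points from their covariance matrix, using the closed-form major-axis angle for a symmetric 2x2 matrix; the function name and the sample points are hypothetical, and a practical implementation would normally rely on a linear algebra library.

```python
import math

def first_principal_component(points):
    """Return the unit vector along the direction of greatest variance (2-D only)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Entries of the sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)
    # Angle of the major axis of the covariance ellipse.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    return (math.cos(theta), math.sin(theta))

# Hypothetical unlabeled points lying on the line y = 2x.
points = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
pc1 = first_principal_component(points)
```

For these points the returned direction is proportional to (1, 2), the line the data lies on. The other three options (random forest, ridge regression, and neural networks as typically trained) all fit against a labeled target.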
Nagano -
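You can study AIP-210 thoroughly, starting from the very basics.
It is full of illustrations, and the explanations are careful and easy to understand, so the material really sticks.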