How does the CRISP-DM (Cross-Industry Standard Process for Data Mining) methodology begin its process model?
A. Evaluation
B. Business Understanding
C. Model Building
D. Data Preparation
Correct answer: B
Question 2:
Which of the following is a critical first step in understanding a business problem for data science projects?
A. Deploying the model
B. Selecting the machine learning algorithm
C. Defining the project scope
D. Choosing the visualization tools
Correct answer: C
Question 3:
In the deployment phase, why is it important to know the different data sources available in Cloud Pak for Data?
A. To limit the deployment to only use local file storage
B. To ensure that all data sources are manually processed
C. Because only one type of data source can be used in any deployment
D. To effectively integrate and manage data from various sources for analysis and model training
Correct answer: D
Question 4:
When comparing models to choose the best one, which factor is least likely to be considered?
A. The color scheme of the model's output visualizations
B. The complexity of the model
C. The explainability of the model's predictions
D. The performance of the model on validation data
Correct answer: A
Question 5:
When selecting a small number of algorithms based on model requirements, what factor should you primarily consider?
A. Compatibility of the algorithm with the data characteristics and the predictive task.
B. The algorithm that requires the least amount of data preprocessing.
C. Choosing algorithms that are only based on supervised learning.
D. The popularity of the algorithm in recent academic papers.
Correct answer: A
Question 6:
In the case of imbalanced data, what technique is recommended to ensure that the train and test sets have similar distributions of the target variable?
A. Using only the majority class for splitting
B. Random split without considering the target variable
C. Splitting based on the order of data collection
D. Stratified split
Correct answer: D
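Illustration: the idea behind a stratified split can be sketched in a few lines of plain Python. This is a minimal hand-rolled sketch, not a library implementation; the function name `stratified_split`, its parameters, and the toy 90/10 label list are assumptions made for the example. In practice, scikit-learn's `train_test_split` accepts a `stratify` argument that serves the same purpose.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.25, seed=0):
    """Split indices so each class keeps roughly the same share in train and test.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)

    # Group sample indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        # Take the same fraction from every class (at least one sample each).
        n_test = max(1, round(len(indices) * test_fraction))
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Toy imbalanced target: 90 negatives, 10 positives (assumed data).
labels = [0] * 90 + [1] * 10
train_idx, test_idx = stratified_split(labels, test_fraction=0.2)

# Share of the minority class in each split.
train_pos = sum(labels[i] for i in train_idx) / len(train_idx)
test_pos = sum(labels[i] for i in test_idx) / len(test_idx)
print(train_pos, test_pos)
```

Because the same fraction is drawn from each class, the minority class keeps its 10% share in both sets, which a purely random split does not guarantee.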
Question 7:
Which Python library is commonly used for data manipulation and analysis, and is available in Cloud Pak for Data?
A. TensorFlow
B. Pandas
C. PyTorch
D. Keras
Correct answer: B
Question 8:
Cloud Pak for Data's integration with Spark allows users to:
A. Perform complex computations on small datasets only
B. Use Spark exclusively for data visualization purposes
C. Avoid using any form of data processing or analysis
D. Leverage distributed computing for processing large datasets efficiently
Correct answer: D
取池**:
I think this is a textbook for people self-studying for C1000-154. Thank you very much, Pass4Test.