When designing an experiment to compare the performance of two LLMs on a question-answering task, which statistical test is most appropriate to determine if the difference in their accuracy is significant, assuming the data follows a normal distribution?
A. Paired t-test
B. Mann-Whitney U test
C. Chi-squared test
D. ANOVA test
正解:A
解説: (Pass4Test メンバーにのみ表示されます)
質問 2:
What is 'chunking' in Retrieval-Augmented Generation (RAG)?
A. A technique used in RAG to split text into meaningful segments.
B. Rewrite blocks of text to fill a context window.
C. A method used in RAG to generate random text.
D. A concept in RAG that refers to the training of large language models.
正解:A
解説: (Pass4Test メンバーにのみ表示されます)
質問 3:
When comparing and contrasting the ReLU and sigmoid activation functions, which statement is true?
A. ReLU is less computationally efficient than sigmoid, but it is more accurate than sigmoid.
B. ReLU and sigmoid both have a range of 0 to 1.
C. ReLU is more computationally efficient, but sigmoid is better for predicting probabilities.
D. ReLU is a linear function while sigmoid is non-linear.
正解:C
解説: (Pass4Test メンバーにのみ表示されます)
質問 4:
In neural networks, the vanishing gradient problem refers to what problem or issue?
A. The problem of overfitting in neural networks, where the model performs well on the trainingdata but poorly on new, unseen data.
B. The issue of gradients becoming too large during backpropagation, leading to unstable training.
C. The problem of underfitting in neural networks, where the model fails to capture the underlying patterns in the data.
D. The issue of gradients becoming too small during backpropagation, resulting in slow convergence or stagnation of the training process.
正解:D
解説: (Pass4Test メンバーにのみ表示されます)
質問 5:
You have access to training data but no access to test data. What evaluation method can you use to assess the performance of your AI model?
A. Cross-validation
B. Greedy decoding
C. Randomized controlled trial
D. Average entropy approximation
正解:A
解説: (Pass4Test メンバーにのみ表示されます)