Which data format stores all of its data in a binary format, making the files more compact, and even adds markers to help MapReduce jobs determine where to break large files for more efficient processing?
A. Parquet
B. ORC
C. Sequence File
D. Avro
Correct answer: D
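The "markers" in this question are sync markers: fixed byte sequences written between blocks so that a reader can start at an arbitrary offset and find the next block boundary. The following is a minimal sketch of that idea only. It is not the real Avro container layout; the marker value, the length-prefixed record encoding, and the function names `write_blocks`, `split_points`, and `read_block` are all assumptions made for illustration.

```python
import hashlib
import struct

# Hypothetical 16-byte sync marker, made up for this sketch.
SYNC = hashlib.md5(b"demo-sync").digest()


def write_blocks(records, per_block=2):
    """Serialize records as length-prefixed bytes, ending each block with SYNC."""
    out = bytearray()
    for i in range(0, len(records), per_block):
        for rec in records[i:i + per_block]:
            data = rec.encode("utf-8")
            out += struct.pack(">I", len(data)) + data
        out += SYNC
    return bytes(out)


def split_points(blob):
    """Return the byte offsets just past each sync marker."""
    points = []
    pos = blob.find(SYNC)
    while pos != -1:
        points.append(pos + len(SYNC))
        pos = blob.find(SYNC, pos + len(SYNC))
    return points


def read_block(blob, start):
    """Decode the records from `start` up to the next sync marker."""
    end = blob.find(SYNC, start)
    records = []
    pos = start
    while pos < end:
        (n,) = struct.unpack_from(">I", blob, pos)
        records.append(blob[pos + 4:pos + 4 + n].decode("utf-8"))
        pos += 4 + n
    return records


blob = write_blocks(["a", "bb", "ccc", "dddd", "e"])
points = split_points(blob)
```

Each offset in `points` is a place where a separate task could begin reading without having seen the earlier bytes, which is what makes a large binary file splittable.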
Question 2:
Which of the following statements regarding Big R is TRUE?
A. A data analyst using Big R employs MapReduce programming principles
B. Unless specified otherwise, Big R automatically assumes all data to be integers
C. Big R's 'bigr.frame' is equivalent to R's 'data.frame'
D. When you execute the Big R "apply" function, Big R transparently extracts data out of HDFS into the Big R engine
Correct answer: B
Question 3:
A news organization wants to analyze all incoming news stories in real time and make them available to its users based on each user's interests. Given this requirement, which of the following would you recommend?
A. Netezza
B. Cloudant
C. Hadoop
D. Spark
Correct answer: D
Question 4:
Which of the following statements is TRUE regarding Cloud deployment models?
A. Performance and scalability requirements are a critical factor for deciding between Platform as a Service and Infrastructure as a Service deployment models
B. In an Infrastructure as a Service deployment, the cloud provider provides security patching, monitoring and failover capabilities
C. Applications with extremely high transaction volumes are good candidates for Platform as a Service
D. In a Platform as a Service offering, the customer has root access to the servers
Correct answer: A
Question 5:
"The programming model for client developers will hide the complexity of interfacing to legacy systems" is an example of which of the following?
A. A client imperative
B. An empathy statement
C. A use case
D. An architectural decision
Correct answer: D
Question 6:
You need to create an online repository for a company. The data includes PDFs, documents, HTML, images, etc. The total data volume is approximately 1 PB. The online repository must be highly available. You are required to propose the least costly solution. Which technology would be preferred for this requirement?
A. RDBMS
B. Hadoop
C. IBM InfoSphere Streams
D. Spark
Correct answer: B
Question 7:
It's helpful to look at the characteristics of big data along certain lines - for example, how the data is collected, analyzed and processed. There are many characteristics to consider. Which one of the following is NOT a characteristic that should be considered?
A. Software
B. Data source
C. Processing methodology
D. Data frequency and size
Correct answer: A
Question 8:
Which of the following is the section of the Component Model that details how the solution integrates?
A. Component Interface Diagram
B. Component Interaction Diagram
C. Component Reaction Diagram
D. Component Relationship Diagram
Correct answer: D