Databricks spark architecture question analysis
posted on 03 Sep 2020
As for the basics of the Spark architecture, the following concepts are assessed by this exam:
In addition, candidates are asked to apply their knowledge of the following to make optimal decisions when working with Spark. Candidates should be able to interpret how these topics affect a Spark session and how they can use them to improve performance.
What something is or what something does:
Which of the following describes a worker node?
Your knowledge of what something is or what something does.
Which of the following describes the relationship between worker nodes and executors?
Your ability to predict results (i.e. if “x” occurs, what happens?).
If Spark is running in cluster mode, which of the following statements about nodes is correct?
Your ability to categorize ideas/things.
What of the following DataFrame operations is always classified as a narrow transformation?
DataFrame.select()
DataFrame.sort()
DataFrame.distinct()
DataFrame.join()
DataFrame.repartition()
See picture below.
See picture above.