There is a lot of talk about Big Data systems, and many new system architectures with different functionalities have come up. I am sharing my knowledge of the layered architecture of some Big Data systems. I plan to cover these topics in depth:
1. HDFS (Hadoop Distributed File System),
2. YARN (Yet Another Resource Negotiator),
3. Apache Spark,
4. Spark Streaming, Spark SQL
I will cover each topic with a system overview, its history, its architecture, and where and how it is used in practice, along with some useful links for more information.