· Definition: Big Data, NoSQL
· The need for Big Data technology
· Traditional technologies vs. Big Data technologies
· Big Data project requirements
· Big Data Project workflow
· Big Data project definitions
· Data sources & development resources
· Big Data technologies evaluation
· Streaming Concept
· Apache Kafka
· Apache Flume
· ELK Stack (Elasticsearch, Logstash, Kibana)
· What is Hadoop?
· Hadoop Architecture
· Hadoop File System (HDFS)
· Hadoop MapReduce
· Apache YARN
· Apache Oozie, ZooKeeper
· Project non-functional support
· Hadoop Distribution
· Hadoop as a service
· Can a Big Data project switch environments?
· Hadoop deployment requirements
· Hadoop Performance Best Practices
· POC environment
· Using Apache Pig & Apache Sqoop for POC
· Apache Storm
· Apache Spark
· Big Data – Development methodologies
· ETL development cycle & deployment
· Test Cycle
· Key-Value Stores
· Column Family Stores (Wide Column Stores)
· Document Databases
· Graph Databases
· Mathematical Graph as a DB
· Product logic
· Apache Hive
· Apache Impala
· Big Data to OLAP
· BI Visualization
· Scaling BI over Big Data
· Big Data – system ATP
· Trends & Conclusions
· Q&A
· Course Evaluation