# Summary

- [推荐系统lambda架构](README.md)
- [day05 hadoop框架及其子组件HDFS](day05/ha.md)
  - [1.hadoop框架](day05/ha1.0.md)
    - [1.1 什么是Hadoop](day05/ha1.1.md)
    - [1.2 hadoop核心组件](day05/ha1.2.md)
    - [1.3 hadoop优势](day05/ha1.3.md)
    - [1.4 hadoop生态系统](day05/ha1.4.md)
    - [1.5 hadoop发行版本的选择](day05/ha1.5.md)
  - [2.分布式存储系统hdfs](day05/ha2.0.md)
    - [2.1 HDFS概述](day05/ha2.1.md)
    - [2.2 HDFS架构](day05/ha2.2.md)
    - [2.3 HDFS环境搭建](day05/ha2.3.md)
    - [2.4 HDFS shell操作](day05/ha2.4.md)
    - [2.5 python操作HDFS](day05/ha2.5.md)
    - [2.6 HDFS读写流程及其常见问题](day05/ha2.6.md)
    - [2.7 HDFS的优缺点](day05/ha2.7.md)
- [day06 YARN和MapReduce](day06/ha3.md)
  - [1.资源调度框架YARN](day06/ha3.0.md)
    - [1.1 什么是YARN](day06/ha3.1.md)
    - [1.2 分布式处理框架MapReduce](day06/ha3.2.md)
    - [1.3 Hadoop Streaming 实现单词统计](day06/ha3.3.md)
    - [1.4 利用MRJob编写和运行MapReduce代码](day06/ha3.4.md)
    - [1.5 join文件合并](day06/ha3.5.md)
  - [2.hadoop加强](day06/ha4.0.md)
    - [2.1 HDFS元数据管理](day06/ha4.1.md)
    - [2.2 安全模式](day06/ha4.2.md)
    - [2.3 HadoopArchives](day06/ha4.3.md)
    - [2.4 HadoopHighAvailability](day06/ha4.4.md)
    - [2.5 HadoopFederation](day06/ha4.5.md)
- [day07 flume和Kafka](day07/f.md)
  - [1.flume](day07/f1.0.md)
    - [1.1 实时流处理概述](day07/f1.1.md)
    - [1.2 flume概述](day07/f1.2.md)
    - [1.3 flume采集系统的结构图](day07/f1.3.md)
    - [1.4 Flume安装部署](day07/f1.4.md)
    - [1.5 flume简单案例](day07/f1.5.md)
    - [1.6 Flume实战案例](day07/f1.6.md)
    - [1.7 flume的负载均衡和容错](day07/f1.7.md)
  - [2.Kafka](day07/k1.0.md)
    - [2.1 Kafka概述](day07/k1.1.md)
    - [2.2 Kafka的安装部署](day07/k1.2.md)
    - [2.3 Kafka Python API](day07/k1.3.md)
    - [2.4 Kafka与flume整合](day07/k1.4.md)
    - [2.5 Kafka的整体结构图](day07/k1.5.md)
- [day08 spark-core](day08/s.md)
  - [1.spark-core](day08/s1.0.md)
    - [1.1 spark概述](day08/s1.1.md)
    - [1.2 spark-core概述](day08/s1.2.md)
    - [1.3 如何生成RDD](day08/s1.3.md)
    - [1.4 RDD的三类算子](day08/s1.4.md)
    - [1.5 SPARK RDD开发实战](day08/s1.5.md)
    - [1.6 利用spark进行其他常见分析](day08/s1.6.md)
- [day09 spark-sql概述](day09/s.md)
  - [1.spark-sql](day09/s1.0.md)
    - [1.1 spark-sql概述](day09/s1.1.md)
    - [1.2 DataFrame](day09/s1.2.md)
    - [1.3 JSON数据的处理](day09/s1.3.md)
    - [1.4 物联网实战](day09/s1.4.md)
    - [1.5 数据清洗](day09/s1.5.md)
- [day10 Spark-Streaming与Spark-mllib](day10/s.md)
  - [1.Spark Streaming](day10/s1.0.md)
    - [1.1 Spark-Streaming概述](day10/s1.1.md)
    - [1.2 DSteam的操作](day10/s1.2.md)
    - [1.3 Spark-Streaming编码实战](day10/s1.3.md)
    - [1.4 Spark Streaming对接Kafka](day10/s1.4.md)
    - [1.5 Spark-Streaming的状态操作](day10/s1.5.md)
    - [1.6 Spark-Streaming与外部数据源交互](day10/s1.6.md)
    - [1.7 Spark-Streaming对接flume](day10/s1.7.md)
  - [2.spark-mllib](day10/ml1.0.md)
    - [2.1 初识spark-mllib](day10/ml1.1.md)
    - [2.2 逻辑回归实战-数据预处理](day10/ml1.2.md)
    - [2.3 逻辑回归实战-数据筛选](day10/ml1.3.md)
    - [2.4 逻辑回归实战-模型训练和模型评估](day10/ml1.4.md)