CDH培训——ClouderaDeveloperTrainingforSparkandhadoop

Cloudera Developer Training for Spark and hadoop

目前创新互联已为数千家的企业提供了网站建设、域名、虚拟空间、网站运营、企业网站设计、合阳网站维护等服务,公司将坚持客户导向、应用为本的策略,正道将秉承"和谐、参与、激情"的文化,与客户和合作伙伴齐心协力一起成长,共同发展。

Course Time:2016年6月27-30日

Course Location:上海市 浦东新区 张江高科 伯克利工程创新中心

Contact us:400-679-6113

QQ:1438118790

Certification:CCA-175

Learn how toimport data into your Apache Hadoop closter and process it with spark、hive、flume、sqoop、impala and other Hadoop ecosystem tools.

Audience and Prerequisites

This coursedesigned for developers and engineers who have programming experience. Apachespark examples and hands-on exercises are presented in Scala and Python, so theability to program in one of those languages is required. Basic familiaritywith the Linux command line is assumed. Basic knowledge of SQL is helpful. Priorknowledge of Hadoop is not required.

Course outline:DeveloperTraining for Spark and hadoop

  • Introduction to Hadoop and the Hadoop ecosystem

  • Hadoop architecture and HDFS

  • Importing relational data with Apache spoop

  • Introduction to impala and hive

  • Modeling and managing data with impala and hive

  • Data formats

  • Data partitioning

  • Capturing data with Apache flume

  • Spark basics

  • Working with RDDs in spark

  • Writing and deploying spark applications

  • Parallel programming with spark

  • Spark caching and persistence

  • Common patterns in spark data processing

  • Preview:spark SQL


当前名称:CDH培训——ClouderaDeveloperTrainingforSparkandhadoop
链接分享:http://pwwzsj.com/article/jeheic.html