Amazon Web Services (AWS) - Big Data
Amazon Web Services - Big Data training will introduce participants into the world of Big Data processing using AWS platform. Course will cover practical knowledge about frameworks like Hadoop, Pig, Hive and AWS services, which enable processing of huge number of data in large, distributed clusters like EMR, DynamoDB, RedShift or Amazon Kinesis. And all this is based on interesting practical exercises.
Course agenda
- Public Compute Cloud and Big Data introduction
- Amazon Web Services product suite supporting Big Data: EMR, Kinesis, DynamoDB i RedShift
- Introduction to Hadoop, Hive, Pig and Streaming
- Introduction
- MapReduce, Hadoop and HDFS
- Architecture and Instance Groups
- Instance types
- Using Hive within EMR
- Using Pig within EMR
- Using Impala within EMR
- Storing data with HBase
- Data Visualization
- Pricing
- Introduction
- NoSQL
- Data types and supported operations
- Consistency
- DynamoDB API
- DynamoDB and Big Data, EMR integration
- Pricing
- Introduction
- Kinesis and Big Data, integration with EMR
- Configuration
- Pricing
- Introduction
- RedShift and Big Data
- Configuration
- Pricing
- Introduction
- CloudWatch Architecutre
- Alarms and metrics
- Using CloudWatch
- Pricing
- Each training ends with test.
- 3 days
- Kraków, Katowice, Wrocław
- Lecture + Workshop
- Training is designed for Big Data architects and developers. Programming and database knowledge is required. Java knowledge is recommended.
- After accomplishing the course participants will have hands-on knowledge about Big Data processing, MapReduce approach, EMR service, Hadoop, Pig and Hive. They will know how to provision Big Data environments and how to use DynamoDB, Redshift and Kinses for data processing.