My Big Data Analytics Skills

 

→ 1. Data Preparation

  • Planning
  • Data Collection
  • Data Selection

→ 2. Tools for Data Preparation

  • SQL DB / NoSql DB / MySQL DB
  • Key / Value pair
  • MongoDB
  • Cassandra
  • Graph DB’s (Neo4j)

→ 3. Data Preparation – Import/Export

  • Sqoop
  • Flume

→ 4. Pre-Processing

  • Data Cleaning
  • Data Filtering
  • Data Completion
  • Data Correction
  • Data Standardization
  • Data Transformation

→ 5. Tools for Data Pre-Processing

  • Data pre-processing using Pig
  • Writing Pig Latin scripts and processing data
  • Data pre-processing using Hive
  • Writing Hive Scripts and processing data

→ 6. Data Analysis

  • Recommendation
  • Classification
  • Clustering
  • Mahout

→ a. Recommendataion

  • Making recommendations, various techniques

→ b. Classification

  • Classification process
  • Naive Bayes Classifier
  • Decision Trees

→ c. Clustering

  • Clustering basics and fundamentals
  • Hierarchical Clustering
  • K-Means Clustering
  • Exploring Distance Measures

→ 7. Data Visualization using SAS or R

  • Basics and Fundamentals of Data Visualization
  • Data Frames
  • Vectorized operations on Data Frames
  • Selection
  • Projection
  • Transformation
  • Graphs

 

 

Advertisements