Big Data Analytics - Petta Bytes Big Data Analytics - Petta Bytes
Open Contact Form
Please Contact Us

Your Name (required)

Your Email (required)

Your Mobile Number (required)


Your Message


Big Data Analytics

Big Data Analytics

A 200 Hours course that provides you hands on experience on R, Hadoop, Spark and MongoDB with practical exposure to business problems and case-studies

Duration: 200 hours

Fee (INR): 85,000 + Service Tax

Upcoming Batches

Location: Petaa Bytes Analytics Pvt. Ltd., Andheri (W). D. N. Metro Station, Near Star Bazaar

Mode: Class Room

From: 05-Aug-2017 10:00am (+05:30)

Fees: Rs(INR) 85,000 + Service Tax

Note: Week-End Batch. Course Duration 200 Hours (4 Months)

Course Objectives

The Big Data Analytics course is designed specifically to provide you with practical experience sharing and training on Big Data Analytics. As part of the course we will cover the following:

  • Big Data Technologies – Hadoop, Spark & MongoDB (NoSQL)
  • Data Mining Tool – R 
  • Data Mining Techniques –
    • Neural Network
    • Random Forest
    • Classification Tree
    • Logistic Regression
    • Clustering
    • Linear Regression
  • SQL Programming
  • Business Domain Understanding – Retail Banking Analytics

Course Structure

The course is structured a well-rounded perspective on Data Science by finely covering

  • Business Domain Knowledge
  • Analytical Tools & Big Data Technologies
  • Statistics & Data Mining Techniques

Who Should Attend?

Any one who wish to make a career as Data Scientist. Candidates having familiarity with mathematics and some coding background are likely to benefit more from this course.

Before enrolling for the course, we honestly recommend you read our blog “-->Is Analytics Career for me?

Download Brochure


Tools Used

  • Hadoop (Map Reduce, Pig, Hive, HBase, Sqoop, Flume, Oozie)
  • Spark, MongoDB, R, R Studio, SQL, MS Excel

Industry Testimonials 

"I live in the US, and am a Finance consultant, with an IT background. Technology is the driver today for the finance industry and it helps for us consultants to know technology better, which helps the business better. Big Data is slowly taking over older forms of market and customer intelligence initiatives in the banking industry here. I was looking to gain knowledge and significant competitive advantage but did not want to spend thousands of dollars on courses in the US, which are led by academics, which is not always practical. A friend told me about Petaa Bytes' innovative global delivery at almost half of the cost I would've paid here in New York. I enrolled and completed courses for Hadoop and Apache Spark. Shrikant understands the need of the student, he trained me from a business perspective, not a developer per se. It has helped me gain a new customer. I would recommend Petaa Bytes to anyone interested in putting a foot forward in this exciting new world-"

- Amit Sonawane, Consultant, JP Morgan, USA


Student Testimonials

"If anybody wants to experience a sound learning aura and that too from a calm and composed personality like Shrikant sir, please go for Petaa Bytes Training. I joined here back in August 2015 to learn Hadoop and this has been my best learning experience in life so far. The best part of the course was personal attention given to me and to each and every student, which is very difficult to find these days when you join a training institute. I was also given career advice which helped me in refining my doubts for my career in Big Data field. Now that I am confident enough with what I learnt here, just waiting to be called in an arena(interviews) where I can showcase my knowledge"

- Mohit Sudhera, TCS, Mumbai

"It was an outstanding classroom training felicitated by Mr Shrikant Gawande. This happens to be one of my best training experiences where he mixed up the technological aspects with real life scenarios in a manner which eased our pressure of learning something really new. He also used his vast industrial exposure to a great extent and understand the real life needs of the technicalities. I would refer his name to have the training on Big Data and Hadoop which shall provide anyone with a perfect blend of conceptual learning and hands on detailing"

- Abhishek Das, Cap Gemini, Mumbai

"Petaa Bytes was one of the better technology trainings I have attended in my 14 years of IT career. Having attended a lot of corporate and professional trainings, the differentiating factor was the staff and the curriculum. Shrikant, being an accomplished IT professional himself, can relate to the problems that are faced during project implementations and hence that attribute is added to the way he conducts the trainings and makes us aware of the industrial application of this new technology. Equipped with this knowledge, I look forward to being a part of the Big Data thought leadership teams in future"

- Amit G, Congizant, Mumbai

Course Content in Detail

Main Head Description Hours
SQL What is database? SQL Programming Basics (Insert, Update, Delete & Select Queries), Normalization & Denormalization 8
R Introduction to R, Data Structures, Importing – Exporting Data, Data Manipulation, Sorting, Merging, Aggregating, Functions, Programming Structures, Charts & Graphs 20
Hadoop HDFS Architecture, Hadoop Multinode Installation,Map Reduce, Advanced Map Reduce, Multiple Input Formats, Apache Pig, HIVE, NoSQL (Hbase), Hadoop 2.0 YARN, Sqoop, Flume, Apache Oozie, Setting up Hadoop  on Cloud using EC2. 60
Spark Scala programming language, Spark Eco System, RDD(Transformations, Actions, Loading Data, Key-Value Pair, MapReduce) Spark Streaming, GraphX, SparkSQL and Performance Tuning in Spark 40
MongoDB  Architecture of mongoDB and Design Goals. Introduction to JSON and BSON, CRUD Operations, Scalability and Availability , Indexing and Aggregation Framework 24
Retail Banking Analytics Retail Banking Product Overview – Liabilities, Assets & Cards, Application of Analytics in Risk & Marketing Functions, Customer Lifecycle Management 12
Data Mining Techniques  Data Mining Introduction, Supervised & Unsupervised Learning Techniques, CRISP-DM (Data Mining Process), Basic Statistics & Number Skills, Linear Regression, Logistic Regression, Clustering, Classification Tree, Random Forest, Neural Networks  35
  Total 200

What do you get from Course?

  • Training from industry expert.
  • Hands-on experience
  • Training Presentations and data files. 
  • Interaction with faculty outside the class on email / phone
  • Attend the training sessions and you get Certificate of Participation
  • Complete and submit all the graded assignments and you get Certificate of Completion 

Contact Details?

Email :;

Phone : +91- 8939694874 / 7506523339

Partner Website :

Who will be my Instructors?

Your primary trainers are people from industry having 15 years of industry experience

Shrikant Gawande

  +91-98196 52958

Cloudera Certified

Rajesh Jakhotia

  +91-93228 94874

  -->LinkedIn Profile