1. Big Data [Hadoop] Developer Module
Note:
- The participants will be extracting and configuring Hadoop and all eco-systems on a Ubuntu 14.4 VMWare Image. Initially exposure to Cloudera Image will also be given.
- The above 2 images are going to be with the participants for life.
- Understanding of how Hadoop works on AWS will also be explained and the participants will have to register with AWS for working on the cloud.
Objective
The participants will learn the Installation of Hadoop Cluster, understand the basic, advanced concepts of Map Reduce and the best practices for Apache Hadoop Development as experienced by the Developers, Architects and Data Analysts of core Apache Hadoop. They will also learn the following during the duration of the course
- Hadoop Ecosystem
- Best programming practices for Map Reduce
- System administration issues with other Hadoop projects such as Hive, Pig, and Scoop
- Configuration Map Reduce environment with Eclipse IDE
- Running MR Unit Tests on MR Code
- Advanced Map Reduce Algorithms and techniques
- Working with Pig and HIVE
- Working with NoSQL with emphasis on HBase
- Understanding Sqoop
- Quick Overview of CDH and HDP
Note: The course will be have 40% of theoretical discussion and 60% of actual hands on
Duration:
30 ~ 32 hours
Audience
This course is designed for anyone who is
- Wanting to architect a project using Hadoop and its Eco System components.
- Wanting to develop Map Reduce programs
- A Business Analyst or Data Warehousing person looking at alternative approach to data analysis and storage.
Pre-Requisites
- The participants should have at least basic knowledge of Java.
- Any experience of Linux environment will be very helpful.
Course Outline
- What is Big Data & Why Hadoop?
- Big Data Characteristics
- Challenges with traditional system
- Use Cases of Big Data
- Hadoop Overview & Ecosystem
- Anatomy of Hadoop Cluster
- Eco-System Components
- Hadoop Architecture
- Components in Hadoop
- Interaction between different Components
- Basic Understanding of each component
- HDFS – Hadoop Distributed File System
- Name Nodes and Data Nodes
- Hands-On Exercise
- Steps in File Write in to HDFS and File Read
- Pseudo – Cluster Distribution of Vanilla Hadoop
- Hadoop 2.7.2 Extraction and Installation
- Configuration / XML Files
- Formatting the NN and starting the services
- File Ingestion in to HDFS
- Location of Metadata and Data
- Map Reduce Anatomy
- Why Map Reduce Paradigm
- How Map Reduce Works?
- 6 Phases in a Map Reduce Process
- YARN
- What is YARN?
- Architecture of YARN
- What happens on an Application Submission?
- Teragen and Terasort examples
- Map Reduce Examples
- Setting up Eclipse Development Environment
- Creating Map Reduce Projects,
- Word Count Example in Eclipse
- Code Walk through
- Find the Max Temp from a Dataset
- Advance Map Reduce
- Using Combiners
- Using Partitioner
- Unit testing the code.
- Using Hive
- Hive as a Data Warehouse
- Creating External & Internal Tables plus Loading Data
- Writing HSQL queries for data retrieval
- Creating Partitions of data in HDFS via Hive
- Creating Buckets of data in Hive and selecting Data
- Writing Custom UDF code
- Re-directing results to a file in HDFS
- Different File Formats
- Using Pig
- Why Pig and its benefits
- Word Count using Pig
- Loading data into PigStorage
- Querying data from PigStorage
- ETL Example
- Semi-Structured data example
- Sqoop
- Why Sqoop
- Importing and exporting data from using RDBMS
- Incremental Imports
- Hadoop Best Practices and Use Cases
- No-Sql Introduction
- What is NoSQL?
- Variation of NoSQL
- Advantage of Columnar Database
- HBase
- HBase Overview and Architecture
- HBase v/s RDBMS
- HBase Table Design
- Column Families and Regions
- HBase Java API code
- HBase Installation
- HBase shell commands
- Understanding Cloudera Distribution
- What is CDH?
- Components in CDH
- Understanding Horton Works Distribution
- What is HDP?
- Components in HDP
Take Away from the Course
- Understanding of What and Why of Hadoop with its Eco-System Components.
- Ability to write Map Reduce programs in a given scenario
- Ability to correctly architect and implement the Best Practices in Hadoop Development
- Ability to Manage and Monitor Hadoop
- Ability to Manage the different Hadoop Components when talking to each other.
Hadoop Trainings
1) JPMC – 18 batches of Hadoop Developer and 2 batches of Hadoop Admin
2) Edureka – 24 batches of online Hadoop Developer Training
3) Microsoft India – 5 batches of HD Insight Developer & 2 batches of Analyst
4) SAS – 3 batches of customized Hadoop Developer – Analyst training
5) 20 + Hadoop Trainings via Nichetek Inc, Zarantec and Cavalier IT
Awanish said
Hi
I would like to know the details about this course(EJB&Struts) like when new batch will be commencing and what will be the timing. You can contact me at 9004428298 or drop me an email.
Thanks
Awanish
Shraddha Rane said
Sir I would like to know the details about the course Core and advance Java, EJB&Struts and Hibernate as topics and when next batch will be starting from and timings. Please send me an email about the same.
Mohan Seth said
Sir I would like to know the details about the course EJB&Struts as in teh topics and when nxt batch will be starting from and timings. Pls send me an email about teh same.
Regards,
Mohan.
Mohan Seth said
Sir I would like to know the details about the course EJB&Struts as in teh topics and when nxt batch will be starting from and timings. Pls send me an email about teh same.
Regards,
Mohan.
Sushant Pawar said
Hi Sir,
I would like to know whether you provide trainning of Big data (Hadoop) administration.
Regards
Sushant
admin said
Hi Sushant, I very much provide training on Big Data Hadoop. where are you located?If you can call me on 9821422745 afte 8.30pm on weekdays, I can give you the options and we can take it forward. Looking forward to speaking with you. Thanks
Suresh said
Hi,
I would like to know the Hadoop training details like whether it is online or class room training , course details and fee structure
Thanks
Suresh
admin said
Hi Suresh, depending on your location I can give you the options. could you please call me on 9821422745 after 8.30pm any weekday and we can work it out. thanks and look forward to speaking with you.
Jayanth Iyer said
Hello Sir,
I am interested in the Hadoop Developer module. Please let me know if there are any batches post September 2014.
Thanks,
Jayanth
admin said
Jayanth, There is a batch starting this sunday 2nd august. Post this there will be one in October. Do give me a call in september to check for the same. thanks
Gaurav Keswani said
Interested in the Hadoop training. Please let me know about the next batch you will be starting 🙂
admin said
Hi Gaurav,
My next Hadoop batch is scheduled to start on 4th October.
Its a saturday sunday batch in the morning from 10.30am to 2.00pm.
If you are interested to join the same, please call me on 9821422745 or sms me and we can discuss.
Best,
Venkat
Gaurav Keswani said
Hi Venkat,
Just my 2 cents. 4th October is a long weekend and a lot of us will be going out. Is it possible for you to start the course from the next weekend?
Thanks,
Gaurav
Harshad said
Hello,
I would like to enquire about Hadoop courses that are scheduled in coming months.
I stay in Mumbai. Please let me know the course date and fees for the same.
Regards,
Harshad.
admin said
Hi Harshad
Pleased to receive your message.
My next Hadoop batch is scheduled to start on 4th October.
Its a saturday sunday batch in the morning from 10.30am to 2.00pm.
If you are interested to join the same, please call me on 9821422745 or sms me and we can discuss.
Best,
Venkat
Amit said
Dear sir plz suggest me who can do Big Data [Hadoop] Developer course.
i mean to say what are requirements to learn Big Data [Hadoop] Developer course.
Thank you
admin said
HI Amit,
I had missed your email before and hence this late reply. Anyone who is in the Data Space needs to work on Hadoop sooner or later and needs to learn that. So folks DB Admins, Data Warehouse folks all of them would need Hadoop.
Cheers
Venkat
Clyde Lobo said
Is knowledge of java a pre requisite for enrolling in this course?
devendra Thomare said
Hello,
could you please let us know if there is any batch for Hadoop starting in Feb/March.
could you please let us know schedule(weekend/weekdays/timings) and fee structure.
Thanks,
Devendra
Manthan Ginoya said
Hi,
I am interested in joining the Hadoop Developer classes. Can you let me know when the next batch is commencing.
Also, would like to know if there is any prerequisite to entering the course. I have experience in Java and PL/SQL.
sharat said
Hi, would like to know if there are any hadoop course scheduled for weekends
Ankur garg said
Hi,
I am a Java developer and into Cognos also.Currently i am looking for hadoop training from beginning to advanced level.Can you please share the batch timings, venue and cost for the same.
Thanks
Ankur
Pavan said
Hi Venkat..
Please let me know if there are any new Bigdata courses in March/April.
I am interested to join. Do let me know.
-Thanks.
Pravin said
Hi Sir,
I want to know the information about hadoop batches ,timings and course fees.i had completed dot net course from you in may 2012.So looking for job in(Hadoop) this field.
Shailesh said
Hi Venkat,
Please let me know if there are any new Bigdata weekend courses in April.
I am interested to join. Do let me know.
Thanks
veda vikash said
I would like to know more details of hadoop training and its fee strucutre. I have attended the basic hadoop training from Venkat already. I would like to dig deeper more into hadoop.
Shashank More said
Hello Sir,
i am shashank, currently having 3+ yrs of exp in informatica … as career booster can hadoop gives me a good exposure ??? or is it relevant bcoz i am ETL developer (DWH) ??? …
Naveen Jain said
Hello Venkat, I would like to join Hadoop batch, in May / June.
I have 10 yrs. of exp on all DB’s like SQL Server, SYbase, Oracle and currently I am working onRainStor archival DB as well.
But i don’t have any knowledge on JAVA, for Hadoop…do i need to learn JAVA first ?
Could Please provide me complete details …
Sanjeev Pandey said
Hi Sir,
I am interested in learning Big Data Hadoop. Please let me know, when you are going to start new batch for the same.
–Sanjeev
Honey said
Hi,
Sir when is your next batch would be starting for hadoop . And is there any classes on weekends .
Thanks & regards
Honey jain
Paresh said
Hi Venkat ,
I have 10 yrs of exp in manual testing and would like to learn hadoop. Could you please let me know how I can start. ? Like if I’m a tester then to what area of hadoop I can learn? I.e. hadoop developer or Big data analyst ? Also to learn hadoop is java knowledge is required ? I don’t have any knowledge of java.
Could you please let me know when your hadoop new batch is starting ?
Waiting for your reply.
Thanks ,
Paresh
Bhagesh said
Hello Sir,
Wants to learn Hadoop. Can you tell me when your batch is starting for the same?
I am basically from testing background (almost 8 years of experience) and not having knowledge in JAVA and UNIX.
From testing point of view, I am looking for learning a Hadoop and it’s eco-system (Pig, hive & HBase). Are you covering / giving more hands on experience during the course on these eco-system which helps to the testers / Data Analyst and move our carrier from manual testing to Hadoop – Big data analyst?
Waiting for your reply.
Thanks
Ashish said
I am looking for Big Data and Hadoop developer course .I saw your course module looks very impressive for me .Request you to contact me for the same .Number is +918898394018
Harshada said
Hi sir
I am Java developer can you u suggest me which course is suitable for me. I want to do Hadoop plus Java developer.as per my knowledge Hadoop is file system which access by Java code. If possible I want batch in malad west for weekdays or sat sun in weekend
admin said
Hi Harshada,
I only do weekend batches and you should do Hadoop Developer module.
Cheers
Venkat
Pooja said
Hi sir,
I want to learn hadoop…so please tell me is there any weekend batches in month of june.
Nagesh said
what are the prerequisites for hadoop..do u need prior programming knowledge and linux knowledge
when will next batch for hadoop will start
Afroz said
Hi Sir,
Myself Afroz, living in Pune. I’m interested in joining the Hadoop developer course and would like to when will the new batch starts and the fee structure for the course.
Thanks,
Afroz