online haddop from easylearning guru

Information about online haddop from easylearning guru

Published on August 6, 2014

Author: easylearning



PowerPoint Presentation: Welcome to the World of Big Data & Hadoop Agenda : Agenda What is Big Data ? Different Kinds of Big Data Big Data Global Market Hadoop Global job trends What is Hadoop ? What is Big Data?: What is Big Data? Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. Types of Big Data ?: Types of Big Data ? Traditional RDBMS deals with only Structured data. Need of a technology which deals with Semi-structured data, Unstructured data and Structured data as well Semi-Structured Data The 3V’s of Big Data: The 3V’s of Big Data Sources of Data: Sources of Data Social Media & Networks (All of us are generating data) Mobile Devices (Tracking all the objects all the time) Sensor Technology & Networks (Measuring all kinds of data) Scientific Instruments (Collecting all sorts of data) PowerPoint Presentation: Where Big Data is used ? PowerPoint Presentation: Facebook Scenario Facebook on an average generates 70 thousand MB in 1 minute. 1 hour = 70,000 MB *60 = 4.2 Million MB 1 Day = 4.2 Million *24 MB = 10.8 Billion MB = 98438 GB 1 week = 6.9 thousand GB = 690 TB 4 weeks = 690 TB * 4 = 2756 TB = 2.7 PB 52 weeks = 2.7 PB * 52 = 143.3 PB And that’s aloooooooooot of data ! Various Bigdata Technologies: Various Bigdata Technologies Big Data Global Market: Big Data Global Market Sources : Dice, LinkedIn. Hadoop Global Job Trends: Hadoop Global Job Trends Top Hadoop Technology Companies Sources : Dice, LinkedIn. More than 17,000 employees with Hadoop skill across these companies PowerPoint Presentation: Sources : Dice, LinkedIn. Hadoop Global Job Trends What is Hadoop ?: What is Hadoop ? Hadoop was created by Doug Cutting and Mike Cafarella. Hadoop provides the reliable shared storage and analysis system. It is designed to scale up from a single server to thousand of machines, with a high degree of fault tolerance. Hadoop History: Hadoop History Hadoop Core Components: Hadoop Core Components Core Hadoop has two main systems: Hadoop Distributed File System:   The Hadoop file system is a Distributed file system which holds the large amount of data across multiple nodes in a cluster. MapReduce : MapReduce is a distributed programming paradigm used to analyze the data in the HDFS. Hadoop Distributed File System (HDFS): Hadoop Distributed File System (HDFS) A given file is broken down into blocks ( default=64MB), then blocks are replicated across cluster (default=3 ). Optimized for throughput. HDFS allows you to put/get/delete files. Follows the philosophy “Write Once and Read Multiple times” Block Replication for: - Durability, High Availability and Throughput . PowerPoint Presentation: MapReduce Flow MapReduce Framework: MapReduce Framework Map Reduce works by breaking the processing into two phases : Map Phase and Reduce Phase. PowerPoint Presentation: PowerPoint Presentation: What we offer… PowerPoint Presentation: PowerPoint Presentation: Syllabus Introduction Big Data Hadoop Hadoop HDFS MapReduce PIG Pig 1 Pig 2 Hive Hive 1 Hive 2 Hbase Zookeeper Sqoop Yarn Project Class PowerPoint Presentation: Thank you for watching the Live Demo for Hadoop. You can always contact us on: Your queries are always welcome. Phone : +91 124 4763660 (India) Email : [email protected] Skype Id : Website :

Related presentations

Other presentations created by easylearning

Java Essentials For Hadoop
25. 08. 2014

Java Essentials For Hadoop