HADOOP FOR USERS TRAINING

Hadoop for Users Training
Friday, 7 May 2010

The Hadoop for Users training is organised by the Infocomm Development Authority of Singapore (IDA) as part of the capabilities development activities in Cloud Computing for Singapore under the Open Cirrus Cloud Computing Testbed Initiative.

About Open Cirrus Cloud Computing Testbed

In July 2008, IDA Singapore became a Centre of Excellence for Cloud Computing, in partnership with Hewlett-Packard, Yahoo! and Intel, where it creates opportunities for research and development in Cloud Computing, enhances local capabilities and enables users to gain easy access to this next generation service.

About Hadoop

Hadoop is a framework for running applications on large clusters built of commodity hardware. The framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work, each of which may be executed or re-executed on any node in the cluster. In addition, it provides a distributed file system that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both Map/Reduce and the distributed file system are designed so that node failures are automatically handled by the framework.

Applications of Hadoop can be found in Internet scale data intensive applications, such as distributed grep, distributed sort, web link-graph reversal, term-vector per host, web access log stats analysis, inverted index construction, document clustering, machine learning, machine translation, natural language processing and matchmaking. Users of Hadoop include Yahoo!, eBay, Amazon, Facebook, and NYTimes.

Training Conducted by

The main instructor is Mr. Napat Chalakornkosol of the National Grid Office at IDA.

Agenda

9:00am - 10:30am Introduction to Cloud Computing & Hadoop
10:30am - 11:00am Tea break
11:00am - 12:30pm Setup Hadoop Cluster
12:30pm - 1:30pm Lunch break
1:30pm - 3:30pm Programming on Hadoop (Part 1) 
3:30pm - 4:00pm Tea break
4:00pm - 5:00pm Programming on Hadoop (Part 2)

Seating Capacity

This workshop is limited to 30 participants. Participation will be on a first-come-first-served basis.

Registration Fee

The above training is free-of-charge to all Singapore-based organisations and companies. Interested institutes of higher learning (IHLs) and other organisations in Singapore should consolidate staff and student registration.


Platinum Sponsors: Alatum IBM      
Silver Sponsor: IGEL        
Bronze Sponsors: 1 Degree North Microsoft NCS
Venue Sponsor: SMU SIS   Lanyard & Bag Sponsor: IBM  
Organized By: IDA ngo SCS SITF
A*Star NTU NUS SMU SIS  

Shadow