Toronto PASS

EventDisplay

Next Meeting

Tuesday,

April

22

TorPASS 2014 Apr: Discovery of Hadoop Under a Relational Lens Scope

Discovery of Hadoop Under a Relational Lens Scope

This session aims to demystify the Big Data concept. Explaining the jargon of Hadoop and NoSQL. Show what Hadoop can do that Relational cannot and vice-versa.

·         High level introduction to Hadoop and NoSQL

·         Similarities and Differences between Hadoop and Relational.

·         Alternative dev environment based on Open Source tools and Linux

·         Demo1: Show how to setup a Virtual Machine sandbox running Hortonworks Data Platform v2.20. Test the VM by running some quick exercises.

·         Demo2: Presentation of a 3 nodes Hadoop Cluster. Running on a “private cloud” based on Hyper-V 2012 R2.
From a reference scenario using SQL Server 012 “Raw/semi-structured data -> ETL -> Relational -> Queries -> BI Analytics Visualization”. Show three different implementations using a Hadoop Cluster to achieve the same results. 1) Using a MapReduce Java program 2) Using Pig script 3) Using Hive.


Agenda:

5:30 - 6:00 PM Check in, networking and pizza

6:00 - 8:00 PM Program

8:00 - 8:30 Wrap up and prize draw.  We have swag!

8:30 - ?? Beverages at a local pub

 

Parking and Transit

There is a street car stop within one block.  We recommend parking at the Green P lot that is just one block away.  We recommend you pay the max rate for parking until 7 AM the next morning, which amounts to about $6 depending on when you arrive.

Access

The doors to the building lock after 6PM.  Your registration for the meeting will include contact information to reach us to have someone escort you in.  The best bet is to arrive before 6PM and head to the fourth floor and head to the end of the hallway.

Featured Presentation:

Discovery of Hadoop Under a Relational Lens Scope

Tri Nguyen T4G

This session aims to demystify the Big Data concept. Explaining the jargon of Hadoop and NoSQL. Show what Hadoop can do that Relational cannot and vice-versa. • High level introduction to Hadoop and NoSQL • Similarities and Differences between Hadoop and Relational. • Alternative dev environment based on Open Source tools and Linux • Demo1: Show how to setup a Virtual Machine sandbox running Hortonworks Data Platform v2.20. Test the VM by running some quick exercises. • Demo2: Presentation of a 3 nodes Hadoop Cluster. Running on a “private cloud” based on Hyper-V 2012 R2. From a reference scenario using SQL Server 2012 “Raw/semi-structured data -> ETL -> Relational -> Queries -> BI Analytics Visualization”. Show three different implementations using a Hadoop Cluster to achieve the same results. 1) Using a MapReduce Java program 2) Using Pig script 3) Using Hive.

About Tri:
None

Back to Top
cage-aids
cage-aids
cage-aids
cage-aids