Table of Contents |
---|
Hive Meetups
January 2016 Hive User Group Meetup
- Monitoring Hive - Szehon Ho
- What's new in Hive 2.0 - Sergey Shelukhin
- Hive on Spark: now and future - Xuefu Zhang
November 2015 Hive Contributor Meetup
- Metrics in Hive - Mohit Sabharwal
- Hive at Uber - Mohammad Islam
- Capybara Hive Integration Testing - Alan Gates
- Random Query Gen- Hive Meetup - Szehon Ho
- Lets talk sessionstate - Anthony Hsu
- Lessons learned from enterprise users - Xuefu Zhang
- Hardening Hive - John P
- LLAP: Sub-Second Execution in Hive - Gopal V
April 2015 Hive Contributor Meetup Presentations
- SQLLine - The tale of the twins that everyone loved too much – Julian Hyde
- ResultSet Compression in Apache Hive – Rohit Dholakia, George Chow
- High availability/Rolling upgrade for Hiveserver2 - Vaibhav Gumashta
- The birds and bees talk (Replication) - Sushanth Sowmyan
February 2015 Hive User Meetup Presentation
- Hive Join Optimizations: MR and Spark (Szehon Ho)
- Cascading and Hive (Ryan Desmond)
November 2013 Hive Contributors Meetup Presentations
- Using Dynamic Compilation with Hive (Edward Capriolo)
- Let There Be Tez: Current Status and Demo (Gunther Hagleitner)
- Hive Authorization Improvement Proposal - HIVE-5837 (Thejas Nair)
- Insert, update, and delete in Hive with full ACID support - HIVE-5317 (Owen O'Malley)
- Apache Sentry (Brock Noland)
- Cost Based Optimizer Design - HIVE-5775 (John Pullokkaran)
- Join syntax improvement - HIVE-5555 (Ashutosh Chauhan)
June 2013 Hadoop Summit Hive Meetup Presentations
- Hive Correlation Optimizer (Yin Huai)
February 2013 Hive User Group Meetup
- Hive at KIXEYE and in the Gaming Industry (Aaron Sun)
- Brickhouse: Klout's Open Source UDF Library for Hive (Jerome Banks)
- Hive Client/Server Deployment Options (Prasad Mujumdar)
- Case Study: Utilizing Windowing and Partitioned Table Functions with Hive (Murtaza Doctor)
- New Features in the Next Version of Hive (Ashutosh Chauhan)
June 2012 Hadoop Summit Hive Meetup Presentations
- What's News in Apache Hive 0.9.0 (Ashutosh Chauhan, Hortonworks)
- HiveServer2 Project (Carl Steinbach, Cloudera)
- Column Statistics in Hive (Shreepadma Venugopalan, Cloudera)
- HCatalog: Extending Hive's Metadata to Pig and MR (Alan Gates, Hortonworks)
- Continuous Aggregations with Hive (Viral Bajaria, Hulu)
November 2011 NYC Hive Meetup Presentations
- Breaking First Normal Form (Edward Capriolo, Media 6 Degrees)
- HAWK: Performance Monitoring for Hive (JunHo Cho, NexR)
- RHive: Integrating R and Hive (JunHo Cho, NexR)
- Replacing the Hive Execution Layer with Pervasive Turbo Rush (Jim Falgout, Pervasive Software)
- Using JDBC with Hive (Bennie Schut, eBuddy)
Older Hive Presentations
- High Volume Updates in Hive from 2012 Hadoop Summit (Owen O'Malley)
- Join Strategies in Hive from the 2011 Hadoop Summit (Liyin Tang, Namit Jain)
- Replacing an Oracle DB/DW with Hadoop/Hive from the 2011 Hadoop Summit Hive Contributor Meeting (JunHo Cho, NexR)
- RCFile Paper at ICDE 2011, Hannover, Germany (Yongqiang He (Facebook), Rubao Lee (OSU), Yin Huai (OSU), Zheng Shao (Facebook), Namit Jain (Facebook), Xiaodong Zhang (OSU), Zhiwei Xu (ICT, CAS))
- Join Optimization in Hive (Liyin Tang, Facebook)
- Hive Presentation at ApacheCon NA 2010 (John Sichi, Facebook)
- Using Hadoop and Hive to Optimize Travel Search (Jonathan Seidman and Ramesh Venkataramaiah, Orbitz)
- HBase Meetup HUG10 - Hive/HBase Integration (John Sichi, Facebook)
- Hive User Group Presentation - Hive Quick Start Tutorial (Carl Steinbach, Cloudera)
- Hive User Group Presentation - New Features and APIs from Facebook (Ning Zhang, Yongqiang He, Namit Jain, John Sichi, Paul Yang, Zheng Shao, Facebook)
- Hive User Group Presentation from Netflix (Eva Tse and Jerome Boulon, Netflix
Presentations about Hive
A list of presentations mainly focused on Hive.
- Hive ApacheCon 2008, New Oreleans, LA (Ashish Thusoo, Facebook)
- Facebook and Open Source, UIUC, (Zheng Shao, Facebook)
- Hive: Data Warehousing with Hadoop, NYC Hadoop User Meetup (Jeff Hammerbacher, Cloudera)
- Hive: Data Warehousing Analytics on Hadoop, UC Berkeley, (Joydeep Sarma, Namit Jain, Zheng Shao, Facebook)
- An Introduction to Hive, Jeff Hammerbacher, Facebook
- Large Scale Data Processing using commodity SW/HW, IIT-Delhi CS Dept., (Joydeep Sen Sarma, Facebook)
- Data Warehousing & Analytics on Hadoop, Percon Conference, Santa Clara, CA, USA (Ashish Thusoo, Prasad Chakka, Facebook)
- Hive: Hadoop Summit 2009, Santa Clara, CA, USA (Namit Jain, Zheng Shao, Facebook)
- Hive: VLDB 2009, Lyon, France (Facebook)
- Hive Object Model, (Zheng Shao, Facebook)
- Hive User Group Meeting August 2009, (Facebook)
- Rethinking the Data Warehouse with Hadoop and Hive (Ashish Thusoo, Facebook at Hadoop World NYC 2009)
- Hive Anatomy – System & Pseudo-code level Architecture Review, (Ning Zhang, internal presentations, Facebook)
- User Defined Table Generating Functions, (Paul Yang, Facebook)
- Hive Training – Motivations and Real World Use Cases, (Ning Zhang, Facebook, at Cloudera's training session)
- Hive Presentation at QCon Nov 2009 (Ashish Thusoo and Namit Jain, Facebook)
- Hive Paper and Presentation at ICDE 2010, Long Beach, California (Raghotham Murthy and Namit Jain)
- Hive Presentation at QCon Nov 2009 (Ashish Thusoo and Namit Jain, Facebook)
- Hive Training – Motivations and Real World Use Cases, User Group Presentation from Netflix (Eva Tse and Jerome Boulon, Netflix)Hive User Group Presentation - New Features and APIs from Facebook (Ning Zhang, Yongqiang He, Namit Jain, John Sichi, Paul Yang, Zheng ShaoFacebook, at Cloudera's training session)
- User Defined Table Generating Functions, (Paul Yang, Facebook)
- Hive User Group Presentation - Hive Quick Start Tutorial (Carl Steinbach, Cloudera)
- HBase Meetup HUG10 - Hive/HBase Integration (John Sichi, Facebook)
- Using Hadoop and Hive to Optimize Travel Search (Jonathan Seidman and Ramesh Venkataramaiah, Orbitz)
- Hive Presentation at ApacheCon NA 2010 (John Sichi, Facebook)
- Join Optimization in Hive (Liyin Tang, Facebook)
- RCFile Paper at ICDE 2011, Hannover, Germany (Yongqiang He (Facebook), Rubao Lee (OSU), Yin Huai (OSU), Zheng Shao (Facebook), Namit Jain (Facebook), Xiaodong Zhang (OSU), Zhiwei Xu (ICT, CAS)) Replacing an Oracle DB/DW with Hadoop/Hive at 2011 Hadoop Summit Hive Contributor Meeting (JunHo Cho, NexR)
- Anatomy – System & Pseudo-code level Architecture Review, (Ning Zhang, internal presentations, Facebook)
- Rethinking the Data Warehouse with Hadoop and Hive (Ashish Thusoo, Facebook at Hadoop World NYC 2009)
- Hive User Group Meeting August 2009, (Facebook)
- Hive Object Model, (Zheng Shao, Facebook)
- Hive: VLDB 2009, Lyon, France (Facebook)
- Hive: Hadoop Summit 2009, Santa Clara, CA, USA (Namit Jain, Zheng Shao, Facebook)
- Data Warehousing & Analytics on Hadoop, Percon Conference, Santa Clara, CA, USA (Ashish Thusoo, Prasad Chakka, Facebook)
- Large Scale Data Processing using commodity SW/HW, IIT-Delhi CS Dept., (Joydeep Sen Sarma, Facebook)
- An Introduction to Hive, Jeff Hammerbacher, Facebook
- Hive: Data Warehousing Analytics on Hadoop, UC Berkeley, (Joydeep Sarma, Namit Jain, Zheng Shao, Facebook)
- Hive: Data Warehousing with Hadoop, NYC Hadoop User Meetup (Jeff Hammerbacher, Cloudera)
- Facebook and Open Source, UIUC, (Zheng Shao, Facebook)
- Hive ApacheCon 2008, New Oreleans, LA (Ashish Thusoo, Facebook)
Related Work
- Processing Theta-Joins using MapReduce (A. Okcan, M. Riedewald)
- Optimizing Joins in a Map-Reduce Environment (F. Afrati, J. Ullman)
- Efficient Parallel Set-Similarity Joins Using MapReduce (R. Vernica, M. Carey, C. Li)
- A Comparison of Join Algorithms for Log Processing in MapReduce (S. Blanas, J. Patel, V. Ercegovac, J. Rao, E. Shekita, Y. Tian)
- HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads (A. Abouzeid, K. Bajda-Pawlikowski, D. Abadi, A. Silberschatz, A. Rasin)
- Tenzing: A SQL Implementation On The MapReduce Framework
- Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
- YSmart: Yet Another SQL-to-MapReduce Translator (R. Lee, T. Luo, Y. Huai, F. Wang, Y. He, X. Zhang)
- Query Optimization Using Column Statistics in Hive (A. Gruenheid, E. Omiecinski, L. Mark)