Project Page
- Sqoop Project Overview
- Apache Project Website
- Mailing Lists
- Source Code of Sqoop project (GIT)
- Source Code of Sqoop website (SVN)
- IssueTracker (JIRA)
- Project Reports
- Sqoop Developer Meetings
General information:
- Sqoop Presentations and Blogs
Apache Sqoop Blog
Future Work
Sqoop 1.99.* Releases
1.99.6 Release ( TBD )
- Kite Connector Hbase support( basic read/write and DFM)
- Sqoop Config as Top Level Entity
1.99.5 Release
Feature Docs
- Intermediate Data Format API and custom implementations
- Sqoop MR Execution Engine - API and internals
- Sqoop Server Internals
- Sqoop Repository API
- Security Guide (Kerberos Authentication)( Server and Client )
- Status of DFM in Connectors
Design Docs
- Delta Fetch And Merge Design
- Avro Intermediate Data Format
- High Level Design of Role Based Access Controller
- Kite Connector Design - Hdfs/Hive
1.99.4 Release
Feature Docs
Feature Tickets
- Sqoop 2 From/to Refactoring
- Sqoop 2 Schema Matching
- Sqoop 2 Connector/Repository API enhancements
- Sqoop 2 Configurable Entity
Before 1.99.4
- Sqoop2 CSV Intermediate representation
- Sqoop2 - Resource Layout
- Sqoop2 Object design
- Sqoop2 Datatype Support
- Sqoop2 Weekly Meeting Minutes
- Sqoop2 Feature Requests
- Sqoop2 Proposal
Sqoop 1 Releases
Release 1.4.5
Archived
- Archived Resources
Sqoop User Resources
Community
Sqoop Developer Resources
General Guidelines for Development
Note |
---|
If you are contributing to Sqoop 2, refer the guidelines for Sqoop2 for coding guidelines and review guidelines |
- Developers Quick Hack Sheet
- How-to Sqoop Design wiki - Must read if you are contributing more than a few lines of code
What is Apache Sqoop?
Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational databases and enterprise data warehouses.
Sqoop provides a pluggable connector mechanism for optimal connectivity to external systems. The Sqoop extension API provides a convenient framework for building new connectors. New connectors can be dropped into Sqoop installations to provide connectivity to various systems. Sqoop itself comes bundled with various connectors that can be used for popular database and data warehousing systems.
Getting started
Section | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|
Resources
...
Column | ||
---|---|---|
| ||
Community
|
Column | ||
---|---|---|
| ||
Users |
...
width | 25% |
---|
Developers
...
...
- How to Update Project Website (Post Release)
- Pre-commit hook
Sqoop 2 (1.99.* releases)
...
width | 25% |
---|
Archived
...
- Coding Guidelines - for the contributor
- Sqoop 2 Review Guidelines (per component) - for the reviewer
- FAQ Sqoop2 Integration tests
Sqoop 1
...