Welcome to the Apache Flume wiki!
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. Its main goal is to deliver data from applications to Apache Hadoop's HDFS. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic applications. Please click here for the user guide.
It is written primarily in Java and has been tested on unix-like systems:
- Ubuntu 9.4+ (DEB compatible)
- Centos 5.3+ (RPM compatible)
- RHEL 5.5+
- SLES 11
- Mac OS X
Releases
Resources
Project Administration
- Apache Project Reports
- Apache Mailing Lists
- Apache Flume Issue Tracker
- IRC channel #flume on irc.freenode.net
User Resources
Developer Resources
Publications
- Apache Flume: Distributed Log Collection for Hadoop
by Steve Hoffman. Publisher: PACKT Publishing
Release: July 2013, Pages: 108, ISBN: 1782167919
Thanks!
YourKit is kindly supporting open source projects with its full-featured Java Profiler. YourKit, LLC is the creator of innovative and intelligent tools for profiling Java and .NET applications. Take a look at YourKit's leading software products: YourKit Java Profiler and YourKit .NET Profiler.