github.com/nathanmarz/storm

Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more

Contributors: 66

Lines of Code: 5,848

From: 2011-09-17

To: 2013-12-13

About nathanmarz/storm

Storm is a distributed stream processing framework for real-time computation at scale. Written primarily in Clojure and Java, it processes continuous data streams through a topology of interconnected components: spouts, which emit streams of tuples, and bolts, which consume those tuples and may emit new ones. This model lets developers process unbounded sequences of events with fault tolerance and high throughput. Beyond stream processing, Storm also supports distributed RPC and other real-time computation patterns.
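
To make the spout-and-bolt idea concrete, here is a minimal single-process sketch in plain Java. It is not Storm's API: the `Spout` and `Bolt` interfaces, the `MiniTopology` class, and the word-count wiring are all hypothetical names chosen for illustration, and the sketch models neither distribution nor fault tolerance. It only shows how a source feeds tuples through a chain of processing steps.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Consumer;

// Conceptual sketch only: every type here is hypothetical and is NOT Storm's API.
class MiniTopology {

    /** A spout is a source of tuples; each call emits at most one tuple. */
    interface Spout<T> {
        // Returns false once the source has nothing left to emit.
        boolean nextTuple(Consumer<T> emit);
    }

    /** A bolt consumes one tuple and may emit zero or more tuples downstream. */
    interface Bolt<I, O> {
        void execute(I tuple, Consumer<O> emit);
    }

    /** Emits a fixed list of sentences, one per call. */
    static class SentenceSpout implements Spout<String> {
        private final Iterator<String> it;

        SentenceSpout(List<String> sentences) {
            this.it = sentences.iterator();
        }

        public boolean nextTuple(Consumer<String> emit) {
            if (!it.hasNext()) {
                return false;
            }
            emit.accept(it.next());
            return true;
        }
    }

    /** Splits each sentence into lowercase words and emits every word. */
    static class SplitBolt implements Bolt<String, String> {
        public void execute(String sentence, Consumer<String> emit) {
            for (String word : sentence.trim().split("\\s+")) {
                if (!word.isEmpty()) {
                    emit.accept(word.toLowerCase());
                }
            }
        }
    }

    /** Terminal bolt: keeps a running count per word and emits nothing. */
    static class CountBolt implements Bolt<String, Void> {
        private final Map<String, Integer> counts = new TreeMap<>();

        public void execute(String word, Consumer<Void> emit) {
            counts.merge(word, 1, Integer::sum);
        }

        Map<String, Integer> counts() {
            return counts;
        }
    }

    /** Wires spout -> split -> count and drains the spout. */
    public static Map<String, Integer> run(List<String> sentences) {
        Spout<String> spout = new SentenceSpout(sentences);
        SplitBolt split = new SplitBolt();
        CountBolt count = new CountBolt();

        Consumer<String> toCount = word -> count.execute(word, ignored -> { });
        Consumer<String> toSplit = sentence -> split.execute(sentence, toCount);

        while (spout.nextTuple(toSplit)) {
            // Keep pulling until the (finite, for this sketch) source is exhausted.
        }
        return count.counts();
    }

    public static void main(String[] args) {
        System.out.println(run(Arrays.asList("the cow jumped", "the moon")));
    }
}
```

In a real deployment the stream is unbounded and each component runs as many parallel tasks spread across machines; the loop above stands in for that only to keep the example runnable in one process.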

The project was originally created by Nathan Marz and has since been adopted by the Apache Software Foundation as an incubating project. This repository is a mirror of the official Apache-hosted codebase, and primary development now takes place under the Apache umbrella. Users and developers are directed to contribute through Apache's infrastructure, including its JIRA issue tracker and the official mailing lists for end users and for developers.

Storm is designed for organizations needing to process high-volume data streams with low latency, making it suitable for use cases like real-time analytics, monitoring, fraud detection, and event processing. Its distributed architecture ensures that computations can scale horizontally across multiple nodes while maintaining resilience to node failures.
