Implementing Real-Time Trending Topics With a Distributed Rolling Count Algorithm in Storm
Storm demo with a reasonably complex topology. ‘how to implement a distributed, real-time trending topics algorithm in Storm. It uses the latest features available in Storm 0.8 (namely tick tuples) and should be a good starting point for anyone trying to implement such an algorithm for their own application. The new code is now available in the official storm-starter repository, so feel free to take a deeper look.’
(tags: storm distcomp distributed tick-tuples demo)
-
‘a UNIX init scheme with service supervision’ – philosophically similar to daemontools, widely packaged, LSB init.d-script-compliant, BSD-licensed
‘The Uni?ed Logging Infrastructure for Data Analytics at Twitter’ [PDF]
A picture of how Twitter standardized their internal service event logging formats to allow batch analysis and analytics. They surface service metrics to dashboards from Pig jobs on a daily basis, which frankly doesn’t sound too great…
(tags: twitter analytics event-logging events logging metrics)
Ivan Beshoff, Last Survivor Of Mutiny on the Potemkin, founded Beshoffs
wow. there’s a factoid! the “Beshoffs” chain of chippers in Dublin were founded by this historic figure, who died in 1987
(tags: factoids beshoffs chips dublin history small-world battleship-potemkin russia)