The innards of Evernote’s new business analytics data warehouse
replacing a giant MySQL star-schema reporting server with a Hadoop/Hive/ParAccel cluster
(tags: horizontal-scaling scalability bi analytics reporting evernote via:highscalability hive hadoop paraccel)
HBase Real-time Analytics & Rollbacks via Append-based Updates
Interesting concept for scaling up the write rate on massive key-value counter stores:
‘Replace update (Get+Put) operations at write time with simple append-only writes and defer processing of updates to periodic jobs or perform aggregations on the fly if user asks for data earlier than individual additions are processed. The idea is simple and not necessarily novel, but given the specific qualities of HBase, namely fast range scans and high write throughput, this approach works very well.’
(tags: counters analytics hbase append sematext aggregation big-data)
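A minimal sketch of the append-instead-of-update idea quoted above, using a plain in-memory Python class as a stand-in for the store. This is not the HBase API or the linked article's implementation; the class name `AppendOnlyCounters` and its methods are hypothetical, chosen only to show the write path, the on-the-fly read, and the periodic rollup:

```python
import itertools


class AppendOnlyCounters:
    """In-memory stand-in for an append-based counter store (assumption: not HBase)."""

    def __init__(self):
        self._totals = {}                # rolled-up value per key
        self._pending = []               # unprocessed (key, seq, delta) appends
        self._seq = itertools.count()    # monotonically increasing append id

    def increment(self, key, delta=1):
        # Write path: a blind append. The current value is never read,
        # so there is no Get+Put round trip.
        self._pending.append((key, next(self._seq), delta))

    def get(self, key):
        # Read path: rolled-up total plus any deltas not yet processed,
        # aggregated on the fly.
        pending = sum(d for k, _, d in self._pending if k == key)
        return self._totals.get(key, 0) + pending

    def compact(self):
        # Periodic job: fold every pending delta into the totals, then drop them.
        for key, _, delta in self._pending:
            self._totals[key] = self._totals.get(key, 0) + delta
        self._pending.clear()


counters = AppendOnlyCounters()
counters.increment("page:/home")
counters.increment("page:/home", 4)
print(counters.get("page:/home"))   # aggregated on the fly, before any rollup
counters.compact()
print(counters.get("page:/home"))   # same value, now served from the rolled-up total
```

In a real store the pending scan would be a range scan over the appended rows for one key, which is where the fast range scans mentioned in the quote come in.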
Cliff Click in “A JVM Does What?”
interesting presentation on YouTube from Azul’s Cliff Click on some Java/JVM innards
(tags: presentation concurrency jvm video java youtube cliff-click)