In out last adventure we got did a basic example of Apache Streams with Twitter data. This week we’re going to extend that example with Facebook data! Also note, if this seems a little light it is because it’s not that different from the last post and the full explanations are there. Our goal here […]Read more "Getting to Know Your Friends with Apache Streams"
Apache Streams is a utility for easily interacting with an ever growing galaxy of social media APIs, collecting data into a common format, and persisting to file or DB.
This post is the first of many to explore this exciting project.Read more "Dipping Your Toes in Apache Streams"
In this post we’re going to really show off the coolest (imho) use-case of Apache Mahout – roll your own distributed algorithms. All of these posts are meant for you to follow-along at home, and it is entirely possible, you don’t have access to a large YARN cluster. That’s OK. Short story- they’re free on […]Read more "Deep Magic Volume2: Absurdly Large OLS with Apache Mahout"
Fishing for tweet with FlinkRead more "Big Data for n00bs: My first streaming Flink program (twitter)"
Big Data for n00bs: is a series I am working on of absolute simplest working examples for people just getting started in Big Data. In this post we Explore ‘Gelly’ the graph processing library of Apache Flink.Read more "Big Data for n00bs: Gelly on Apache Flink"
I was at Apache Big Data last week and got to talking to some of the good folks at the Apache Mahout project. For those who aren’t familiar, Apache Mahout is a rich Machine Learning and Linear Algebra Library that originally ran on top of Apache Hadoop, and as of recently runs on top of […]Read more "Deep Magic Volume 1: Visualizing Apache Mahout in R via Apache Zeppelin (incubating)"
So, I was on a call for the November i-Com Data Science Board meeting Thursday morning. There were supposed to be 5 minute presentations with discussions on a few topics, however a couple of the presenters couldn’t make the call, including the presenter on Streaming Data Processing, and Real Time Analytics. So I think to […]Read more "Why Flink is going to upend the Digital Advertising industry in 5 minutes."