Throughout the week, I read a lot of blog-posts, articles, etc., that has to do with things that interest me
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This is the “roundup” of the posts that has been most interesting to me, for the week just gone by.
Streaming
- Crossing the Streams – Joins in Apache Kafka. Kafka 0.10.1, introduced support for “Interactive Queries”, an API that allows querying stateful stream transformations without going through another Kafka topic. This blogpost looks at how to join streams, and what type of joins that exists.
- Disaster Recovery for Multi-Datacenter Apache Kafka Deployments. This post points to a white-paper how to set up Kafka across geo-locations for disaster recovery.
Data Science
- How Did We Build Book Recommender Systems in an Hour Part 1 — The Fundamentals. First part of a series how to build a recommender system.
- Preview: ALTREP promises to bring major performance improvements to R. David from Revolution Analytics talks about changes to the R engine, to improve performance and reduce memory usage.
- Cheat Sheets for AI, Neural Networks, Machine Learning, Deep Learning & Big Data. As the title say, cheat sheets for a lot of things data science.
- Distributed deep neural networks over the cloud, the edge, and end devices. Adrian from the morning paper looks at a whitepaper about distributed deep neural networks.
- Tutorial: Launch a Spark and R cluster with HDInsight. This post by David from Revolution Analytics points to a tutorial how to get up and running with a Spark cluster and R. Cool stuff!!
SQL Server R Services
In last weeks roundup I mentioned I’d be ready with Internals - XI soon:ish, and it would cover SQL Server R Services internal data transfer protocol Binary eXchange Language (BXL). I will be ready soon:ish with Internals - XI, but it will most likely cover something else than BXL, just so you know :). If you are interested; Internals - X is here.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.