Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Distributed Computing
- The Kubernetes Effect. The first of two links this week from InfoQ that have to do with Kubernetes. This article discusses the evolution of Kubernetes and what Kubernetes can do for us.
Misc.
- Think before you code. In this post Murat talks about different coding styles, and how important it is to think before you code. To think about the design and code, does not necessarily mean that you think, think, think, and strive for the perfect design (In my mind there is no such thing as a perfect design). A fascinating article and I thoroughly enjoyed it!
Streaming
- Managing and Operating Kafka Clusters in Kubernetes. The second article this week about Kubernetes and this article discusses how Kubernetes can make running Kafka clusters much easier.
- Migrating Batch ETL to Stream Processing: A Netflix Case Study with Kafka and Flink. This article is about Netflix journey from an ETL based approach to moving data, to a real-time process using Kafka and Flink. A must read!!
- Apache Flink Master Branch Monthly: New in Flink in January 2018. This post is highlighting a selection of features that have been merged into Flink’s master branch during January.
SQL Server R Services
The Microsoft SQL Server R Services series is coming to an end. Last week I posted the last post of the Internals part, and I am now working on a post about sp_execute_external_script
.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.