Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Distributed Computing
- Calvin: fast distributed transactions for partitioned database systems. In this white-paper dissection by Adrian, he looks at Calvin which is a transaction scheduling and data replication layer that uses a deterministic ordering guarantee to reduce the high contention costs associated with distributed transactions significantly.
SQL Server
- SQL Server 2019 community technology preview 2.4 is now available. What the title says. I downloaded the CTP a couple of days ago, and when I am done with this post, I will install it. Oh, word of warning - if you want to install the SQL Server 2019 Big Data Cluster, please remember to uninstall and reinstall
mssqlctl
, so you get the latest version.
Streaming
- The Importance of Distributed Tracing for Apache-Kafka-Based Applications. This blog post looks at how to instrument Kafka-based applications with distributed tracing capabilities to make dataflows between event-based components more visible. Very interesting!
- Consuming Messages Out of Apache Kafka in a Browser. The post covers what the title says; how to consume Kafka messages in a browser.
- Exploring KSQL Stream-Stream Joins. This is an excellent post by Robin where he covers KSQL and stream to stream joins! I need to go off and
play withresearch this now!
WIND (What Is Niels Doing)
I am still working on the post about CREATE EXTERNAL LIBRARY
in the Install R Packages in SQL Server ML Services series. Expect it towards the end of this coming week.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.