Throughout the week, I read a lot of blog-posts, articles, and so forth, that have to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
SQL Server
- SQL Server 2017 Improved Resource Usage On Smaller Machines. Bob Dorr talks about improvements he made in SQL Server 2017 CU4 to improve resource usage on smaller machines. Some interesting tidbits about SQLCLR and garbage collection!
- SSMS 17.6 is now available: Managed Instance and many bug fixes. A blog post announcing the release of SQL Server Management Studio (SSMS) 17.6. Go and grab it before Microsoft runs out of stock!
.NET
- Exploring the internals of the .NET Runtime. An excellent blog post by Matthew in which he drills down into the .NET runtime and figures out how it works. A bonus to this post is the link to the Book of the Runtime.
Distributed Computing
- Anatomical similarities and differences between Paxos and blockchain consensus protocols. An interesting blog post by Murat where he compares the Paxos protocol with blockchain consensus protocols.
Streaming
- Devoxx 2017 – Rethinking Services With Stateful Streams. A presentation from Devoxx 2017, where Ben Stopford talks about microservices and streaming using Kafka. This presentation is a must-watch if you do microservices!
- Azure Event Hubs integration with Apache Spark now generally available. This post is an announcement from the Microsoft Azure Event Hubs team that integration between Event Hubs and Apache Spark in Azure is now available across regions. The Spark integration is cool, but I would like to see integration with Apache Flink as well.
- KSQL in Action: Enriching CSV Events with Data from RDBMS into AWS. In last week's roundup I linked to a post by Robin Moffatt where he discussed Apache Kafka integration with relational databases. In this post, Robin takes the database integration a step further and shows how we can enrich streaming data with database data in real-time!
Data Science
- Azure Databricks, industry-leading analytics platform powered by Apache Spark. Another Azure announcement: this is about how Azure Databricks - an Apache Spark based analytics platform optimised for Azure - is now available across regions in Azure.
- R and Docker. In the roundup from last week I linked to a post about Data Science and Docker, and in this post from David, he talks about R and Docker images.
SQL Server R Services
Over one year and 24 posts later, I have finally finished my Microsoft SQL Server R Services series, and the last post was:
- Microsoft SQL Server R Services: sp_execute_external_script - III. In this post I discussed parallelism when executing sp_execute_external_script. If I may say so myself, I quite liked it!
~ Finally
That’s all for this week. I hope you enjoy what I put together. If you have ideas for what to cover, please comment on this post or ping me.