Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Cloud
- Hello, Pulumi!. Back in 2017, I wrote in Weekly Roundup 22 about Joe Duffy and his startup Pulumi. At that time, no one knew what Pulumi was all about. Well, that changed with today’s linked post. Joe explains about Pulumi, and it does sound extremely exciting. Go forth and read all about it!
Databases
- The Future of Distributed Databases Is Relational. This is an InfoQ presentation about creating a more modern relational database. It is about Postgres and the journey to scale out and make it truly distributed. Fascinating!
Streaming
- Streaming SQL to Unify Batch & Stream Processing w/ Apache Flink @Uber. Another InfoQ presentation. This presentation is about exploring SQL’s role in the world of streaming data and its implementation in Apache Flink and covering streaming semantics, event time, and incremental results. Interesting!
- Getting Started with dA Platform on Google Kubernetes Engine. This article describes the setup of the dA Platform using Google Cloud’s Kubernetes engine.
- Introducing Confluent Hub. From the post: “Confluent Hub is a place for the Apache Kafka and Confluent Platform community to come together and share the components the community needs to build better streaming data pipelines and event-driven applications.”. Sounds intriguing!
Data Science
- The Data Analysis Maturity Model – Level Two: Reliable Data Storage and Query Systems. The second post in a series by Buck Woody about defining a Data Analysis Maturity Model. In this post, Buck discusses data storage and querying.
- PYPL Language Rankings: Python ranks #1, R at #7 in popularity. A post by David at Revolution Analytics about a popularity index of programming languages.
SQL Server Machine Learning Services
Boy, have I been a busy bee this week. I have managed to get two posts published, which must be a record. But none of the posts is the follow-up post to my [sp_execute_external_script and SQL Compute Context - I][ctx1] post from five weeks ago. This is getting stupid!
- Installing R Packages in SQL Server Machine Learning Services - I. This post, which is the first post in a series, came about due to a mate of mine asking how he could install R packages into SQL Server Machine Learning Services. In this particular post, we look at using R packet managers to install packages.
- sp_execute_external_script and Permissions. While researching the Installing R Packages series I came about some issues related to permissions and
sp_execute_external_script
. This post tries to clarify and explain how it works.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.