Throughout the week, I read a lot of blog-posts, articles, etc., that has to do with things that interest me
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This is the “roundup” of the posts that has been most interesting to me, for the week just gone by.
Distributed Computing
- Patterns of Microservices Architecture. Phil Calcado is an expert on microservices architecture. The post I am pointing to contains links to articles about patterns in microservices architecture.
Data Science
- Selling Analytics to your Stakeholders. If you are a data scientist, or a data nerd in general, or a manager - this article is a MUST read. Enough said!
- Tutorial: Publish an R function as a SQL Server stored procedure with the sqlrutils package. “Aw schucks, blush” - the guys at Revlution Analytics points to a blog-post of mine, how to use the
sqlrutils
package to create SQL Server stored procedures from R code. Thanks guys!!! - Using Microsoft R Server Operationalization on HDInsight. Microsoft R Server has an awesome operationalization story! This blog-post talks about how to do it on HDInsight!
The guys at Revolution Analytics have really been busy the last week. Apart from the post above about sqlrutils
, the rest of the content under Data Science comes from them!
- Tutorial: Deep Learning with R on Azure with Keras and CNTK. Microsoft Cognitive Toolkit (CNTK) is a commercial-grade and open-source framework for deep learning tasks. Unfortunately for you if you are an R programmer, CNTK does not have an R interface (too bad, so sad!). Fortunately however, the RStudio team has developed an R interface for Keras making it possible to run CNTK (plus other deep learning backends), from within an R session.
- dplyrXdf 0.10.0 beta prerelease. The dplyr package is a toolkit for data transformation and manipulation. It is a great package, but it has the “usual” R limitations of everything being in-memory. The dbplyrXdf package however, does not requires all data to be stored in memory, but it allows it to be stored on disk as well, which will make it ideal for handling really large datasets.
- In case you missed it: July 2017 roundup. The monthly roundup from Revolution Analytics, quite a few interesting things to have a look at!
SQL Server R Services
I am still working on episode Internals - IX in the SQL Server 2016 R Services series. I had aimed to have it published this weekend, but work “reared its ugly head, so …”. However, I am fairly confident I’ll be able to publish it before the coming weekend! If you are interested in the previous “episodes” they can be found here.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.