Throughout the week, I read a lot of blog posts, articles, and so forth that have to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This is the “roundup” of the posts that have been most interesting to me over the Christmas and New Year period.
Misc.
- Identifying impactful service system problems via log analysis. This is Adrian’s dissection of a white paper about Log3C, a general framework that identifies service system problems from system logs. Go off and read Adrian’s take on it, and then go and download Log3C!
- Towards a theory of software development expertise. The last paper Adrian looked at in 2018. The paper asks what makes a good software developer and how you get better.
Databases
- DevOps for the Database. An InfoQ presentation trying to find the answer to why it is difficult to apply principles of DevOps to databases.
Distributed Computing
- Two-phase commit and beyond. A post by Murat discussing the two-phase commit protocol. It is an excellent read, and if you have any interest in distributed computing, you should read it.
Data Science
- GitHub Python Data Science Spotlight: AutoML, NLP, Visualization, ML Workflows. A blog post which spotlights a select group of open source Python data science projects with GitHub repos. I am particularly interested in Auto-Keras and MLflow.
.NET
- How to Test ASP.NET Core Web API. This is an InfoQ article investigating testing ASP.NET Core 2.0 Web API solutions. The article looks at internal testing with Unit Testing and external testing with a new testing framework in ASP.NET Core called Integration Testing.
- A Quick Tour of the .NET CLI. Another InfoQ article; this time it covers how several .NET OSS tools take advantage of the dotnet CLI and how we can use the new CLI tooling in our daily development.
Streaming
- Microsoft open sources Trill to deliver insights on a trillion events a day. A blog post introducing Trill, a high-performance streaming analytics engine. This is something I want to keep my eyes on!
- Easy Ways to Generate Test Data in Kafka. As the title says, a post discussing how to generate data for testing in Kafka.
What Did Niels Do During the Holidays
I was a (somewhat) busy bee during the holidays and managed to publish three blog posts:
- SQL Server 2019 Extensibility Framework & Java - Null Values. A post where I look at how to handle null values when calling Java code from SQL Server 2019.
- SQL Server ML Services - Multiple Input Data Sets. We look at how to push multiple datasets into external scripts.
- SQL Server 2019 Extensibility Framework & Java - Misc. “Stuff”. We look at SQL Server 2019 Java Extensions, Java packages, the CLASSPATH, and JAR files.
Right now I am working on two more posts related to the SQL Server 2019 Extensibility Framework. I hope to publish at least one this coming week.
~ Finally
That’s all for this week. I hope you enjoy what I put together. If you have ideas for what to cover, please comment on this post or ping me.