Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
.NET
- Update on .NET Core 3.0 and .NET Framework 4.8. A blog post from the .NET engineering team, where they talk about the future of the .NET Framework and .NET Core. I wonder if this post was prompted by speculations recently about the future of the .NET Framework, where there were questions whether the .NET Framework 4.8 would be the last version, and all development would be concentrated on .NET Core.
Azure
- Enabling real-time data warehousing with Azure SQL Data Warehouse. This post is an announcement how Striim now fully supports SQL Data Warehouse as a target for Striim for Azure. Striim is a system which enables continuous non-intrusive performant ingestion of enterprise data from a variety of sources in real time.
Streaming
- Is Event Streaming the New Big Thing for Finance?. An excellent blog post by Ben Stopford where he discusses the use of event streaming in the financial sector.
- Troubleshooting KSQL – Part 2: What’s Happening Under the Covers?. The second post by Robin Moffat about debugging of KSQL. In this post - Robin, as the title says, goes under the covers to figure out what happens with KSQL queries.
- 6 things to consider when defining your Apache Flink cluster size. This post discusses how to plan and calculate a Flink cluster size. In other words; how to define the number of resources you need to run a specific Flink job.
MS Ignite
- Syllabuck: Ignite 2018 Conference. A great list of MS Ignite sessions that Buck Woody found interesting! Now I know what to do in my spare time!
Data Science
- Customized regression model for Airbnb dynamic pricing. This post by Adrian is about a white-paper which details the methods that Airbnb use to suggest prices to listing hosts.
- Cleaning and Preparing Data in Python. A post which lists Python methods and functions that helps to clean and prepare data.
- The Microsoft Infer.NET machine learning framework goes open source. A blog post from Microsoft Research, in which they announce the open-sourcing of Infer.NET. Is anyone else but me somewhat confused about the various data science frameworks that Microsoft has?
- How to build a Simple Recommender System in Python. A blog post which discusses what a recommender system is and how you can use Python to build one.
What Is Niels Doing (WIND)
That is a good question! As you know, I wrote two blog posts about SQL Server 2019:
My plan was to relatively quickly follow up those two posts with a third post how to run SQL Server Machine Learning Services on SQL Server 2019 on Linux, and do it inside a Docker container. After having spent some time trying to get it to work, (with no luck), I gave up and contacted a couple of persons in MS asking for help. The response was that, right now in SQL Server 2019 on Linux CTP 2.0, you cannot do it - bummer! The functionality will be in a future release.
I am now reworking the post I had started on to cover SQL Server Machine Learning Services in an Ubuntu based SQL Server 2019 on Linux. I should be able to publish something within a week or two.
I am also working on the third post in the Install R Packages in SQL Server ML Services series (still). Right now I have no idea when I can publish it - Sorry!
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.