Throughout the week, I read a lot of blog-posts, articles, etc., that has to do with things that interest me
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This is the “roundup” of the posts that has been most interesting to me, for the week just gone by. This week, the roundup comes a bit early due to me having to go abroad at the weekend for a week.
.NET
- .NET Core and .NET Standard: What Is the Difference?. An article from InfoQ discussing the differences between .NET Core, and .NET Standard.
SQL Server
- What’s new in SQL Server Management Studio 17.3. SQL Server 2017 was released during Microsoft Ignite a couple of weeks ago. This week the latest version of SQL Server Management Studio (SSMS) was released. This blogpost looks at what is new in the latest SSMS version.
Databases
- Is Facebook replacing its MySQL database with something more dynamic. This post speculates whether Facebook is about to replace their MySql databases, and it talks about Facebook’s in-house NoSql database; Apollo.
Streaming
The “war” between Apache Kafka and Apache Flink of which has the best SQL implementation for streaming data continues (I wrote about it in the roundups for week 35 and 36). This week there has been posts about SQL for streaming data from both Kafka as well as Flink.
- Uber Introduces Open Source AthenaX, A Streaming SQL Platform Powered By Apache Flink. This is very awesome. Uber has open sourced its streaming analytics platform AthenaX. This is definitely something we at Derivco will have a very close look at.
- Using Kafka Streams API for predictive budgeting. A post how Pinterest uses Kafka Streams API to provide inflight spend data to thousands of ads servers in mere seconds.
- Getting Started Analyzing Twitter Data in Apache Kafka through KSQL. How to use KSQL (KSQL is the open source streaming SQL engine for Apache Kafka) to query, analyse and transform data in Kafka.
Data Science
- The Microsoft Team Data Science Process (TDSP) – Recent Updates. Microsoft Team Data Science Process (TDSP) is an agile, iterative data science methodology to deliver predictive analytics solutions and intelligent applications efficiently. Its first version was released in September 2016. This post provides an overview of recent developments involving TDSP, including recent releases and how its adoption has gone since the first public release.
- Create an Azure Machine Learning Web Service with Python and Azure DSVM. A short and sweet tutorial how to use Python together with the Azure Data Science Virtual Machine (DSVM).
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.