Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending. Unfortunately there was not much that caught my eye this week, so this is a short post.
Distributed Computing
- Scaling Push Messaging for Millions of Devices @Netflix - Susheel Aroskar at QCon NY. An InfoQ article about Zuul Push, which is a high performance asynchronous service based on Apache Netty, a non-blocking I/O (NIO)-based network application framework. The technology handles more than 5.5 million connected clients at peak.
Streaming
- Apache Kafka vs. Enterprise Service Bus (ESB)—Friends, Enemies, or Frenemies?. A blog post showing why so many enterprises leverage the open source ecosystem of Apache Kafka for successful integration of different legacy and modern applications, and how this differs but also complements existing integration solutions like ESB or ETL tools.
- Simplify Streaming Stock Data Analysis Using Databricks Delta. A post about how Databricks Delta helps solving many of the pain points of building a streaming system to analyze stock data in real-time.
Data Science
- Python vs (and) R for Data Science. This post acts as a guide for those wishing to choose between Python and R Programming languages for Data Science.
SQL Server Machine Learning Services
I am busy struggling with writing a couple of posts in the sp_execute_external_script and SQL Server Compute Context series, as well as the Install R Packages in SQL Server ML Services series. I do hope to have at least one out in a weeks time (or so).
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.