Throughout the week, I read a lot of blog-posts, articles, and so forth, that has to do with things that interest me:
- data science
- data in general
- distributed computing
- SQL Server
- transactions (both db as well as non db)
- and other “stuff”
This blog-post is the “roundup” of the things that have been most interesting to me, for the week just ending.
Distributed Computing
- Induce controlled Chaos in Service Fabric clusters. A blog-post about Service Fabric Chaos. Chaos simulates intermittent, interleaved faults (both graceful and ungraceful) throughout the cluster over extended periods of time. Service Fabric Chaos allows developers to be able to test the stability of their services while the underlying unreliable infrastructure is going through complicated state transitions due to faults.
Streaming
- Scaling Uber’s Elasticsearch Clusters. An InfoQ presentation about how Uber scaled its Elasticsearch clusters as well as its ingestion pipelines for ingestions, queries, data storage, and operations.
- Cloud-native and Scalable Kafka Architecture. Another presentation from InfoQ. This presentation is about how Netflix addresses the issues of stability and scalability in a cloud environment by having many smaller and mostly immutable Kafka clusters with limited state changes.
- Give Meaning To 100 Billion Events A Day - The Analytics Pipeline At Teads. An article about how Teads orchestrates Kafka, Dataflow and BigQuery together to ingest and transform a large stream of events (close to 100 billion per day). A very interesting read!
Data Science
- Running Pleasingly Parallel workloads using rxExecBy on Spark, SQL, Local and Localpar compute contexts. A blog-post about
rxExecBy
which is a new RevoScaleR function introduced in Microsoft Machine Learning Server 9.1. The function can be used to partition input data source by keys and apply user-defined functions on individual partitions.
Upcoming Conferences
- Azure Global Boot-Camp 2018 Durban. On April 21, 2018, all Azure communities will come together once again in the sixth great Global Azure Bootcamp event! If you are in the Durban area, please join up and come and learn about all things Azure. Here in Durban, [Lawrance][lawr], who is a fellow MVP - but in the Azure space, arranges it. Yours truly presents two talks; one about Azure ML and the other about Azure Data Lake.
~ Finally
That’s all for this week. I hope you enjoy what I did put together. If you have ideas for what to cover, please comment on this post or ping me.