Interesting Stuff - Week 15, 2022

Posted by nielsb on Sunday, April 17, 2022

Throughout the week, I read a lot of blog-posts, articles, and so forth that has to do with things that interest me:

  • AI/data science
  • data in general
  • data architecture
  • streaming
  • distributed computing
  • SQL Server
  • transactions (both db as well as non db)
  • and other “stuff”

This blog post is the “roundup” of the things that have been most interesting to me for the week just ending.

Data / Data Science / Machine Learning

  • Feathr: LinkedIn’s feature store is now available on Azure. In Data Science / Machine Learning, you have the concept of a Feature Store. The Feature Store is a place where features are centralized in an organization so they can be reused, served in a synchronous way between offline and online environments, and served in real-time with low latency. Feathr is the feature store that has been used in production and battle-tested in LinkedIn for over 6 years. It has now been open-sourced and made available on Azure. This blog post looks closer at what Feathr is and how you can get started with it.
  • Data Mesh Observability Pattern. The Data Mesh Observability Pattern helps you understand the state of your Data Mesh, the movement of its data, and its usage patterns. This blog post looks at how this pattern works and why it is crucial to the success of your enterprise Data Mesh.

Azure Data Explorer

  • Send data from Node-RED to free Azure Data Explorer (Kusto). Node-RED is a flow-based development tool for visual programming, and it provides a browser-based editor that makes it easy to wire together flows. The flows are connected through nodes. This article describes how to send data directly from Node-RED to Azure Data Explorer using the new Node-RED to Kusto node.
  • Obfuscate strings in KQL queries. This post does what it says “on the tin”. It looks at obfuscating strings in KQL queries in Azure Data Explorer to avoid displaying sensitive information when someone is reviewing executed queries.

Streaming

  • How Apache Kafka Works: An Introduction to Kafka’s Internals. You can be a decent developer without knowing/understanding the inner workings of your chosen technology(s). However, if you want to be great, it is a must to understand the underlying “plumbing”. This post announces a Kafka Internals course by Jun Rao. Jun was one of the original developers of Kafka and is co-founder of Confluent - he knows what he is talking about. If you are interested in Kafka, I urge you to have a look at that course.
  • Presto® on Apache Kafka At Uber Scale. This post looks at how Uber has connected Kafka and Presto together to enable a lightweight, interactive SQL query directly over Kafka via Presto at Uber scale.

~ Finally

That’s all for this week. I hope you enjoy what I did put together. Please comment on this post or ping me if you have ideas for what to cover.


comments powered by Disqus