A neat duckdb snipped for string normalization

A recent project of mine involved determining duplicate CRM objects across Salesforce and Hubspot. I utilized duckdb for my data processing and found this neat little text function duckdb provides: strip_accents(string). It does exactly what it says: Strip accents from a string. Thus Mühleisen becomes Muheisen. This feature saved me from manually defining a map of umlaut characters and replacing them in a bunch of places. SELECT strip_accents(first_name) as first_name_normalized, ....

January 14, 2025 1 min

Impressions from dltHub's product launch event in Berlin

The dlt team has been on a global roadshow for the last few weeks, making the stop in their home-city of Berlin last Tuesday. The evening was packed with presentations, guest speakers, and product demos. And even though one speaker fell ill, it went well over the planned schedule. If it was up to me, it could have continued for a good while longer - I was really fascinated by the insights shared from the members of the community....

November 27, 2024 4 min

Hello World

Oh wow this actually works. Pretty neat. Let’s see what I will come up with…

November 10, 2024 1 min