Data1984 – Telegram
Data1984
787 subscribers
44 photos
1 video
17 files
762 links
This channel is mostly about data related stuff, some of the main topics are #DataEngineering #SQL #Python #cloud .

Contact: @gorros
Download Telegram
Forwarded from LEFT JOIN
Лучшие практики использования SQL по версии Metabase.

Некоторые советы действительно ценные. Из того, что реально встречалось на практике особенно выделил бы один, так как в свое время удалось существенно оптимизировать время выполнения запроса:
Prefer EXISTS to IN
If you just need to verify the existence of a value in a table, prefer EXISTS to IN, as the EXISTS process exits as soon as it finds the search value, whereas IN will scan the entire table. IN should be used for finding values in lists.

Metabase довольно интересный инструмент, на одном из проектов используем его, надо бы записать видео в продолжение Гайда по BI.
Если у кого-то есть время и интерес поисследовать Metabase, а также записать видео, пишите мне в DM: @valiotti.
Interesting long read about databases for website analytics. Besides telling about their experience of choosing right solution, author describes migration process and different options. However, he doesn't mention Druid which I think can be a good fit.
https://usefathom.com/blog/worlds-fastest-analytics

Credits to: https://news.1rj.ru/str/rockyourdata/2448
Did you notice that there are no books on data engineering or just a couple which mostly describe different technologies that data engineers can use. Of course, there is "Designing Data-Intensive Applications", but this book makes sense if you have some experience. Courses also focus on technologies but not general knowledge that data engineer should have. Also technologies advertise what problems they solve, but these problems are familiar to more or less experienced data engineers. And currently there is a high demand of data engineers.
What will you look for to become a data engineer?
Anonymous Poll
31%
A book
19%
A video course
25%
A training or bootcamp
25%
An internship / real project
So as you may know, AWS has Lambda and Step Functions. But it seems Azure took a bit different approach with Durable functions. I am not sure if this is a right comparison, but it is interesting idea to have not only stateless functions but also stateful ones.
Partitioning also known as "poor man's indexing"
Never heard of this one 😁
MS_Azure_AWS_Comparison_eBook_v4.pdf
640.3 KB
Cloud Service Map for #AWS and #Azure (more up-to-date version here)