Data1984 – Telegram
Data1984
787 subscribers
44 photos
1 video
17 files
762 links
This channel is mostly about data related stuff, some of the main topics are #DataEngineering #SQL #Python #cloud .

Contact: @gorros
Download Telegram
I know that "remote vs office" topic is all over the place, but still, what do you think, does commute or other perks worth going to office, or do you miss your colleagues?
Anonymous Poll
34%
Remote
5%
Office
61%
Hybrid
Data1984 pinned «I know that "remote vs office" topic is all over the place, but still, what do you think, does commute or other perks worth going to office, or do you miss your colleagues?»
This one worth highlighting too. What can be neater than architecture design as (Python) code 😎
dataeng2021.jpg
242.2 KB
Data engineering in 2021 by datastack.tv (reference)
If you work in one of these areas:
- Back-end development
- Data analytics
- Data science
- DevOps
and you are based in Armenia and want to become a data engineer, reach out to me (check channel denoscription).
Came across this comparison while reading Firebolt white paper. Not sure why authors used a Redshift cluster which is much larger than 1TB dataset, maybe to add more computational power. Anyway, don't get scared by these clusters' prices.
It seems these days it's all about Ops, DevOps, MLOps, DevSecOps and now we have Ops for data, DataOps 😎. And yes, this term can be heard more recently, but not sure if this is something new. I guess data engineers were already covering these aspects.
https://www.linkedin.com/posts/firebolt_how-vimeo-keeps-data-intact-with-85-billion-activity-6833790832726810624-Sdjs
In data engineering landscape there are always new interesting project. And sometimes only way you hear about them is by talking to other data professionals. So here are two cool projects I learned about from a friend:
1. Presidio: Data protection and anonymization library from Microsoft
2. Trino: a new query engine from creators of Presto
It seems one tool, dbt, is driving demand for new analytics engineer specialization. Spark is popular too,and often considered as main tool for data engineers, but it did not create a specialization.
I can say from my own experience that this is much better then post-factum analytics integration with traditional ETL. I just did not know that the term is IDT.
https://medium.com/whispering-data/the-end-of-etl-as-we-know-it-92166c19084c