Great presentation about Data Mesh, a term coined by Zhamak Dehghani in her original article.
YouTube
Data Mesh in Practice: How Europe's Leading Online Platform for Fashion Goes Beyond the Data Lake
The Data Lake paradigm is often considered the scalable successor of the more curated Data Warehouse approach when it comes to democratization of data. However, many who went out to build a centralized Data Lake came out with a data swamp of unclear responsibilities…
There are almost no books on data engineering that focus on concepts and problems rather than on specific technologies, but this book is one of the rare ones. It is a collection of advice, problems and solutions, or just ideas to reflect on.
Goodreads
97 Things Every Data Engineer Should Know: Collective W…
Take advantage of today's sky-high demand for data engi…
I wrote a summary where I compare Azure Synapse, Databricks and Azure Data Explorer focusing on the features that I find important.
https://medium.com/@gorros/azure-synapse-databricks-and-azure-data-explorer-kusto-73a3a0339cf2
Medium
Azure Synapse, Databricks, and Azure Data Explorer (Kusto)
Which analytical platform to choose?
HPE introduced its new unified analytics platform, HPE GreenLake.
HPE
HPE GreenLake Announcement
See why enterprises are embracing a cloud-everywhere strategy with HPE GreenLake. Join HPE President and CEO Antonio Neri for a special broadcast on April 4. Learn about the next set of cloud services launching on HPE GreenLake edge-to-cloud platform.
#pandas #vaex #dask #polars
Top 3 Alternative Python Packages for Pandas | by Cornellius Yudha Wijaya | Towards Data Science
https://towardsdatascience.com/top-3-alternative-python-packages-for-pandas-d125627ce349?gi=3e25591d0cdf
Medium
Top 3 Alternative Python Packages for Pandas
For many modern data scientists, Python is the programming language that was used for their everyday work — as a consequence, the data analysis would be done using one of the most data packages…
Forwarded from Инжиниринг Данных (Dmitry Anoshin)
SQL with Squid Games.pdf
424.7 KB
Basic SQL explained through Squid Games. A good approach: it is immediately clear to anyone who has watched the series.
The Azure Data Explorer engine is now available in Azure Synapse.
TECHCOMMUNITY.MICROSOFT.COM
Introducing Azure Synapse data explorer for log and telemetry analytics
Digital transformation is a key aspect of the new programming model of intelligent devices across the cloud. One of the key types of data that's arisen from this new application paradigm is telemetry. Telemetry data is everywhere: IoT sensors, app logs, web…
Data Analyst Certificate & Training - Grow with Google
https://grow.google/dataanalytics/#?modal_active=none
Microsoft Releases Azure Open AI Service Including Access to Powerful GPT-3 Models
https://www.infoq.com/news/2021/11/azure-openai-service-gpt3/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=AI%2C+ML+%26+Data+Engineering
InfoQ
Microsoft Releases Azure Open AI Service Including Access to Powerful GPT-3 Models
At its recent Ignite conference, Microsoft announced the new Azure OpenAI Service in preview, allowing access to OpenAI’s API through the Azure platform. This new Azure Cognitive Service will give customers access to OpenAI’s powerful GPT-3 models, along…
Introducing Amazon S3 shuffle in AWS Glue | AWS Big Data Blog
https://aws.amazon.com/blogs/big-data/introducing-amazon-s3-shuffle-in-aws-glue/
Amazon
Introducing Amazon S3 shuffle in AWS Glue | Amazon Web Services
Nov 2022: Newer version of the product is now available to be used for this post. AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning (ML), and application development.…
Implement a slowly changing dimension in Amazon Redshift | AWS Big Data Blog
https://aws.amazon.com/blogs/big-data/implement-a-slowly-changing-dimension-in-amazon-redshift/
Amazon
Implement a slowly changing dimension in Amazon Redshift | Amazon Web Services
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud. A star schema is a database organization structure optimized for use in a data warehouse. In a star schema, a dimension is a structure that categorizes the facts and measures…
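For anyone unfamiliar with the idea, here is a minimal in-memory sketch of a Type 2 slowly changing dimension in plain Python. It is not the Redshift SQL from the AWS post; the `DimRow` fields (`customer_id`, `city`, `valid_from`, `valid_to`) and the `apply_scd2` helper are hypothetical names chosen only for illustration. The idea is that a changed attribute closes the current row and opens a new one, so history is preserved.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Dict, List, Optional


@dataclass(frozen=True)
class DimRow:
    customer_id: int          # business key (hypothetical)
    city: str                 # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date]  # None means the row is the current version


def apply_scd2(dim: List[DimRow], incoming: Dict[int, str], as_of: date) -> List[DimRow]:
    """Apply incoming {customer_id: city} values to the dimension as of a date."""
    result: List[DimRow] = []
    current_keys = set()
    for row in dim:
        if row.valid_to is None:
            current_keys.add(row.customer_id)
            new_city = incoming.get(row.customer_id)
            if new_city is not None and new_city != row.city:
                # Attribute changed: close the old version, open a new one.
                result.append(replace(row, valid_to=as_of))
                result.append(DimRow(row.customer_id, new_city, as_of, None))
                continue
        # Historical rows and unchanged current rows are kept as they are.
        result.append(row)
    # Keys without a current row are inserted as brand-new versions.
    for key, city in incoming.items():
        if key not in current_keys:
            result.append(DimRow(key, city, as_of, None))
    return result


dim = [DimRow(1, "Berlin", date(2020, 1, 1), None)]
updated = apply_scd2(dim, {1: "Munich", 2: "Paris"}, date(2021, 6, 1))
for row in updated:
    print(row)
```

In a warehouse the same logic is usually expressed as an update that closes changed rows followed by an insert of the new versions; the loop above only mirrors that shape on a small list.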
Apache Spark Brings Pandas API with Version 3.2
InfoQ
Apache Spark Brings Pandas API with Version 3.2
The Apache Spark team has integrated the Pandas API in the product's latest 3.2 release. With this change, dataframe processing can be scaled to multiple clusters or multiple processors in a single machine using the PySpark execution engine.
Amazing website if you are looking for a functional programming job, and not only that. For some reason it feels like a candy store, but for programmers 😁
https://functional.works-hub.com/
Works-Hub
Jobs - December 2025 | Functional Works
Browse functional programming jobs, salaries, blogs and learning resources! Scala jobs, Haskell jobs, Clojure jobs and more.