Data1984

Free data engineering zoomcamp starts on January 16.

GitHub - DataTalksClub/data-engineering-zoomcamp: Data Engineering Zoomcamp is a free 9-week course on building production-ready…

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼 - DataTalksClub/data-engineering-zoomcamp

👍3

682 views edited 19:02

Data1984

A new book by Andy Grove, creator of DataFusion, about query engines. DataFusion is an extensible query planning, optimization, and execution framework, written in Rust, that uses Apache Arrow as its in-memory format.

👍1

589 views edited 19:38

Data1984

https://github.com/StarRocks/starrocks

GitHub

GitHub - StarRocks/starrocks: The world's fastest open query engine for sub-second analytics both on and off the data lakehouse.…

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class perf...

668 views 09:27

Data1984

https://youtu.be/nqa_Uyz1pBE

YouTube

Interview with a Boomer CTO

Interview with a Boomer CTO in 2023

Interview with a Boomer CTO in 2023 with Azuros Cloudapi - aired on © The CTO.

Programmer humor
SDLC humor
Requirements engineering
Systems Requirements
User acceptance testing
Cloud services
Programming jokes
tech humor…

😁4

575 views 07:18

Data1984

Managed Airflow is now available on all major clouds.

TECHCOMMUNITY.MICROSOFT.COM

Introducing 'Managed Airflow' in Azure Data Factory

Today, we are excited to announce the capability to run Apache Airflow DAGs (Directed Acyclic Graph) within Azure Data Factory, adding a key Open-Source..

❤1

535 views 12:32

Data1984

https://cloud.google.com/blog/products/data-analytics/new-blog-series-bigquery-explained-overview

Google Cloud Blog

An overview of BigQuery's architecture and how to quickly get started | Google Cloud Blog

Learn how the decoupled storage and compute architecture helps BigQuery scale seamlessly.

421 views 09:25

Data1984

https://techcommunity.microsoft.com/t5/analytics-on-azure-blog/use-azure-databricks-in-vs-code-with-the-new-databricks/ba-p/3741399

TECHCOMMUNITY.MICROSOFT.COM

Use Azure Databricks in VS Code with the new Databricks extension

This VS Code extension lets developers write code locally, leveraging the powerful authoring capabilities of IDEs while connecting to Azure Databricks to run..

357 views 12:43

Data1984

https://techcommunity.microsoft.com/t5/azure-data-explorer-blog/general-availability-adx-dashboards/ba-p/3749361

TECHCOMMUNITY.MICROSOFT.COM

General availability: ADX Dashboards

We are thrilled to announce the much-anticipated General Availability of ADX Dashboards!

379 views 12:54

Data1984

https://youtu.be/W_v05d_2RTo

YouTube

8 Key Data Structures That Power Modern Databases

Weekly system design newsletter: https://bit.ly/3tfAlYD

Check out our bestselling System Design Interview books:
Volume 1: https://amzn.to/3Ou7gkd
Volume 2: https://amzn.to/3HqGozy

LSM tree video: https://www.youtube.com/watch?v=I6jB0nM9SKU

Other things…

664 views edited 12:58

Data1984

FirstMark | 2023 MAD (ML/AI/Data) Landscape
https://mad.firstmarkcap.com/

MAD

FirstMark | 2024 MAD (ML/AI/Data) Landscape

The 2024 MAD (ML/AI/Data) Landscape is the definitive market map of companies and products in machine learning, artificial intelligence and data, compiled by FirstMark.

397 views 12:49

Data1984

https://cloud.google.com/blog/products/data-analytics/building-streaming-data-pipelines/

Google Cloud Blog

Building streaming data pipelines on Google Cloud | Google Cloud Blog

This article reviews three approaches to building a streaming data pipeline on Google Cloud, using Pub/Sub and BigQuery.

448 views 13:15

Data1984

AWS Lambdas - Python vs Rust. Performance and Cost Savings. - Confessions of a Data Guy
https://www.confessionsofadataguy.com/aws-lambdas-python-vs-rust-performance-and-cost-savings/

Confessions of a Data Guy

Save money, save money!! Hear Hear! Someone on Linkedin recently brought up the point that companies could save gobs of money by swapping out AWS Python lambdas for Rust ones. While it raised the ire of many a Python Data Engineer, I thought it sounded like…

👍2

436 views 15:38

Data1984

Guide to Partitions Calculation for Processing Data Files in Apache Spark - DZone
https://dzone.com/articles/guide-to-partitions-calculation-for-processing-dat

DZone

Guide to Partitions Calculation for Processing Data Files in Apache Spark

Get to know how Spark chooses the number of partitions implicitly while reading a set of data files into an RDD or a Dataset.

👍1

465 views 13:26

Data1984

Build a poor man’s data lake from scratch with DuckDB | Dagster Blog
https://dagster.io/blog/duckdb-data-lake

dagster.io

Build a Data Lake with DuckDB + Dagster

Use DuckDB, Python, and Dagster to build a lightweight data lake with SQL transforms and Parquet file support.

447 views 16:35

Data1984

https://aws.amazon.com/ru/blogs/publicsector/republic-of-armenias-ministry-high-tech-industry-aws-sign-memorandum-understanding-mou/

Amazon

Republic of Armenia’s Ministry of High-Tech Industry and AWS sign Memorandum of Understanding (MoU) | Amazon Web Services

The Republic of Armenia’s Ministry of High-Tech Industry and AWS have signed a Memorandum of Understanding (MoU) with the aim of modernizing the technological infrastructure of the state and accelerating the adoption of cloud services in the public and the…

👍3😁1

484 views 10:35

Data1984

Pandas 2.0 and its Ecosystem (Arrow, Polars, DuckDB) | Airbyte
https://airbyte.com/blog/pandas-2-0-ecosystem-arrow-polars-duckdb

Airbyte

Dive deeper into the power of Pandas and how leveraging it can benefit your organization. Explore a new way to work with data and unlock powerful insights!

850 views 06:05

Data1984

👍3

477 views 15:58

Data1984

Home - Apache Doris
https://doris.apache.org/

doris.apache.org

Apache Doris: Open source data warehouse for real time data analytics - Apache Doris

Apache Doris is an open-source database based on MPP architecture, with easier use and higher performance. As a modern data warehouse, Apache Doris empowers your OLAP query and database analytics.

429 views 18:25

Data1984

https://github.com/microsoft/semantic-kernel

GitHub

GitHub - microsoft/semantic-kernel: Integrate cutting-edge LLM technology quickly and easily into your apps

Integrate cutting-edge LLM technology quickly and easily into your apps - microsoft/semantic-kernel

393 views 14:31

Data1984

GitHub - MaterializeInc/datagen: Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.
https://github.com/MaterializeInc/datagen

GitHub

Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. - MaterializeInc/datagen

452 views 04:54

Data1984

Welcome - Data With Rust
https://datawithrust.com/

424 views 07:57
