NEW BOT Телеграм, страница

It looks like Databricks is up to something. First it was Iceberg and now Hudi. It is not clear if they try to converge on one open table format or just want to make sure Delta Lake becomes the one. In my opinion, Iceberg has more potential to become the one.

TechCrunch

Databricks acquires Tabular to build a common data lakehouse standard

Databricks has acquired Tabular, a data management startup, in its quest to build a common standard for data lakehouses.

❤1

1.06K views17:40

Data1984

Bufstream: Kafka at 10x lower cost - Buf
https://buf.build/blog/bufstream-kafka-lower-cost

buf.build

Bufstream: Kafka at 8x lower cost

We're excited to announce the public beta of Bufstream, a drop-in replacement for Apache Kafka deployed entirely in your own VPC that's 8x less expensive to operate.

1.03K views22:08

Data1984

Build Data Products and a Data Mesh with dbt Cloud: A tutorial from Snowflake. This is very similar to a project I am currently working on.

👍1

993 viewsedited 13:15

Data1984

Snowflake infrastructure as code.
https://github.com/Titan-Systems/titan

GitHub

GitHub - Titan-Systems/titan: Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage…

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool f...

962 viewsedited 12:24

Data1984

https://github.com/duolingo/metasearch

GitHub

GitHub - duolingo/metasearch: Search aggregator for Slack, Google Docs, GitHub, and more :mag:

Search aggregator for Slack, Google Docs, GitHub, and more :mag: - duolingo/metasearch

🤔1

854 views08:38

Data1984

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.

Medium

How do we run Kafka 100% on the object storage?

Let’s see how AutoMQ makes this dream come true.

👍3

974 viewsedited 09:03

Data1984

A nice example of a hybrid solution with Kafka
Source and detailed analysis: https://x.com/BdKozlovski/status/1842912763607142579

958 viewsedited 19:04

Data1984

https://dzone.com/storage/assets/18015609-dz-tr-data-engineering-2024.pdf

880 views14:26

Data1984

https://news.1rj.ru/str/rockinroleJobs

RockinRole - Jobs

Welcome to RockInRole Jobs, where each job post is automatically refreshed to keep opportunities current and relevant. All positions listed here are fresh, new, and ready for you to explore! 🌟

Free job post DM
@Vardan_Hayrapetyan

687 views05:33

Data1984

AWS announces Amazon Nova, a new generation of state-of-the-art foundation models (FMs) that deliver frontier intelligence and industry leading price performance, available exclusively in Amazon Bedrock.
https://aws.amazon.com/blogs/aws/introducing-amazon-nova-frontier-intelligence-and-industry-leading-price-performance/

Amazon

Introducing Amazon Nova foundation models: Frontier intelligence and industry leading price performance | Amazon Web Services

Amazon Nova foundation models deliver frontier intelligence and industry leading price-performance, with support for text and multimodal intelligence, multimodal fine-tuning, and high-quality images and videos.

597 views12:58

Data1984

Introducing queryable object metadata for Amazon S3 buckets (preview) | AWS News Blog
https://aws.amazon.com/blogs/aws/introducing-queryable-object-metadata-for-amazon-s3-buckets-preview/

Amazon

Introducing queryable object metadata for Amazon S3 buckets (preview) | Amazon Web Services

Unlock S3 data insights effortlessly with AWS' rich metadata capture; query objects by key, size, tags, and more using Athena, Redshift, and Spark at scale.

705 views13:02

Data1984

New Amazon S3 Tables: Storage optimized for analytics workloads | AWS News Blog
https://aws.amazon.com/blogs/aws/new-amazon-s3-tables-storage-optimized-for-analytics-workloads/

Amazon

New Amazon S3 Tables: Storage optimized for analytics workloads | Amazon Web Services

Amazon S3 Tables optimize tabular data storage (like transactions and sensor readings) in Apache Iceberg, enabling high-performance, low-cost queries using Athena, EMR, and Spark.

874 views13:04

Data1984

I would also add Azure Data Explorer (Kusto) to the list. However, ADX is not open-source.
https://startree.ai/resources/a-tale-of-three-real-time-olap-databases

777 views12:42

Data1984

While US markets are panicking you can try to play with DeepSeek by installing it locally or using Cursor, it is already available there.
https://dev.to/lunaticprogrammer/using-deepseek-r1-in-visual-studio-code-for-free-2279

🎉1

905 views19:45

About

Blog

Apps

Platform