#reInvent #AWS
It seems ML is becoming standard feature everywhere.
https://aws.amazon.com/about-aws/whats-new/2020/12/aws-announces-amazon-redshift-ml-preview/
It seems ML is becoming standard feature everywhere.
https://aws.amazon.com/about-aws/whats-new/2020/12/aws-announces-amazon-redshift-ml-preview/
Amazon
AWS announces Amazon Redshift ML (preview)
#Scala #Kafka
I will leave this here just in case
https://www.confluent.io/blog/kafka-scala-tutorial-for-beginners/
I will leave this here just in case
https://www.confluent.io/blog/kafka-scala-tutorial-for-beginners/
Confluent
Apache Kafka and Scala - A Beginner’s Tutorial
Introduction to Apache Kafka and Scala. Learn about Kafka clients, how to use it in Scala, the Kafka Streams Scala module, and popular Scala integrations with code examples.
#reInvent #AWS
Data analytics and engineering related updates:
✅ Announcing Amazon Redshift data sharing (preview)
✅ Amazon EMR Studio (Preview): A new notebook-first IDE experience with Amazon EMR
✅ Amazon Redshift announces native console integration with partners (Preview)
✅ Amazon Redshift announces support for native JSON and semi-structured data processing (preview)
✅ Simplify running Apache Spark jobs with Amazon EMR on Amazon EKS
Data analytics and engineering related updates:
✅ Announcing Amazon Redshift data sharing (preview)
✅ Amazon EMR Studio (Preview): A new notebook-first IDE experience with Amazon EMR
✅ Amazon Redshift announces native console integration with partners (Preview)
✅ Amazon Redshift announces support for native JSON and semi-structured data processing (preview)
✅ Simplify running Apache Spark jobs with Amazon EMR on Amazon EKS
Amazon
Announcing Amazon Redshift data sharing (preview) | Amazon Web Services
Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that makes it simple and cost-effective to analyze all your data using standard SQL. Amazon Redshift offers up to 3x better price performance than any other cloud data warehouse.…
#AWS #reInvent #Redshift
✅ Amazon Redshift announces Automatic Table Optimization
✅ Amazon Redshift now includes Amazon RDS for MySQL and Amazon Aurora MySQL databases as new data sources for federated querying (Preview)
✅ Amazon Redshift launches RA3.xlplus nodes with managed storage
✅ Amazon Redshift announces Automatic Table Optimization
✅ Amazon Redshift now includes Amazon RDS for MySQL and Amazon Aurora MySQL databases as new data sources for federated querying (Preview)
✅ Amazon Redshift launches RA3.xlplus nodes with managed storage
Amazon
Amazon Redshift announces Automatic Table Optimization
A great review of metadata catalog evolution.
https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained
https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained
Linkedin
DataHub: Popular metadata architectures explained
#Scala
Old but good article about cake pattern for dependency injection in #Scala
https://medium.com/rahasak/scala-cake-pattern-e0cd894dae4e
Old but good article about cake pattern for dependency injection in #Scala
https://medium.com/rahasak/scala-cake-pattern-e0cd894dae4e
Medium
Scala cake pattern
Be simple, look stylish
Forwarded from Инжиниринг Данных (Dmitry Anoshin)
Lakehouse = DW + Data Lake.
Примеры lakehouse:
- Redshift + Redshift Spectrum
- Snowflake
- Databrics Delta Lake
- Azure Synapse Analytics
Попался очень интересный paper, который был только недавно опубликован основателями Databricks.
This paper argues that the data warehouse architecture as we know it today will wither in the coming years and be replaced by a new architectural pattern, the Lakehouse, which will (i) be based on open direct-access data formats, such as Apache Parquet, (ii) have first class support for machine learning and data science, and (iii) offer state-of-the-art performance. Lakehouses can help address several major challenges with data warehouses, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
Примеры lakehouse:
- Redshift + Redshift Spectrum
- Snowflake
- Databrics Delta Lake
- Azure Synapse Analytics
Попался очень интересный paper, который был только недавно опубликован основателями Databricks.
This paper argues that the data warehouse architecture as we know it today will wither in the coming years and be replaced by a new architectural pattern, the Lakehouse, which will (i) be based on open direct-access data formats, such as Apache Parquet, (ii) have first class support for machine learning and data science, and (iii) offer state-of-the-art performance. Lakehouses can help address several major challenges with data warehouses, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
Forwarded from Инжиниринг Данных (Dmitry Anoshin)
У data сообщества большие планы на dbt.
Medium
Why DBT will one day be bigger than Spark
The world of data is moving and shaking again. Ever since Hadoop came around, people were offloading workloads from their data warehouses…
Data visualization tools comparison from Dropbox.
https://dropbox.tech/application/why-we-chose-apache-superset-as-our-data-exploration-platform
https://dropbox.tech/application/why-we-chose-apache-superset-as-our-data-exploration-platform
dropbox.tech
Why we chose Apache Superset as our data exploration platform
It seems Apache Iceberg is gaining momentum. So it worths getting familiar with it.
https://www.dremio.com/data-lake/apache-iceberg/
https://www.dremio.com/data-lake/apache-iceberg/
Dremio
Apache Iceberg Guide | Dremio Resources
Explore the comprehensive Apache Iceberg guide in Dremio's resources. Learn about its features, benefits, and how to use it for data management.
#job #datascientist #dataengineer
Sophron Engineering is looking for Data Scientists and Data Engineers to join a highly
experienced international team at a respectable Insurance company and work on
interesting, long-term project.
🔷Interested?
Find the details about the job here:
🔹Data Scientist http://bit.ly/3q8fRgW
🔹Big Data Engineer https://bit.ly/2Z0JhRY
Sophron Engineering CJSC
https://sophron.ai
https://www.linkedin.com/company/sophron-engineering
All interested candidates can contact @elyapapyan or submit their CVs to
careers@sophron.co📩
Sophron Engineering is looking for Data Scientists and Data Engineers to join a highly
experienced international team at a respectable Insurance company and work on
interesting, long-term project.
🔷Interested?
Find the details about the job here:
🔹Data Scientist http://bit.ly/3q8fRgW
🔹Big Data Engineer https://bit.ly/2Z0JhRY
Sophron Engineering CJSC
https://sophron.ai
https://www.linkedin.com/company/sophron-engineering
All interested candidates can contact @elyapapyan or submit their CVs to
careers@sophron.co📩