[llm][model training]
https://blog.replit.com/llm-training
Replit Blog
Replit — How to train your own Large Language Models
Learn how Replit trains Large Language Models (LLMs) using Databricks, Hugging Face, and MosaicML
Introduction
Large Language Models, like OpenAI's GPT-4 or Google's PaLM, have taken the world of artificial intelligence by storm. Yet most companies don't…
[ml][book]
“Self-supervised learning, dubbed “the dark matter of intelligence”, is a promising path to advance machine learning. As opposed to supervised learning, which is limited by the availability of labeled data, self-supervised approaches can learn from vast unlabeled data [Chen et al., 2020b, Misra and Maaten, 2020]. Self-supervised learning (SSL) underpins deep learning’s success in natural language processing leading to advances from automated machine translation to large language models trained on web-scale corpora of unlabeled text [Brown et al., 2020, Popel et al., 2020]. In computer vision, SSL pushed new bounds on data size with models such as SEER trained on 1 billion images [Goyal et al., 2021]. SSL methods for computer vision have been able to match or in some cases surpass models trained on labeled data, even on highly competitive benchmarks like ImageNet [Tomasev et al., 2022, He et al., 2020a, Deng et al., 2009]. SSL has also been successfully applied across other modalities such as video, audio, and time series [Wickstrøm et al., 2022, Liu et al., 2022a, Schiappa et al., 2022a].”
https://arxiv.org/abs/2304.12210
arXiv.org
A Cookbook of Self-Supervised Learning
Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high...
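The quoted abstract's core idea is that the labels come from the data itself rather than from human annotation. A minimal PyTorch sketch of one such pretext task (masked-token prediction, the NLP case mentioned above) is below; model sizes, the masking rate, and all names are illustrative, not from the paper.

```python
# Toy self-supervised pretext task: masked-token prediction over unlabeled
# token streams. The "labels" are the original tokens at masked positions.
import torch
import torch.nn as nn

vocab_size, d_model, mask_id = 1000, 64, 0

class TinyMaskedLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True),
            num_layers=2,
        )
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):
        return self.head(self.encoder(self.embed(tokens)))

model = TinyMaskedLM()
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)

tokens = torch.randint(1, vocab_size, (8, 32))   # "unlabeled" text batch
mask = torch.rand(tokens.shape) < 0.15           # corrupt ~15% of positions
inputs = tokens.masked_fill(mask, mask_id)
logits = model(inputs)

# Loss only on masked positions: the data supervises itself.
loss = nn.functional.cross_entropy(logits[mask], tokens[mask])
loss.backward()
opt.step()
print(float(loss))
```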
[twitter][algorithm]
https://tweethunter.io/blog/twitter-algorithm-full-analysis
[platform engineering]
https://medium.com/hashicorp-engineering/platform-engineering-on-the-hashicorp-ecosystem-part-1-84fb314e833e
Medium
Platform Engineering on the HashiCorp Ecosystem— Part 1
The goal of this series is to provide a practical guide on how to facilitate a multi-tenant developer PaaS using the HashiCorp ecosystem
[llm][document index demo]
https://gpt-index.readthedocs.io/en/latest/examples/index_structs/doc_summary/DocSummary.html
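The linked notebook demonstrates a document summary index: the LLM summarizes each document at build time, and queries are routed via those summaries instead of raw chunks. A rough sketch of that pattern follows; class names and module paths shift between llama_index (gpt-index) releases, so treat the imports and the data path as assumptions and check the docs for your installed version.

```python
# Sketch of the document-summary-index pattern from the linked demo.
# Imports are version-dependent assumptions; "data/" is an illustrative path.
from llama_index import SimpleDirectoryReader
from llama_index.indices.document_summary import DocumentSummaryIndex

# Load a folder of documents.
documents = SimpleDirectoryReader("data/").load_data()

# Build the index: each document gets an LLM-generated summary, and retrieval
# matches the query against those summaries.
index = DocumentSummaryIndex.from_documents(documents)

query_engine = index.as_query_engine()
print(query_engine.query("What are the main topics covered?"))
```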
[llm]
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
“While LLMs have shown great success in understanding and generating text in traditional conversational settings, their potential for performing ill-defined complex tasks is largely under-studied. Indeed, we are yet to conduct comprehensive benchmarking studies with multiple LLMs that are exclusively focused on a complex task. However, conducting such benchmarking studies is challenging because of the large variations in LLMs' performance when different prompt types/styles are used and different degrees of detail are provided in the prompts. To address this issue, the paper proposes a general taxonomy that can be used to design prompts with specific properties in order to perform a wide range of complex tasks.”
https://arxiv.org/abs/2305.11430
❤1
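The variable the paper's taxonomy organizes is how much detail and structure a prompt carries for the same task. The sketch below is a made-up illustration of that spectrum only; the labels are not the paper's actual TELeR level definitions.

```python
# Toy illustration: one task phrased with increasing prompt detail.
# These "levels" are invented for illustration, not TELeR's actual levels.
prompts = {
    "minimal": "Summarize this pull request.",
    "directive": (
        "Summarize this pull request in 3 bullet points, "
        "covering what changed and why."
    ),
    "detailed": (
        "You are a senior reviewer. Summarize this pull request in 3 bullet "
        "points covering (1) the motivation, (2) the key code changes, and "
        "(3) risks or follow-up work. Use neutral, factual language and do "
        "not exceed 80 words."
    ),
}

for level, prompt in prompts.items():
    print(f"[{level}] {prompt}\n")
```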
[architecture][uber clone]
Juraj Majerik spent about 7 months (roughly 300 hours) building a simulated ride-sharing app (akin to Uber) as a side project, and documented each step on his blog:
https://rides.jurajmajerik.com/system-design
🔥3