[llm][model training]
https://blog.replit.com/llm-training
Replit Blog
Replit — How to train your own Large Language Models
Learn how Replit trains Large Language Models (LLMs) using Databricks, Hugging Face, and MosaicML
Introduction
Large Language Models, like OpenAI's GPT-4 or Google's PaLM, have taken the world of artificial intelligence by storm. Yet most companies don't…
[ml][book]
“Self-supervised learning, dubbed “the dark matter of intelligence” 1, is a promising path to advance machine learning. As opposed to supervised learning, which is limited by the availability of labeled data, self-supervised approaches can learn from vast unlabeled data [Chen et al., 2020b, Misra and Maaten, 2020]. Self-supervised learning (SSL) underpins deep learning’s success in natural language processing leading to advances from automated machine translation to large language models trained on web-scale corpora of unlabeled text [Brown et al., 2020, Popel et al., 2020]. In computer vision, SSL pushed new bounds on data size with models such as SEER trained on 1 billion images [Goyal et al., 2021]. SSL methods for computer vision have been able to match or in some cases surpass models trained on labeled data, even on highly competitive benchmarks like ImageNet [Tomasev et al., 2022, He et al., 2020a, Deng et al., 2009]. SSL has also been successfully applied across other modalities such as video, audio, and time series [Wickstrøm et al., 2022, Liu et al., 2022a, Schiappa et al., 2022a].”
https://arxiv.org/abs/2304.12210
arXiv.org
A Cookbook of Self-Supervised Learning
Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high...
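The abstract stays at the level of ideas; to make "learning from unlabeled data" concrete, here is a minimal sketch (mine, not from the cookbook) of a SimCLR-style contrastive objective. It assumes PyTorch, and the encoder/augmentation pipeline are hypothetical placeholders.
```python
# Minimal sketch of a SimCLR-style contrastive objective (NT-Xent loss).
# Assumes PyTorch; `encoder` and `aug` below are hypothetical placeholders.
import torch
import torch.nn.functional as F

def nt_xent_loss(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.5) -> torch.Tensor:
    """z1, z2: embeddings of two augmented views of the same batch, shape (N, D)."""
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # (2N, D), unit length
    sim = z @ z.t() / temperature                         # pairwise cosine similarities
    sim.fill_diagonal_(float("-inf"))                     # a view is not its own positive
    # Row i's positive is the other view of the same sample: i+N (first half) or i-N.
    targets = torch.cat([torch.arange(n) + n, torch.arange(n)]).to(z.device)
    return F.cross_entropy(sim, targets)

# Usage sketch, no labels anywhere:
#   loss = nt_xent_loss(encoder(aug(images)), encoder(aug(images)))
```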
[twitter][algorithm]
https://tweethunter.io/blog/twitter-algorithm-full-analysis
[platform engineering]
https://medium.com/hashicorp-engineering/platform-engineering-on-the-hashicorp-ecosystem-part-1-84fb314e833e
Medium
Platform Engineering on the HashiCorp Ecosystem— Part 1
The goal of this series is to provide a practical guide on how to facilitate a multi-tenant developer PaaS using the HashiCorp ecosystem
[llm][document index demo]
https://gpt-index.readthedocs.io/en/latest/examples/index_structs/doc_summary/DocSummary.html
[llm]
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
“While LLMs have shown great success in understanding and generating text in traditional conversational settings, their potential for performing ill-defined complex tasks is largely under-studied. Indeed, we are yet to conduct comprehensive benchmarking studies with multiple LLMs that are exclusively focused on a complex task. However, conducting such benchmarking studies is challenging because of the large variations in LLMs' performance when different prompt types/styles are used and different degrees of detail are provided in the prompts. To address this issue, the paper proposes a general taxonomy that can be used to design prompts with specific properties in order to perform a wide range of complex tasks.”
https://arxiv.org/abs/2305.11430
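The paper defines its own concrete prompt levels; purely as a flavour of what "different degrees of detail in the prompt" means in practice, here is a made-up illustration (the task, wording, and level names are mine, not the paper's taxonomy):
```python
# Illustrative only -- the paper's taxonomy has its own level definitions.
# These templates just show "same task, increasing prompt detail", which is
# exactly the variation that makes cross-LLM benchmarking hard to compare.
TASK = "Summarize the following meeting transcript."

PROMPT_VARIANTS = {
    "minimal": f"{TASK}\n\n{{transcript}}",
    "directive": f"{TASK} Focus on decisions made and action items.\n\n{{transcript}}",
    "detailed": (
        f"{TASK} Focus on decisions made and action items. "
        f"Answer as a bulleted list, one bullet per action item, "
        f"each with an owner and a deadline when mentioned. "
        f"Ignore small talk.\n\n{{transcript}}"
    ),
}

def render(variant: str, transcript: str) -> str:
    """Fill a template so the same model can be scored under each variant."""
    return PROMPT_VARIANTS[variant].format(transcript=transcript)
```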
[architecture][uber clone]
Juraj Majerik spent about seven months (roughly 300 hours in total) building a simulated ride-sharing app (akin to Uber) as a side project, and he describes each step on his blog:
https://rides.jurajmajerik.com/system-design
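Not from the blog series, but to give a sense of the kind of building block such a simulation needs, here is a hypothetical sketch of the simplest possible dispatch step (names and the straight-line-distance simplification are mine):
```python
# Hypothetical sketch: dispatch a ride request to the nearest idle driver.
# A real simulation (and Juraj's series) adds road distance, ETAs, queues,
# and persistence on top of something like this.
import math
from dataclasses import dataclass
from typing import Optional

@dataclass
class Driver:
    id: str
    x: float
    y: float
    busy: bool = False

def match_request(pickup: tuple, drivers: list) -> Optional[Driver]:
    """Greedy nearest-idle-driver matching on straight-line distance."""
    idle = [d for d in drivers if not d.busy]
    if not idle:
        return None  # no capacity: the rider waits or the request is rejected
    best = min(idle, key=lambda d: math.dist((d.x, d.y), pickup))
    best.busy = True
    return best

drivers = [Driver("a", 0, 0), Driver("b", 5, 5), Driver("c", 1, 2, busy=True)]
print(match_request((1, 1), drivers).id)  # -> "a"
```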
[leadership]
I found this one important enough to include here alongside the technical articles.
https://youtu.be/ljqra3BcqWM
YouTube
Extreme Ownership | Jocko Willink | TEDxUniversityofNevada
NOTE FROM TED: This talk contains a discussion of violence and warfare. We've flagged this talk because it falls outside the content guidelines TED gives TEDx organizers. TEDx events are independently organized by volunteers. The guidelines we give TEDx organizers…
[lectures][data science]
CS109A Data Science course materials @Harvard are free and open to everyone!
1. Lecture notes
2. R code, Python notebooks
3. Lab material
4. Advanced sections
https://harvard-iacs.github.io/2019-CS109A/pages/materials.html
[scaling][cadence]
https://www.uber.com/en-NL/blog/announcing-cadence/
[crdt][local-first]
Research on collaborative applications: how they behave offline and how they handle merge conflicts.
https://www.inkandswitch.com/local-first/
Inkandswitch
Local-first software: You own your data, in spite of the cloud
A new generation of collaborative software that allows users to retain ownership of their data.
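For a concrete taste of the data structures local-first software leans on, here is a textbook grow-only counter CRDT (a generic sketch, not code from the essay): replicas increment independently while offline and merge by element-wise max, so syncs converge in any order with no conflicts to resolve.
```python
# Generic textbook G-Counter (grow-only counter) CRDT, not code from the essay.
# Each replica increments its own slot while offline; merge takes an element-wise
# max, which is commutative, associative, and idempotent -- so replicas converge
# no matter how often or in what order they sync.
class GCounter:
    def __init__(self):
        self.counts = {}  # replica id -> increments observed at that replica

    def increment(self, replica, n=1):
        self.counts[replica] = self.counts.get(replica, 0) + n

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        for replica, n in other.counts.items():
            self.counts[replica] = max(self.counts.get(replica, 0), n)

# Two devices edit offline, then sync in either order and still agree:
a, b = GCounter(), GCounter()
a.increment("laptop", 2)
b.increment("phone", 3)
a.merge(b); b.merge(a)
assert a.value() == b.value() == 5
```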
[crdt][dynamic documents]
The previous article reflects where things stood in 2019; some things have changed since then.
https://www.inkandswitch.com/potluck/
Here is an example you can play with:
https://www.inkandswitch.com/potluck/demo/?openDocument=aeropress
Inkandswitch
Potluck: Dynamic documents as personal software
Gradually enriching text documents into interactive applications