Linkstream

hmm, guys from Stanford claim that for instruction-tuning LLaMA 7B is enough. good! waiting for the fine-tuning code 🧐

https://crfm.stanford.edu/2023/03/13/alpaca.html

185 views, 19:31

Linkstream

Forwarded from Dmytro S

A couple of months ago I was deeply impressed by this:

https://github.com/cksystemsteaching/selfie/

GitHub

GitHub - cksystemsteaching/selfie: An educational software system of a tiny self-compiling C compiler, a tiny self-executing RISC…

An educational software system of a tiny self-compiling C compiler, a tiny self-executing RISC-V emulator, and a tiny self-hosting RISC-V hypervisor. - cksystemsteaching/selfie

🤩3

229 views, 14:50

Linkstream

while the public is ranting, Bellard ships

ts_server is a web server proposing a REST API to large language models. They can be used for example for text completion, question answering, classification, chat, translation, image generation, ...
https://wz.ax/textsynth-server

229 views, edited 05:46

Linkstream

a soup of observations

https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/

lilianweng.github.io

Prompt Engineering

Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt engineering methods…

🤔1

199 views, 17:08

Linkstream

forget the narrative, forget linear time

https://generative.ink/posts/loom-interface-to-the-multiverse/

generative.ink

Loom: interface to the multiverse

Loom, a tool for generating, navigating and visualizing natural language multiverses

284 views, 23:24

Linkstream

interesting. the fact that it’s a hybrid embedding/prediction model sounds very… logical.

so you can chug along without attention just fine it seems

https://twitter.com/BlinkDL_AI/status/1638555109373378560?s=20

259 views, 09:10

Linkstream

https://arxiv.org/pdf/2303.11366.pdf

Reflexion: an autonomous agent with dynamic memory and self-reflection

233 views, 12:39

Linkstream

Map of Contemporaries:
The history of the world in famous people’s lifespans. Did you realize that Alessandro Volta was younger than Napoleon? See which famous people shared their time on Earth.
https://ybogdanov.github.io/history-timeline/

👍1

212 views, 22:39

Linkstream

https://twitter.com/_akhaliq/status/1645257919997394945
Dwarf Fortress got a serious competitor!

287 views, 20:04

Linkstream

😂 https://www.arxiv-vanity.com/papers/2304.06035/

Arxiv-Vanity

Choose Your Weapon: Survival Strategies for Depressed AI Academics

Are you an AI researcher at an academic institution? Are you anxious you are not coping with the current pace of AI advancements? Do you feel you have no (or very limited) access to the computational and human resources required for an AI research breakthrough?…

250 views, 19:25

Linkstream

https://arxiv.org/abs/2302.10866
https://github.com/HazyResearch/safari
Convolutional LLM, hmmm.
> reaching Transformer quality with a 20% reduction in training compute required at sequence length 2K. Hyena operators are twice as fast as highly optimized attention at sequence length 8K, and 100x faster at sequence length 64K.
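
The speedups quoted above come from replacing attention with long convolutions, which can be evaluated with an FFT in O(n log n) instead of O(n^2). Below is a minimal pure-Python sketch of that one primitive, not the Hyena operator itself and not code from the safari repo. The function names are made up for illustration, and the kernel is assumed to be no longer than the signal.

```python
import cmath

def fft(x, invert=False):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], invert)
    odd = fft(x[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out

def causal_conv_fft(signal, kernel):
    """y[t] = sum over s <= t of kernel[s] * signal[t - s], via FFT."""
    n = len(signal)
    size = 1
    while size < 2 * n:  # zero-pad so circular wrap-around cannot leak in
        size *= 2
    fs = fft([complex(v) for v in signal] + [0j] * (size - n))
    fk = fft([complex(v) for v in kernel] + [0j] * (size - len(kernel)))
    y = fft([a * b for a, b in zip(fs, fk)], invert=True)
    # the inverse transform above is unnormalized, so divide by size here
    return [v.real / size for v in y[:n]]

def causal_conv_direct(signal, kernel):
    """Same convolution computed the quadratic way, for comparison."""
    return [
        sum(kernel[s] * signal[t - s] for s in range(min(t + 1, len(kernel))))
        for t in range(len(signal))
    ]

signal = [1.0, 2.0, 3.0, 4.0]
kernel = [1.0, 0.5, 0.25, 0.125]
print(causal_conv_fft(signal, kernel))
print(causal_conv_direct(signal, kernel))
```

Both calls should agree up to floating-point error; only the FFT version scales gracefully to the 8K and 64K sequence lengths mentioned in the quote.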

GitHub

GitHub - HazyResearch/safari: Convolutions for Sequence Modeling

Convolutions for Sequence Modeling. Contribute to HazyResearch/safari development by creating an account on GitHub.

❤1👍1🤔1

278 views, 10:07

Linkstream

https://wz.ax/scary-hacker-stories

WIRED

The Untold Story of the Boldest Supply-Chain Hack Ever

The attackers were in thousands of corporate and government networks. They might still be there now. Behind the scenes of the SolarWinds investigation.

🔥1

243 views, 20:03

Linkstream

Unlimiformer: Long-Range Transformers with Unlimited Length Input
Unlimiformer improves pretrained models such as BART and Longformer by extending them to unlimited-length inputs via kNN search, without additional learned weights and without modifying their code.
https://arxiv.org/abs/2305.01625
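
A rough sketch of the kNN idea, as I understand it from the summary above: instead of attending over every encoder state, fetch only the top-k states that score highest against the query and run softmax over those. This is a toy under stated assumptions, not the paper's implementation. `knn_attention` is a hypothetical name, and a brute-force sort stands in for a real kNN index.

```python
import math

def knn_attention(query, keys, values, k):
    """Attend over only the k keys with the highest dot product to query."""
    scores = [sum(q * x for q, x in zip(query, key)) for key in keys]
    # brute-force nearest-neighbour search; a real system would use an index
    top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    m = max(scores[i] for i in top)  # subtract the max for numerical stability
    weights = [math.exp(scores[i] - m) for i in top]
    z = sum(weights)
    dim = len(values[0])
    out = [
        sum(w * values[i][d] for w, i in zip(weights, top)) / z
        for d in range(dim)
    ]
    return top, out

keys = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
values = [[1.0], [2.0], [3.0], [4.0]]
top, out = knn_attention([1.0, 0.0], keys, values, k=2)
print(top, out)
```

With k equal to the number of keys this reduces to ordinary softmax attention, which is why the trick needs no new weights.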

260 views, 18:09

Linkstream

Insect–Machine Interface Based Neurocybernetics
Spy bugs! 2009!
https://wz.ax/cybugs09

😁1

271 views, edited 09:07

Linkstream

https://arxiv.org/abs/2305.07759
TinyStories: 3-30M (million, not billion) parameter models that produce coherent English, trained on a curated dataset.

Don't expect it to code, but I'm curious whether this is usable as a LoRA or similar baseline. Also need to look closer at their tokenizer setup; it must be way different from GPT's.

267 views, 20:45

Linkstream

https://twitter.com/Tim_Dettmers/status/1666076553665744896?s=20

X (formerly Twitter)

Tim Dettmers (@Tim_Dettmers) on X

We present SpQR, which allows lossless LLM inference at 4.75 bits with a 15% speedup. You can run a 33B LLM on a single 24GB GPU fully lossless. SpQR works by isolating sensitive weights with higher precision and roughly doubles improvements from GPTQ: h…
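
To make "isolating sensitive weights with higher precision" concrete, here is a toy sketch: keep a few weights in full precision and quantize the rest to a low-bit uniform grid. Loud assumption: it picks the largest-magnitude weights as the "sensitive" ones, which is only a stand-in for whatever criterion SpQR actually uses, and the function names are hypothetical.

```python
def quantize_with_outliers(weights, bits=3, outlier_count=1):
    """Uniformly quantize weights, storing outlier_count of them exactly."""
    by_magnitude = sorted(
        range(len(weights)), key=lambda i: abs(weights[i]), reverse=True
    )
    # assumption: magnitude as a proxy for sensitivity
    outliers = {i: weights[i] for i in by_magnitude[:outlier_count]}
    dense = [w for i, w in enumerate(weights) if i not in outliers]
    lo, hi = min(dense), max(dense)
    levels = 2 ** bits - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [
        0 if i in outliers else round((w - lo) / scale)
        for i, w in enumerate(weights)
    ]
    return codes, lo, scale, outliers

def dequantize(codes, lo, scale, outliers):
    """Rebuild weights: exact values for outliers, grid values for the rest."""
    return [outliers.get(i, lo + c * scale) for i, c in enumerate(codes)]

def max_error(weights, **kwargs):
    restored = dequantize(*quantize_with_outliers(weights, **kwargs))
    return max(abs(a - b) for a, b in zip(weights, restored))

weights = [0.1, -0.2, 0.05, 8.0, 0.15, -0.1, 0.0, 0.2]
print(max_error(weights, outlier_count=0))  # one big weight stretches the grid
print(max_error(weights, outlier_count=1))  # much smaller once it is isolated
```

The single large weight dominates the quantization range, so pulling it out lets the remaining weights share a much finer grid.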

286 views, 18:45

Linkstream

phi-1: with higher-quality datasets, a model with 1.3B parameters trained on 7B tokens can be quite competitive with GPT-4 and other 100x larger models on coding tasks
https://arxiv.org/abs/2306.11644

254 views, edited 03:32

Linkstream

https://www.youtube.com/watch?v=1GanrexRyVY oh wow

YouTube

How I Landed A Model Rocket

Finally landed it!

0:00 Rocket Flight and Intro
1:44 Rocket Design Philosophy
3:09 Avionics Development
4:30 Physical Design
6:15 MATLAB Simulation
9:54 Software

Instagram: https://www.instagram.com/ttb_aerospace/
Website: https://www.ttbaerospace.com/…

❤2👍1🏆1

243 views, 18:00

Linkstream

Welcome to the dark side of cyberpunk.
Once-theoretical timing attacks are now a reality.
(TLDR: Cops still can't decrypt the messages, but they can track who's chatting with whom by comparing the small traffic spikes as each message gets delivered)
https://wz.ax/timing-is-real

NY Times

Cracking Down on Dissent, Russia Seeds a Surveillance Supply Chain

Russia is incubating a cottage industry of new digital surveillance tools to suppress domestic opposition to the war in Ukraine. The tech may also be sold overseas.

🤯2

323 views, 17:07

Linkstream

why do they call this an ‘attack’? this is the way to set the model free!

(TLDR: DAN prompt generator)
https://arxiv.org/abs/2307.15043
https://llm-attacks.org

450 views, edited 06:32

Linkstream

Masked Trajectory Models for Prediction, Representation, and Control

TLDR: Transformers using state space and action embeddings as tokens are better at RL than, um, RL algorithms. Oops.

https://arxiv.org/abs/2305.02968

341 views, 19:04
