NEW BOT Телеграм, страница

gonzo-обзоры ML статей

— Лояльность — в одну сторону: от работника к компании. А компания предлагает «возможности» — по настроению.

— Увольнения — теперь не из-за кризиса, а как стратегический маневр.

— Прибыль — это не повод сохранять рабочие места или переучивать сотрудников, а возможность «трансформироваться», сокращая штат.

Винить Наделлу было бы глупо. Бизнес и ничего личного. Компания меняется. Мир меняется. Письмо подает сигнал Wall Street, что несмотря на увольнения все под контролем. И это не только о Microsoft. Это предупреждение для всех в ИТ-индустрии: вы ценны только тогда, когда компания видит в вас ценность в контексте ИИ. Будет больше боли.

https://blogs.microsoft.com/blog/2025/07/24/recommitting-to-our-why-what-and-how/

**

https://fastsalttimes.com/nadella-memo/

The Official Microsoft Blog

Recommitting to our why, what, and how

Satya Nadella, Chairman and CEO, shared the below communication with Microsoft employees this morning. As we begin a new fiscal year, I’ve been reflecting on the road we’ve traveled together and the path ahead. Before anything else, I want to speak to what’s…

2🫡20👍8🤔4❤3🤬3🥱3🔥2

4.9K views10:25

gonzo-обзоры ML статей

Ещё из любопытных новостей, JetBrains разрабатывает английский для программирования

In a July 23 interview with InfoWorld, JetBrains CEO Kirill Skrygan elaborated on company plans for an as-yet-unnamed language that would describe a program at a higher level of abstraction. He reflected on how computer code originally was written in Assembler and moved to higher levels of abstraction with C and C++, then on to yet higher levels with Java and C#. “And now it’s time to move even higher,” Skrygan said. “So when we write the code, we’ll basically lay out the ontology, the object-oriented architecture, what we have in mind, or have somewhere written in design docs.” This “whole architecture program” will make AI code generation more controllable, transparent, and useful, he said.

JetBrains is exploring how to make this new language a derivative from Kotlin, but Skrygan believes the derivative should be English. “So basically, you write the design doc in English, maybe with some semantics, with some abstract paragraph, some other things which might help.” He provided the example of creating a cross-platform application that works on iPhone, Android, the web, or other platforms. “So instead of writing three applications, you write it in a special programming language, which is basically English, which describes how you want to see this application in a very specified way, and then AI agents, together with JetBrains tooling, will generate the code of all of these platforms,” Skrygan said.

https://www.infoworld.com/article/4029053/jetbrains-working-on-higher-abstraction-programming-language.html

InfoWorld

JetBrains working on higher-abstraction programming language

The as-yet-unnamed language in development would produce cross-platform applications and make AI code generation more controllable, transparent, and useful.

🔥22🥱16👀5💩3❤2😁2🤔2🤷‍♀1

8.94K views11:12

gonzo-обзоры ML статей

Продолжаю наблюдать за темой про AI scientists :)

Бонусом ссылка на интересную вакансию про open-endedness

❤12👍5😁1

7.05K views17:01

gonzo-обзоры ML статей

Слайд забыл :)

❤7🦄5

6.2K views17:03

gonzo-обзоры ML статей

И снова про AI-исследователей.

Авторы претендуют на end-to-end NAS (network architecture search), заявляют что увидели аналог хода 37 Альфаго, и обнаружили закон скейлинга — чем больше компьюта, тем линейно больше SOTA архитектур.

https://news.1rj.ru/str/gonzo_ML_podcasts/591

Нас всех отскейлят!

gonzo_ML_podcasts

AlphaGo Moment for Model Architecture Discovery
Authors: Yixiu Liu, Yang Nan, Weixian Xu, Xiangkun Hu, Lyumanshan Ye, Zhen Qin, Pengfei Liu
Paper: https://arxiv.org/abs/2507.18074
Code: https://github.com/GAIR-NLP/ASI-Arch
Model: https://gair-nlp.github.io/ASI…

🥱5🤔4❤2🔥2👍1😁1🥴1

6.69K viewsedited 10:48

gonzo-обзоры ML статей

https://news.1rj.ru/str/gonzo_ML_podcasts/594

gonzo_ML_podcasts

😁7

6.31K views10:48

gonzo-обзоры ML статей

Очень прикольная работа про subliminal learning: https://news.1rj.ru/str/gonzo_ML_podcasts/602

Из серии про природу вещей и геометрию репрезентаций. Идея в том, что при дистилляции модель-студент может выучить способности, которые напрямую ей не передаются. Например, любовь к совам через обучение числовым последовательностям.

Вроде на уровне внутренних репрезентаций и общих инициализаций всё логично, но вообще даёт богатую пищу для размышлений. Куда-то сюда же ложится тема про dataset distillation (https://news.1rj.ru/str/gonzo_ML/143), да и вообще возникают вопросы, как у людей могут появляться разные фичи без явной их передачи. Может, кстати, эффект Манделы сюда же? ;)

gonzo_ML_podcasts

Subliminal Learning: Language models transmit behavioral traits via hidden signals in data
Authors: Alex Cloud, Minh Le, James Chua, Jan Betley, Anna Sztyber-Betley, Jacob Hilton, Samuel Marks, Owain Evans
Paper: https://arxiv.org/abs/2507.14805
Site: ht…

❤16👍9

6.68K viewsedited 16:41

gonzo-обзоры ML статей

https://news.1rj.ru/str/gonzo_ML_podcasts/618

gonzo_ML_podcasts

🔥7

6.21K views16:41

gonzo-обзоры ML статей

Я, кстати, хочу подсветить, что в работе про subliminal learning в большинстве экспериментов была не logit-дистилляция, для которой всё было бы более-менее очевидно (был один эксперимент на MNIST с logit-дистилляцией), а дистилляция на уровне токенов, по сути обычный SFT, когда модель-учитель (например, закрытая GPT-4.1/mini/nano) генерит ответы на несвязанные со скрытой способностью запросы, а другая такая же модель (тоже закрытая GPT-4.1/mini/nano) на этом датасете файнтюнится.

Это добавляет находке красоты!

gonzo-обзоры ML статей

❤10🤯8👍2

5.69K views07:55

gonzo-обзоры ML статей

Прикольная работа про эволюцию промптов, которая бьёт RL — GEPA (не путать с лекуновской JEPA!)

https://news.1rj.ru/str/gonzo_ML_podcasts/619

Рефлексия на естественном языке вместо скалярных наград, эволюция только инструкций без few-shot примеров — и на редкость хороший результат. Очередной пример, когда всё больше "интеллекта" выносится на сторону LLM (как и в AlphaEvolve, например, https://news.1rj.ru/str/gonzo_ML/3624), и это работает хорошо.

gonzo_ML_podcasts

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Authors: Lakshya A Agrawal, Shangyin Tan, Dilara Soylu, Noah Ziems, Rishi Khare, Krista Opsahl-Ong, Arnav Singhvi, Herumb Shandilya, Michael J Ryan, Meng Jiang, Christopher Potts, Koushik…

😁10🔥9❤3👍1

6.05K views10:57

gonzo-обзоры ML статей

https://news.1rj.ru/str/gonzo_ML_podcasts/628

gonzo_ML_podcasts

5.58K views10:59

gonzo-обзоры ML статей

Любопытная тёрка между Лекуном и Маском про инженеров и исследователей

https://www.linkedin.com/posts/yann-lecun_there-is-a-difference-between-research-and-activity-7356606929554567169-_iT2

There is a difference between research and engineering in (1) modus operandi, (2) methodology, (3) openness, (4) evaluation criteria.

Research uses the methodology of science to discover new principles, demonstrate that they can work in practice, analyze their advantages and limitations, and interact with the wider research community to criticize, validate, reproduce, compare, and improve. The criteria are conceptual simplicity, theoretical beauty/explainability, clear performance advantage over prior art on some accepted metrics. This is true for research in academia as well as in industry.

Engineering integrates methods, often developed in a research mode, to build working systems. The philosophy is to go with the first set of methods that work well enough for the task. It generally involves a lot of tinkering, tweaking, fine-tuning, and an occasional kludge to get the performance up on a real task. Whether the method is the absolute best matters less than whether it is good enough for the tasks at hand.

Researchers are evaluated largely on intellectual impact. Research evaluation is a difficult task because the product impact may occur years (sometimes decades) after the work. For that reason, evaluation must often rely on the collective opinion of the research community through proxies such as publications, citations, invited talks, awards, etc. That's one reason research must be published.

Engineers are evaluated largely on product impact, sometimes through proxy metrics such as pull requests, lines of code, etc.

By operating in engineering mode, researchers are incentivize to do incremental work. If you make no distinction between the two activities, if you don't evaluate researchers and engineers with different criteria, you run the risk of killing breakthrough innovation. True breakthroughs require teams with a long horizon and minimal constraints from product development and management.

The industry research labs of yore that have left an indelible mark on scientific and technological progress (Bell Labs Area 11, IBM Research, Xerox PARC, etc) were all research divisions that were clearly separate from engineering divisions.

How research and engineering differ in methodology and evaluation | Yann LeCun posted on the topic | LinkedIn

There is a difference between research and engineering in (1) modus operandi, (2) methodology, (3) openness, (4) evaluation criteria.

Research uses the methodology of science to discover new principles, demonstrate that they can work in practice, analyze…

❤43🔥8

6.64K viewsedited 12:16

About

Blog

Apps

Platform