NEW BOT Телеграм, страница

Forwarded from Machine Learning with Python

Some people asked me about a resource for learning about Transformers.

Here's a good one I am sharing again -- it covers just about everything you need to know.

brandonrohrer.com/transformers

Amazing stuff. It's totally worth your weekend.

https://news.1rj.ru/str/CodeProgrammer

👍5

1.71K views13:13

ML Research Hub

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

22 Feb 2024 · Yijia Shao, Yucheng Jiang, Theodore A. Kanell, Peter Xu, Omar Khattab, Monica S. Lam ·

We study how to apply large language models to write grounded and organized long-form articles from scratch, with comparable breadth and depth to Wikipedia pages. This underexplored problem poses new challenges at the pre-writing stage, including how to research the topic and prepare an outline prior to writing. We propose STORM, a writing system for the Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking. STORM models the pre-writing stage by (1) discovering diverse perspectives in researching the given topic, (2) simulating conversations where writers carrying different perspectives pose questions to a topic expert grounded on trusted Internet sources, (3) curating the collected information to create an outline. For evaluation, we curate FreshWiki, a dataset of recent high-quality Wikipedia articles, and formulate outline assessments to evaluate the pre-writing stage. We further gather feedback from experienced Wikipedia editors. Compared to articles generated by an outline-driven retrieval-augmented baseline, more of STORM's articles are deemed to be organized (by a 25% absolute increase) and broad in coverage (by 10%). The expert feedback also helps identify new challenges for generating grounded long articles, such as source bias transfer and over-association of unrelated facts.

Paper: https://arxiv.org/pdf/2402.14207v2.pdf

Codes:
https://github.com/assafelovic/gpt-researcher
https://github.com/stanford-oval/storm

👍2❤1

2.26K views16:04

ML Research Hub

LLM4Decompile: Decompiling Binary Code with Large Language Models

8 Mar 2024 · Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang ·

Decompilation aims to convert binary code to high-level source code, but traditional tools like Ghidra often produce results that are difficult to read and execute. Motivated by the advancements in Large Language Models (LLMs), we propose LLM4Decompile, the first and largest open-source #LLM series (1.3B to 33B) trained to decompile binary code. We optimize the LLM training process and introduce the LLM4Decompile-End models to decompile binary directly. The resulting models significantly outperform GPT-4o and Ghidra on the HumanEval and ExeBench benchmarks by over 100% in terms of re-executability rate. Additionally, we improve the standard refinement approach to fine-tune the LLM4Decompile-Ref models, enabling them to effectively refine the decompiled code from Ghidra and achieve a further 16.2% improvement over the LLM4Decompile-End. LLM4Decompile demonstrates the potential of LLMs to revolutionize binary code decompilation, delivering remarkable improvements in readability and executability while complementing conventional tools for optimal results.

Paper: https://arxiv.org/pdf/2403.05286v3.pdf

Code: https://github.com/albertan017/LLM4Decompile

❤1👍1

2.28K viewsedited 05:57

ML Research Hub

FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration

24 Jan 2025 · Kai-Tuo Xu, Feng-Long Xie, Xu Tang, Yao Hu ·

Paper: https://arxiv.org/pdf/2501.14350v1.pdf

Code: https://github.com/fireredteam/fireredasr

Datasets: LibriSpeech - AISHELL-1 - AISHELL-2 - WenetSpeech

👍2

2.7K viewsedited 07:33

ML Research Hub

⭐️ Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

🖥 Github: https://github.com/bcmi/Light-A-Video

📕 Paper: https://arxiv.org/abs/2502.08590v1

🌟 Dataset: https://paperswithcode.com/task/image-relighting

👍3

2.34K viewsedited 13:02

ML Research Hub

3b3d57df_374f_4bf3_9752_26a5010a718e.gif

56.6 MB

MedRAX: Medical Reasoning Agent for Chest X-ray

4 Feb 2025 · Adibvafa Fallahpour, Jun Ma, Alif Munim, Hongwei Lyu, Bo wang ·

paper: https://arxiv.org/pdf/2502.02673v1.pdf

Code: https://github.com/bowang-lab/medrax

❤1

2.36K viewsedited 06:39

ML Research Hub

Bayesian Sample Inference

🖥

Github: https://github.com/martenlienen/bsi

📕

Paper: https://arxiv.org/abs/2502.07580

🌟 Dataset: https://paperswithcode.com/dataset/cifar-10

Please open Telegram to view this post

VIEW IN TELEGRAM

❤1👍1

2.36K viewsedited 11:27

ML Research Hub

On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices

5 Feb 2025 · Bosung Kim, Kyuhwan Lee, Isu Jeong, Jungmin Cheon, Yeojin Lee, Seulki Lee ·

Paper:https://arxiv.org/pdf/2502.04363v1.pdf

Code: https://github.com/eai-lab/on-device-sora

👍3❤1

2.22K views12:59

ML Research Hub

Accelerating Data Processing and Benchmarking of AI Models for Pathology

10 Feb 2025 · Andrew Zhang, Guillaume Jaume, Anurag Vaidya, Tong Ding, Faisal Mahmood ·

Paper: https://arxiv.org/pdf/2502.06750v1.pdf

Codes:
https://github.com/mahmoodlab/trident
https://github.com/mahmoodlab/patho-bench

👍1

2.56K views15:51

ML Research Hub

LIMO: Less is More for Reasoning

5 Feb 2025 · Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, PengFei Liu ·

Paper: https://arxiv.org/pdf/2502.03387v1.pdf

Codes:
https://github.com/gair-nlp/limo
https://github.com/zhaoolee/garss

👍3

2.68K views06:56

ML Research Hub

follow me on X

i will send useful courses and post

https://x.com/EngSheikho

X (formerly Twitter)

Hussein Sheikho (@EngSheikho) on X

Computer Systems Engineer with a Master’s degree in Computer Engineering. Expertise in Python programming, Artificial Intelligence, and Web Development

2.13K views06:58

ML Research Hub

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

7 Feb 2025 · Shilong Zhang, Wenbo Li, Shoufa Chen, Chongjian Ge, Peize Sun, Yida Zhang, Yi Jiang, Zehuan Yuan, Binyue Peng, Ping Luo ·

Paper: https://arxiv.org/pdf/2502.05179v1.pdf

Code: https://github.com/foundationvision/flashvideo

👍2❤1

2.39K views09:38

ML Research Hub

OmniParser for Pure Vision Based GUI Agent

1 Aug 2024 · Yadong Lu, Jianwei Yang, Yelong Shen, Ahmed Awadallah

Paper: https://arxiv.org/pdf/2408.00203v1.pdf

Code: https://github.com/microsoft/omniparser

Dataset: ScreenSpot

Note: Ranked #10 on Natural Language Visual Grounding on #ScreenSpot

👍1

2.06K viewsedited 06:58

ML Research Hub

PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation

20 Jan 2025 · Jinyu Wang, Jingjing Fu, Rui Wang, Lei Song, Jiang Bian

Paper: https://arxiv.org/pdf/2501.11551v2.pdf

Code: https://github.com/microsoft/pike-rag

Datasets: HotpotQA - 2WikiMultiHopQA

👍3❤1

2.18K viewsedited 07:04

ML Research Hub

Forwarded from Machine Learning with Python

✔️

Awesome AI/ML Resources: Learn AI/ML for beginners with a roadmap and free resources.

🖥

https://github.com/armankhondker/awesome-ai-ml-resources

Please open Telegram to view this post

VIEW IN TELEGRAM

👍7

1.57K views12:21

ML Research Hub

Enhance-A-Video: Better Generated Video for Free

11 Feb 2025 · Yang Luo, Xuanlei Zhao, Mengzhao Chen, Kaipeng Zhang, Wenqi Shao, Kai Wang, Zhangyang Wang, Yang You

Paper: https://arxiv.org/pdf/2502.07508v1.pdf

Code: https://github.com/NUS-HPC-AI-Lab/Enhance-A-Video

❤1👍1

2.17K views07:17

ML Research Hub

Accelerating Data Processing and Benchmarking of AI Models for Pathology

10 Feb 2025 · Andrew Zhang, Guillaume Jaume, Anurag Vaidya, Tong Ding, Faisal Mahmood

Advances in foundation modeling have reshaped computational pathology. However, the increasing number of available models and lack of standardized benchmarks make it increasingly complex to assess their strengths, limitations, and potential for further development. To address these challenges, we introduce a new suite of software tools for whole-slide image processing, foundation model benchmarking, and curated publicly available tasks. We anticipate that these resources will promote transparency, reproducibility, and continued progress in the field.

Paper: https://arxiv.org/pdf/2502.06750v1.pdf

Codes:
https://github.com/mahmoodlab/trident
https://github.com/mahmoodlab/patho-bench

👍1

2.49K viewsedited 08:01

ML Research Hub

The Hundred-Page Language Models Book

Read it:
https://github.com/aburkov/theLMbook

👍4

2.8K views12:13

ML Research Hub

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper: https://arxiv.org/pdf/2502.10248v1.pdf

Codes:
https://github.com/phixion/phixion
https://github.com/stepfun-ai/step-video-t2v

👍3

2.34K viewsedited 06:39

ML Research Hub

Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition

🖥

Github: https://github.com/nuozimiaowu/Text4VPR

📕

Paper: https://arxiv.org/abs/2502.14195v1

🌟 Dataset: https://paperswithcode.com/task/cross-modal-place-recognition

Please open Telegram to view this post

VIEW IN TELEGRAM

👍1

2.08K viewsedited 10:23

ML Research Hub

KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG

13 Feb 2025 · Yiqian Huang, Shiqi Zhang, Xiaokui Xiao ·

Paper: https://arxiv.org/pdf/2502.09304v1.pdf

Code: https://github.com/waetr/KET-RAG

👍2

2.12K views11:08

About

Blog

Apps

Platform