VADER is a method for aligning the results of diffusion models for video generation;
VADER allows to improve various models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion using different approaches such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics and others.
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
👍5❤2
DeepInteraction & DeepInteraction++
🖥 Github: https://github.com/fudan-zvg/deepinteraction
📕 Paper: https://arxiv.org/abs/2408.05075v1
🚀 Dataset: https://paperswithcode.com/dataset/nuscenes
🚀 Dataset: https://paperswithcode.com/dataset/nuscenes
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3
Forwarded from 🐳
Exciting News, Friends!
@whale has just unveiled their new roadmap, and it’s packed with potential! According to their plans, the value of their NFT is set to rise significantly in the near future
And that’s not all! They’re also hosting an airdrop of 10 million $Whale tokens,
If you are not with them yet, now is the perfect time to join!
Please open Telegram to view this post
VIEW IN TELEGRAM
❤2👍2🏆1
Paper Name: The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
🌐 https://news.1rj.ru/str/DataScienceT ⭐️
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
Please open Telegram to view this post
VIEW IN TELEGRAM
👍2
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2
Paper: https://arxiv.org/pdf/2408.01648v1.pdf
Code: https://github.com/AngeLouCN/SAM-2_Surgical_Video
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.01648v1.pdf
Code: https://github.com/AngeLouCN/SAM-2_Surgical_Video
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
MooER: LLM-based Speech Recognition and Translation Models from Moore Threads
Paper: https://arxiv.org/pdf/2408.05101v1.pdf
Code: https://github.com/moorethreads/mooer
Datasets: https://github.com/kaldi-asr/kaldi/tree/master/egs/aishell2
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.05101v1.pdf
Code: https://github.com/moorethreads/mooer
Datasets: https://github.com/kaldi-asr/kaldi/tree/master/egs/aishell2
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling
Paper: https://arxiv.org/pdf/2408.04810v1.pdf
Code: https://github.com/facebookresearch/unibench
Datasets: https://www.cs.toronto.edu/~kriz/cifar.html
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.04810v1.pdf
Code: https://github.com/facebookresearch/unibench
Datasets: https://www.cs.toronto.edu/~kriz/cifar.html
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper: https://arxiv.org/pdf/2408.02900v1.pdf
Code: https://github.com/UCSC-VLAA/MedTrinity-25M
Datasets: https://yunfeixie233.github.io/MedTrinity-25M/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.02900v1.pdf
Code: https://github.com/UCSC-VLAA/MedTrinity-25M
Datasets: https://yunfeixie233.github.io/MedTrinity-25M/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper: https://arxiv.org/pdf/2408.08152v1.pdf
Code: https://github.com/deepseek-ai/deepseek-prover-v1.5
Datasets: https://github.com/zhangir-azerbayev/ProofNet
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08152v1.pdf
Code: https://github.com/deepseek-ai/deepseek-prover-v1.5
Datasets: https://github.com/zhangir-azerbayev/ProofNet
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective
Paper: https://arxiv.org/pdf/2408.08228v1.pdf
Code: https://github.com/zx-pan/medanomalydetection-iqa
Datasets: http://braintumorsegmentation.org/ || https://brain-development.org/ixi-dataset/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08228v1.pdf
Code: https://github.com/zx-pan/medanomalydetection-iqa
Datasets: http://braintumorsegmentation.org/ || https://brain-development.org/ixi-dataset/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
HAIR: Hypernetworks-based All-in-One Image Restoration
Paper: https://arxiv.org/pdf/2408.08091v1.pdf
Code: https://github.com/toummHus/HAIR
Datasets: https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ ||https://seungjunnah.github.io/Datasets/gopro
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08091v1.pdf
Code: https://github.com/toummHus/HAIR
Datasets: https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ ||https://seungjunnah.github.io/Datasets/gopro
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
MambaMIM: Pre-training Mamba with State Space Token-interpolation
Paper: https://arxiv.org/pdf/2408.08070v1.pdf
Code: https://github.com/fenghetan9/mambamim
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08070v1.pdf
Code: https://github.com/fenghetan9/mambamim
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
ChatGPT Telegram Bot: GPT-4o. Fast. No daily limits.
Group Chat support (/help_group_chat to get instructions) Voice message recognition Code highlighting. Write code with AI!
15 special chat modes:
👩🏼🎓 Assistant,
👩🏼💻 Code Assistant,
👩🎨 Artist,
🧠 Psychologist,
🚀 Elon Musk and other
Group Chat support (/help_group_chat to get instructions) Voice message recognition Code highlighting. Write code with AI!
15 special chat modes:
👩🏼🎓 Assistant,
👩🏼💻 Code Assistant,
👩🎨 Artist,
🧠 Psychologist,
🚀 Elon Musk and other
BadMerging: Backdoor Attacks Against Model Merging
Paper: https://arxiv.org/pdf/2408.07362v1.pdf
Code: https://github.com/jzhang538/badmerging
Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.07362v1.pdf
Code: https://github.com/jzhang538/badmerging
Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
❤1👍1
ML Research Hub
BadMerging: Backdoor Attacks Against Model Merging Paper: https://arxiv.org/pdf/2408.07362v1.pdf Code: https://github.com/jzhang538/badmerging Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT https://news.1rj.ru/str/DataScienceT ⭐️
If the articles we publish have changed your life, then kindly interact with the books using Telegram stars ⭐️
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Paper: https://arxiv.org/pdf/2408.06070v2.pdf
Code: https://github.com/dvlab-research/controlnext
Datasets: https://laion.ai/blog/laion-5b/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.06070v2.pdf
Code: https://github.com/dvlab-research/controlnext
Datasets: https://laion.ai/blog/laion-5b/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
2e8dd625_3527_4643_86b6_3b96575881a9.gif
24 MB
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper: https://arxiv.org/pdf/2407.20183v1.pdf
Code: https://github.com/internlm/mindsearch
Datasets: HotpotQA || Bamboogle
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2407.20183v1.pdf
Code: https://github.com/internlm/mindsearch
Datasets: HotpotQA || Bamboogle
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data
Paper: https://arxiv.org/pdf/2406.17148v2.pdf
Code: https://github.com/RQLuo/MixTeX
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2406.17148v2.pdf
Code: https://github.com/RQLuo/MixTeX
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper: https://arxiv.org/pdf/2408.02657v1.pdf
Code: https://github.com/alpha-vllm/lumina-mgpt || https://github.com/alpha-vllm/lumina-t2x
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.02657v1.pdf
Code: https://github.com/alpha-vllm/lumina-mgpt || https://github.com/alpha-vllm/lumina-t2x
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍2