با سلام خدمت همه دوستان
رویداد رایان با هدایت دکتر رهبان به زودی برگزار خواهد شد. تیم علمی رایان به دنبال گسترش و همکاری با دانشجویان گرامی علاقهمند میباشد. مسولیتهای اصلی اعضای علمی شامل هندل کردن دوره آموزشی (e.g. تولید محتوا/تمرین) و همچنین طراحی چالش (e.g. پیاده سازی ایده ها و شرکت در طراحی مسئله) می باشد. مزایای عضویت در تیم علمی شامل دریافت حقوق و ریکام (در صورت تایید هد تیم علمی) میباشد. شما عزیزان میتوانید با پرکردن این فرم علاقهمندی خود را برای عضویت در تیم علمی اعلام بفرمایید. شایان ذکر است که از اعضای تیم انتظار میرود که هفتهای حداقل ۱۵ ساعت زمان به رویداد اختصاص بدهند. مهلت پرکردن این فرم تا فردا خواهد بود.
رویداد رایان با هدایت دکتر رهبان به زودی برگزار خواهد شد. تیم علمی رایان به دنبال گسترش و همکاری با دانشجویان گرامی علاقهمند میباشد. مسولیتهای اصلی اعضای علمی شامل هندل کردن دوره آموزشی (e.g. تولید محتوا/تمرین) و همچنین طراحی چالش (e.g. پیاده سازی ایده ها و شرکت در طراحی مسئله) می باشد. مزایای عضویت در تیم علمی شامل دریافت حقوق و ریکام (در صورت تایید هد تیم علمی) میباشد. شما عزیزان میتوانید با پرکردن این فرم علاقهمندی خود را برای عضویت در تیم علمی اعلام بفرمایید. شایان ذکر است که از اعضای تیم انتظار میرود که هفتهای حداقل ۱۵ ساعت زمان به رویداد اختصاص بدهند. مهلت پرکردن این فرم تا فردا خواهد بود.
#اخبار_پژوهشی_آزمایشگاه
مقالات برتر چاپ شده از آغاز سال ۲۰۲۳ تحت نظارت آقای دکتر رهبان
Fake It Until You Make It: Towards Accurate Near-Distribution Novelty Detection
آقای حسین میرزایی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ICLR
Lagrangian objective function leads to improved unforeseen attack generalization
آقای محمد عزیزملایری دانشجوی دکترا آزمایشگاه RIML - چاپ شده در ژورنال Machine Learning
Compositions and methods for treating proliferative diseases
US Patent App.
Zerograd: Costless conscious remedies for catastrophic overfitting in the fgsm adversarial training
خانم زینب گلگونی دانشجوی دکترا آزمایشگاه RIML - چاپ شده در ژورنال Intelligent Systems with Applications
A deep learning framework to scale linear facial measurements to actual size using horizontal visible iris diameter: a study on an Iranian population
آقای دکتر حسین محمدرحیمی محقق در آزمایشگاه RIML - چاپ شده در ژورنال Scientific Reports
Weakly-Supervised Drug Efficiency Estimation with Confidence Score: Application to COVID-19 Drug Discovery
خانم نهال میرزایی و آقای محمد ولیثانیان دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس MICCAI
Forecasting influenza hemagglutinin mutations through the lens of anomaly detection
آقای محمدرضا صالحی دانشجوی اسبق آزمایشگاه RIML - چاپ شده در ژورنال Scientific Reports
Borderless azerbaijani processing: Linguistic resources and a transformer-based approach for azerbaijani transliteration
خانم ریحانه زهرابی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ACL - تحت نظر دکتر بیگی، دکتر عسگری و دکتر رهبان
Examination of lemon bruising using different CNN-based classifiers and local spectral-spatial hyperspectral imaging
آقای دکتر سجاد سبزی پسادکترا آزمایشگاه RIML - چاپ شده در ژورنال Algorithms
A Robust Heterogeneous Offloading Setup Using Adversarial Training
آقای مهدی امیری دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در ژورنال IEEE Transactions on Mobile Computing - تحت نظر دکتر رهبان و دکتر حسابی
Universal Novelty Detection Through Adaptive Contrastive Learning
آقای حسین میرزایی و آقای مجتبی نافذ دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس CVPR
Killing It With Zero-Shot: Adversarially Robust Novelty Detection
آقای حسین میرزایی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس IEEE ICASSP
User Voices, Platform Choices: Social Media Policy Puzzle with Decentralization Salt
جمعی از دانشجویان کارشناسی - چاپ شده در کنفرانس CHI
Comparison of 2D and 3D convolutional neural networks in hyperspectral image analysis of fruits applied to orange bruise detection
آقای دکتر سجاد سبزی پسادکترا و خانم ریحانه زهرابی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در Journal of Food Science
RODEO: Robust Outlier Detection via Exposing Adaptive Outliers
آقای حسین میرزایی و آقای مجتبی نافذ دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ICML
Virtual screening for small-molecule pathway regulators by image-profile matching
آقای دکتر رهبان و خانم دکتر Anne E. Carpenter - چاپ شده در ژورنال Cell systems
Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?
Coming Soon! :)
و دهها مقاله دیگر که در Google Scholar دکتر رهبان میتوانید مشاهده کنید.
تبریک خدمت تمامی اعضای آزمایشگاه به دلیل تلاش، کوشش و پژوهش در جهت رفع مشکلات جامعه و کشور و چاپ مقالات در برترین کنفرانسها و مجلات AI
مقالات برتر چاپ شده از آغاز سال ۲۰۲۳ تحت نظارت آقای دکتر رهبان
Fake It Until You Make It: Towards Accurate Near-Distribution Novelty Detection
آقای حسین میرزایی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ICLR
Lagrangian objective function leads to improved unforeseen attack generalization
آقای محمد عزیزملایری دانشجوی دکترا آزمایشگاه RIML - چاپ شده در ژورنال Machine Learning
Compositions and methods for treating proliferative diseases
US Patent App.
Zerograd: Costless conscious remedies for catastrophic overfitting in the fgsm adversarial training
خانم زینب گلگونی دانشجوی دکترا آزمایشگاه RIML - چاپ شده در ژورنال Intelligent Systems with Applications
A deep learning framework to scale linear facial measurements to actual size using horizontal visible iris diameter: a study on an Iranian population
آقای دکتر حسین محمدرحیمی محقق در آزمایشگاه RIML - چاپ شده در ژورنال Scientific Reports
Weakly-Supervised Drug Efficiency Estimation with Confidence Score: Application to COVID-19 Drug Discovery
خانم نهال میرزایی و آقای محمد ولیثانیان دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس MICCAI
Forecasting influenza hemagglutinin mutations through the lens of anomaly detection
آقای محمدرضا صالحی دانشجوی اسبق آزمایشگاه RIML - چاپ شده در ژورنال Scientific Reports
Borderless azerbaijani processing: Linguistic resources and a transformer-based approach for azerbaijani transliteration
خانم ریحانه زهرابی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ACL - تحت نظر دکتر بیگی، دکتر عسگری و دکتر رهبان
Examination of lemon bruising using different CNN-based classifiers and local spectral-spatial hyperspectral imaging
آقای دکتر سجاد سبزی پسادکترا آزمایشگاه RIML - چاپ شده در ژورنال Algorithms
A Robust Heterogeneous Offloading Setup Using Adversarial Training
آقای مهدی امیری دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در ژورنال IEEE Transactions on Mobile Computing - تحت نظر دکتر رهبان و دکتر حسابی
Universal Novelty Detection Through Adaptive Contrastive Learning
آقای حسین میرزایی و آقای مجتبی نافذ دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس CVPR
Killing It With Zero-Shot: Adversarially Robust Novelty Detection
آقای حسین میرزایی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس IEEE ICASSP
User Voices, Platform Choices: Social Media Policy Puzzle with Decentralization Salt
جمعی از دانشجویان کارشناسی - چاپ شده در کنفرانس CHI
Comparison of 2D and 3D convolutional neural networks in hyperspectral image analysis of fruits applied to orange bruise detection
آقای دکتر سجاد سبزی پسادکترا و خانم ریحانه زهرابی دانشجوی کارشناسیارشد آزمایشگاه RIML - چاپ شده در Journal of Food Science
RODEO: Robust Outlier Detection via Exposing Adaptive Outliers
آقای حسین میرزایی و آقای مجتبی نافذ دانشجویان کارشناسیارشد آزمایشگاه RIML - چاپ شده در کنفرانس ICML
Virtual screening for small-molecule pathway regulators by image-profile matching
آقای دکتر رهبان و خانم دکتر Anne E. Carpenter - چاپ شده در ژورنال Cell systems
Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?
Coming Soon! :)
و دهها مقاله دیگر که در Google Scholar دکتر رهبان میتوانید مشاهده کنید.
تبریک خدمت تمامی اعضای آزمایشگاه به دلیل تلاش، کوشش و پژوهش در جهت رفع مشکلات جامعه و کشور و چاپ مقالات در برترین کنفرانسها و مجلات AI
Project Denoscription:
This project is a collaborative effort between Dr. Rohban, Dr. Soleymani, and Dr. Asgari. Together, we aim to push the boundaries of language model evaluation for the Persian language. In this project, our primary goal is to benchmark and develop innovative methods for evaluating language models on the Persian language both robustly and comprehensively. Our approach will encompass both static and dynamic assessments to ensure thorough analysis. This initiative seeks to advance the field by addressing unique challenges posed by Persian language processing.
For more in-depth insights, please refer to the following papers:
"Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?"
Requirments:
Familiarity with LLM Concepts: Understanding the fundamentals and advancements in large language models.
Deep Learning Expertise: Practical knowledge and experience in deep learning techniques.
PyTorch Proficiency: Hands-on experience with the PyTorch framework is essential.
Commitment: Ability to dedicate significant time and maintain consistency throughout the project.
To apply for this position, please read the suggested papers and send your resume along with a brief summary of your research interests to mvs2667@gmail.com. We are eager to hear from motivated individuals who are passionate about advancing language model evaluation.
For any inquiries, feel free to reach out to us via the above email.
#open_position
#research_application
This project is a collaborative effort between Dr. Rohban, Dr. Soleymani, and Dr. Asgari. Together, we aim to push the boundaries of language model evaluation for the Persian language. In this project, our primary goal is to benchmark and develop innovative methods for evaluating language models on the Persian language both robustly and comprehensively. Our approach will encompass both static and dynamic assessments to ensure thorough analysis. This initiative seeks to advance the field by addressing unique challenges posed by Persian language processing.
For more in-depth insights, please refer to the following papers:
"Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?"
Requirments:
Familiarity with LLM Concepts: Understanding the fundamentals and advancements in large language models.
Deep Learning Expertise: Practical knowledge and experience in deep learning techniques.
PyTorch Proficiency: Hands-on experience with the PyTorch framework is essential.
Commitment: Ability to dedicate significant time and maintain consistency throughout the project.
To apply for this position, please read the suggested papers and send your resume along with a brief summary of your research interests to mvs2667@gmail.com. We are eager to hear from motivated individuals who are passionate about advancing language model evaluation.
For any inquiries, feel free to reach out to us via the above email.
#open_position
#research_application
#اخبار_پژوهشی_آزمایشگاه
چاپ ۲ مقاله همزمان در کنفرانس ECCV 2024 تحت نظارت دکتر رهبان
1. Snuffy: Efficient Universal Approximating Whole Slide Image Classification Framework
تبریک به آقای حسین جعفرینیا دانشجوی کارشناسیارشد و خانم نهال میرزایی دانشجوی دکترای آزمایشگاه RIML و علیرضا عالیپناه، دانیال حمدی و سعید رضوی دانشجویان کارشناسی آزمایشگاه
2. Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models
تبریک به آقای رضا عباسی دانشجوی کارشناسیارشد آزمایشگاه RIML
تحت نظارت دکتر رهبان و دکتر سلیمانی
https://www.linkedin.com/posts/mohammad-hossein-rohban-75567677_eccv-activity-7216325744702992386-cQVq?utm_source=share&utm_medium=member_android
چاپ ۲ مقاله همزمان در کنفرانس ECCV 2024 تحت نظارت دکتر رهبان
1. Snuffy: Efficient Universal Approximating Whole Slide Image Classification Framework
تبریک به آقای حسین جعفرینیا دانشجوی کارشناسیارشد و خانم نهال میرزایی دانشجوی دکترای آزمایشگاه RIML و علیرضا عالیپناه، دانیال حمدی و سعید رضوی دانشجویان کارشناسی آزمایشگاه
2. Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models
تبریک به آقای رضا عباسی دانشجوی کارشناسیارشد آزمایشگاه RIML
تحت نظارت دکتر رهبان و دکتر سلیمانی
https://www.linkedin.com/posts/mohammad-hossein-rohban-75567677_eccv-activity-7216325744702992386-cQVq?utm_source=share&utm_medium=member_android
Forwarded from Rayan AI Course
Please open Telegram to view this post
VIEW IN TELEGRAM
RL Research Group at RIML: Join Us!
We are excited to announce the formation of a new research group within RIML, dedicated to advancing the field of Reinforcement Learning. If you're passionate about AI and eager to explore the forefront of RL research, this is the perfect opportunity for you.
To foster collaboration and knowledge sharing, we are launching a weekly Journal Club. Every Tuesday from 3:30 to 5:00 PM, we will gather to discuss the latest research papers and breakthroughs in RL. This is a fantastic chance to deepen your understanding, engage in stimulating discussions, and contribute to the growing body of knowledge in this dynamic field.
This Week's Presentation Paper:
Title: On Representation Complexity of Model-based and Model-free Reinforcement Learning
Link: https://arxiv.org/abs/2310.01706
Presenter: Alireza Nobakht
Join us as we delve into the complexities of model-based versus model-free RL approaches.
Session Details:
Time: Tuesdays, 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Stay updated and learn more about our activities by visiting our blog: rljclub.github.io.
We look forward to seeing you and embarking on this exciting research journey together!
#RLJClub #JClub #RIML #SUT #AI #RL
We are excited to announce the formation of a new research group within RIML, dedicated to advancing the field of Reinforcement Learning. If you're passionate about AI and eager to explore the forefront of RL research, this is the perfect opportunity for you.
To foster collaboration and knowledge sharing, we are launching a weekly Journal Club. Every Tuesday from 3:30 to 5:00 PM, we will gather to discuss the latest research papers and breakthroughs in RL. This is a fantastic chance to deepen your understanding, engage in stimulating discussions, and contribute to the growing body of knowledge in this dynamic field.
This Week's Presentation Paper:
Title: On Representation Complexity of Model-based and Model-free Reinforcement Learning
Link: https://arxiv.org/abs/2310.01706
Presenter: Alireza Nobakht
Join us as we delve into the complexities of model-based versus model-free RL approaches.
Session Details:
Time: Tuesdays, 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Stay updated and learn more about our activities by visiting our blog: rljclub.github.io.
We look forward to seeing you and embarking on this exciting research journey together!
#RLJClub #JClub #RIML #SUT #AI #RL
arXiv.org
On Representation Complexity of Model-based and Model-free...
We study the representation complexity of model-based and model-free reinforcement learning (RL) in the context of circuit complexity. We prove theoretically that there exists a broad class of...
RL Journal Club: This Week's Session
We are pleased to invite you to this week's RL Journal Club session, where we will dive into another fascinating paper in the field of Reinforcement Learning.
Paper for Discussion:
Title: Three Dogmas of Reinforcement Learning
Link: https://arxiv.org/abs/2407.10583
Join us as we explore and critically analyze the insights presented in this paper. This session promises to be a thought-provoking discussion, providing an opportunity to deepen your understanding of the fundamental concepts and challenges in RL.
Session Details:
Date: Tuesday
Time: 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
We are pleased to invite you to this week's RL Journal Club session, where we will dive into another fascinating paper in the field of Reinforcement Learning.
Paper for Discussion:
Title: Three Dogmas of Reinforcement Learning
Link: https://arxiv.org/abs/2407.10583
Join us as we explore and critically analyze the insights presented in this paper. This session promises to be a thought-provoking discussion, providing an opportunity to deepen your understanding of the fundamental concepts and challenges in RL.
Session Details:
Date: Tuesday
Time: 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
arXiv.org
Three Dogmas of Reinforcement Learning
Modern reinforcement learning has been conditioned by at least three dogmas. The first is the environment spotlight, which refers to our tendency to focus on modeling environments rather than...
RL Journal Club: This Week's Session
We are excited to invite you to this week's RL Journal Club session, where we will explore an influential paper in the field of Reinforcement Learning. The session will be presented by our professor, Mohammad Hossein Rohban.
This Week's Presentation Paper:
Title: MOReL: Model-Based Offline Reinforcement Learning
Link: https://arxiv.org/abs/2005.05951
Presenter: Professor Mohammad Hossein Rohban
In this session, we will discuss the MOReL framework, which introduces a model-based approach to offline reinforcement learning, aiming to improve the data efficiency and experimental velocity of RL. The paper explores how a pessimistic MDP can be used to safely and effectively train policies using only historical data, offering a fresh perspective on offline RL.
Session Details:
Date: Tuesday
Time: 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
We are excited to invite you to this week's RL Journal Club session, where we will explore an influential paper in the field of Reinforcement Learning. The session will be presented by our professor, Mohammad Hossein Rohban.
This Week's Presentation Paper:
Title: MOReL: Model-Based Offline Reinforcement Learning
Link: https://arxiv.org/abs/2005.05951
Presenter: Professor Mohammad Hossein Rohban
In this session, we will discuss the MOReL framework, which introduces a model-based approach to offline reinforcement learning, aiming to improve the data efficiency and experimental velocity of RL. The paper explores how a pessimistic MDP can be used to safely and effectively train policies using only historical data, offering a fresh perspective on offline RL.
Session Details:
Date: Tuesday
Time: 3:30 - 5:00 PM
Location: Online at https://vc.sharif.edu/ch/rohban
For in-person attendance, please message me on Telegram at @infinity2357.
Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
arXiv.org
MOReL : Model-Based Offline Reinforcement Learning
In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based solely on a dataset of historical interactions with the environment. The ability to train RL policies...
🧠 RL Journal Club: This Week's Session
🤝 We invite you to join us for this week's RL Journal Club session, where we will dive into the fascinating world of Modular Reinforcement Learning. This session will explore the concept of modular RL, an approach that decomposes complex RL tasks into specialized components to enhance scalability and adaptability.
✅ This Week's Presentation:
🔹 Title: Modular Reinforcement Learning
🔸 Presenter: Arash Marioriyad
🌀 Abstract: Modular RL is an approach that emphasizes the decomposition of complex RL-based learning tasks into modular components. This methodology addresses the scalability and adaptability challenges inherent in traditional reinforcement learning by structuring agents as collections of interacting modules, each specialized for specific sub-tasks or aspects of the environment.
The presentation will be based on the following papers:
▪️ Modular Lifelong Reinforcement Learning via Neural Composition (https://arxiv.org/abs/2207.00429)
▪️ Compete and Compose: Learning Independent Mechanisms for Modular World Models (https://arxiv.org/abs/2404.15109)
▪️ Multi-Task Reinforcement Learning with Soft Modularization (https://arxiv.org/abs/2003.13661)
▪️ Modular Multitask Reinforcement Learning with Policy Sketches (https://arxiv.org/abs/1611.01796)
▪️ Recurrent Independent Mechanisms (https://arxiv.org/abs/1909.10893)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:00 - 4:30 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @infinity2357.
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an engaging exploration of how modular approaches can transform the scalability and efficiency of reinforcement learning systems. Don't miss this opportunity to deepen your understanding and participate in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
🤝 We invite you to join us for this week's RL Journal Club session, where we will dive into the fascinating world of Modular Reinforcement Learning. This session will explore the concept of modular RL, an approach that decomposes complex RL tasks into specialized components to enhance scalability and adaptability.
✅ This Week's Presentation:
🔹 Title: Modular Reinforcement Learning
🔸 Presenter: Arash Marioriyad
🌀 Abstract: Modular RL is an approach that emphasizes the decomposition of complex RL-based learning tasks into modular components. This methodology addresses the scalability and adaptability challenges inherent in traditional reinforcement learning by structuring agents as collections of interacting modules, each specialized for specific sub-tasks or aspects of the environment.
The presentation will be based on the following papers:
▪️ Modular Lifelong Reinforcement Learning via Neural Composition (https://arxiv.org/abs/2207.00429)
▪️ Compete and Compose: Learning Independent Mechanisms for Modular World Models (https://arxiv.org/abs/2404.15109)
▪️ Multi-Task Reinforcement Learning with Soft Modularization (https://arxiv.org/abs/2003.13661)
▪️ Modular Multitask Reinforcement Learning with Policy Sketches (https://arxiv.org/abs/1611.01796)
▪️ Recurrent Independent Mechanisms (https://arxiv.org/abs/1909.10893)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:00 - 4:30 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @infinity2357.
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an engaging exploration of how modular approaches can transform the scalability and efficiency of reinforcement learning systems. Don't miss this opportunity to deepen your understanding and participate in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
arXiv.org
Modular Lifelong Reinforcement Learning via Neural Composition
Humans commonly solve complex problems by decomposing them into easier subproblems and then combining the subproblem solutions. This type of compositional reasoning permits reuse of the subproblem...
🧠 RL Journal Club: This Week's Session
🤝 We invite you to join us for this week's RL Journal Club session, where we will explore the intriguing synergies between Reinforcement Learning (RL) and Large Language Models (LLMs). This session will delve into how these two powerful fields intersect, offering new perspectives and opportunities for advancement in AI research.
✅ This Week's Presentation:
🔹 Title: Synergies Between RL and LLMs
🔸 Presenter: Moein Salimi
🌀 Abstract: In this presentation, we will review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two domains that have been significantly propelled by deep neural networks. The discussion will center around a novel taxonomy proposed in the paper, categorizing the interaction between RL and LLMs into three main classes: RL4LLM, where RL enhances LLM performance in NLP tasks; LLM4RL, where LLMs assist in training RL models for non-NLP tasks; and RL+LLM, where both models work together within a shared planning framework. The presentation will explore the motivations behind these synergies, their successes, potential challenges, and avenues for future research.
The presentation will be based on the following paper:
▪️ The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models (https://arxiv.org/abs/2402.01874)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:30 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @infinity2357
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an enlightening exploration of how RL and LLMs can work together to push the boundaries of AI research. Don’t miss this opportunity to deepen your understanding and engage in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL #LLM
🤝 We invite you to join us for this week's RL Journal Club session, where we will explore the intriguing synergies between Reinforcement Learning (RL) and Large Language Models (LLMs). This session will delve into how these two powerful fields intersect, offering new perspectives and opportunities for advancement in AI research.
✅ This Week's Presentation:
🔹 Title: Synergies Between RL and LLMs
🔸 Presenter: Moein Salimi
🌀 Abstract: In this presentation, we will review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two domains that have been significantly propelled by deep neural networks. The discussion will center around a novel taxonomy proposed in the paper, categorizing the interaction between RL and LLMs into three main classes: RL4LLM, where RL enhances LLM performance in NLP tasks; LLM4RL, where LLMs assist in training RL models for non-NLP tasks; and RL+LLM, where both models work together within a shared planning framework. The presentation will explore the motivations behind these synergies, their successes, potential challenges, and avenues for future research.
The presentation will be based on the following paper:
▪️ The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models (https://arxiv.org/abs/2402.01874)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:30 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @infinity2357
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an enlightening exploration of how RL and LLMs can work together to push the boundaries of AI research. Don’t miss this opportunity to deepen your understanding and engage in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL #LLM
arXiv.org
The RL/LLM Taxonomy Tree: Reviewing Synergies Between...
In this work, we review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two areas that owe their momentum to the development of deep neural networks. We...
RIML Lab
🧠 RL Journal Club: This Week's Session 🤝 We invite you to join us for this week's RL Journal Club session, where we will explore the intriguing synergies between Reinforcement Learning (RL) and Large Language Models (LLMs). This session will delve into how…
به دلیل قطعی برق دانشگاه، این جلسه به هفته آینده موکول شد.
Compositional Learning Journal Club at RIML Lab 🔥
We are pleased to announce the establishment of a new research group within RIML Lab, dedicated to the study and advancement of Compositional Learning.
Compositional learning is inspired by the inherent human ability to comprehend and generate complex ideas from simpler concepts. By enabling the recombination of learned components, compositional learning enhances a machine's ability to generalize to out-of-distribution samples encountered in real-world scenarios. This characteristic has spurred vibrant research in areas such as object-centric learning, compositional generalization, and compositional reasoning, with wide-ranging applications across various tasks, including controllable text generation, factual knowledge reasoning, image captioning, text-to-image generation, visual reasoning, speech processing, and reinforcement learning.
To promote collaboration and the exchange of knowledge, we are launching a weekly Journal Club. These sessions will be held every Sunday from 3:30 PM to 5:00 PM, where we will engage in discussions on the latest research papers and significant advancements in Compositional Learning.
For updates and additional information, please visit our blog: complearnjc.github.io.
For in-person communication, you may contact us via Telegram at @amirkasaei and @arashmarioriyad.
We look forward to your participation.
We are pleased to announce the establishment of a new research group within RIML Lab, dedicated to the study and advancement of Compositional Learning.
Compositional learning is inspired by the inherent human ability to comprehend and generate complex ideas from simpler concepts. By enabling the recombination of learned components, compositional learning enhances a machine's ability to generalize to out-of-distribution samples encountered in real-world scenarios. This characteristic has spurred vibrant research in areas such as object-centric learning, compositional generalization, and compositional reasoning, with wide-ranging applications across various tasks, including controllable text generation, factual knowledge reasoning, image captioning, text-to-image generation, visual reasoning, speech processing, and reinforcement learning.
To promote collaboration and the exchange of knowledge, we are launching a weekly Journal Club. These sessions will be held every Sunday from 3:30 PM to 5:00 PM, where we will engage in discussions on the latest research papers and significant advancements in Compositional Learning.
For updates and additional information, please visit our blog: complearnjc.github.io.
For in-person communication, you may contact us via Telegram at @amirkasaei and @arashmarioriyad.
We look forward to your participation.
Forwarded from آموزش دانشکده کامپیوتر
«دستیاری درس تحلیل هوشمند تصاویر پزشکی»
⭕️ دانشجویانی که تمایل دارند در نیمسال آینده (نیمسال اول ۱۴۰۳-۰۴) دستیار آموزشی درس تحلیل هوشمند تصاویر پزشکی دکتر رهبان باشند، میتوانند فرم زیر را پر کنند.
https://docs.google.com/forms/d/e/1FAIpQLSekQsk7e-UavxTfliCGSPpK7-dABoMpsslGgyGPG7F71hyKkw/viewform?usp=sf_link
⭕️ دانشجویانی که تمایل دارند در نیمسال آینده (نیمسال اول ۱۴۰۳-۰۴) دستیار آموزشی درس تحلیل هوشمند تصاویر پزشکی دکتر رهبان باشند، میتوانند فرم زیر را پر کنند.
https://docs.google.com/forms/d/e/1FAIpQLSekQsk7e-UavxTfliCGSPpK7-dABoMpsslGgyGPG7F71hyKkw/viewform?usp=sf_link
Google Docs
Intelligent Analysis of Biomedical Images
فرم زیر برای درخواست دستیار آموزشی درس پردازش هوشمند تصاویر پزشکی در نیمسال اول ۰۴-۱۴۰۳ است.
لطفا در صورت تمایل فرم زیر را پر کنید.
تمام اطلاعات وارد شده صرفا در اختیار استاد درس و سر دستیار درس خواهد بود.
لطفا در صورت تمایل فرم زیر را پر کنید.
تمام اطلاعات وارد شده صرفا در اختیار استاد درس و سر دستیار درس خواهد بود.
💠 Compositional Learning Journal Club
✅ This Week's Presentation:
🔹 Title: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
🔸 Presenter: Arash Marioriyad
🌀 Abstract:
Diffusion models have achieved significant success in text-to-image generation. However, alleviating the misalignment between text prompts and generated images remains a challenging issue.
This presentation will focus on two observed causes of misalignment: concept ignorance and concept mis-mapping. To address these issues, we will discuss CoMat, an end-to-end diffusion model fine-tuning strategy that uses an image-to-text concept matching mechanism.
Using only 20K text prompts to fine-tune SDXL, CoMat significantly outperforms the baseline SDXL model on two text-to-image alignment benchmarks, achieving state-of-the-art performance.
📄 Paper:
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Session Details:
- 📅 Date: Sunday, 8 September 2024
- 🕒 Time: 3:30 - 5:00 PM (GMT+3:30)
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
✅ This Week's Presentation:
🔹 Title: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
🔸 Presenter: Arash Marioriyad
🌀 Abstract:
Diffusion models have achieved significant success in text-to-image generation. However, alleviating the misalignment between text prompts and generated images remains a challenging issue.
This presentation will focus on two observed causes of misalignment: concept ignorance and concept mis-mapping. To address these issues, we will discuss CoMat, an end-to-end diffusion model fine-tuning strategy that uses an image-to-text concept matching mechanism.
Using only 20K text prompts to fine-tune SDXL, CoMat significantly outperforms the baseline SDXL model on two text-to-image alignment benchmarks, achieving state-of-the-art performance.
📄 Paper:
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Session Details:
- 📅 Date: Sunday, 8 September 2024
- 🕒 Time: 3:30 - 5:00 PM (GMT+3:30)
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
🧠 RL Journal Club: This Week's Session
🤝 We invite you to join us for this week's RL Journal Club session, where we will explore the intriguing synergies between Reinforcement Learning (RL) and Large Language Models (LLMs). This session will delve into how these two powerful fields intersect, offering new perspectives and opportunities for advancement in AI research.
✅ This Week's Presentation:
🔹 Title: Synergies Between RL and LLMs
🔸 Presenter: Moein Salimi
🌀 Abstract: In this presentation, we will review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two domains that have been significantly propelled by deep neural networks. The discussion will center around a novel taxonomy proposed in the paper, categorizing the interaction between RL and LLMs into three main classes: RL4LLM, where RL enhances LLM performance in NLP tasks; LLM4RL, where LLMs assist in training RL models for non-NLP tasks; and RL+LLM, where both models work together within a shared planning framework. The presentation will explore the motivations behind these synergies, their successes, potential challenges, and avenues for future research.
The presentation will be based on the following paper:
▪️ The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models (https://arxiv.org/abs/2402.01874)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:30 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @alirezanobakht78
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an enlightening exploration of how RL and LLMs can work together to push the boundaries of AI research. Don’t miss this opportunity to deepen your understanding and engage in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL #LLM
🤝 We invite you to join us for this week's RL Journal Club session, where we will explore the intriguing synergies between Reinforcement Learning (RL) and Large Language Models (LLMs). This session will delve into how these two powerful fields intersect, offering new perspectives and opportunities for advancement in AI research.
✅ This Week's Presentation:
🔹 Title: Synergies Between RL and LLMs
🔸 Presenter: Moein Salimi
🌀 Abstract: In this presentation, we will review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two domains that have been significantly propelled by deep neural networks. The discussion will center around a novel taxonomy proposed in the paper, categorizing the interaction between RL and LLMs into three main classes: RL4LLM, where RL enhances LLM performance in NLP tasks; LLM4RL, where LLMs assist in training RL models for non-NLP tasks; and RL+LLM, where both models work together within a shared planning framework. The presentation will explore the motivations behind these synergies, their successes, potential challenges, and avenues for future research.
The presentation will be based on the following paper:
▪️ The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models (https://arxiv.org/abs/2402.01874)
Session Details:
📅 Date: Tuesday
🕒 Time: 3:30 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @alirezanobakht78
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 This session promises to be an enlightening exploration of how RL and LLMs can work together to push the boundaries of AI research. Don’t miss this opportunity to deepen your understanding and engage in thought-provoking discussions!
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL #LLM
arXiv.org
The RL/LLM Taxonomy Tree: Reviewing Synergies Between...
In this work, we review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two areas that owe their momentum to the development of deep neural networks. We...
💠 Compositional Learning Journal Club
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
🔸 Presenter: Amir Kasaei
🌀 Abstract:
Recent advancements in text-conditioned image generation, particularly through latent diffusion models, have achieved significant progress. However, as text complexity increases, these models often struggle to accurately capture the semantics of prompts, and existing tools like CLIP frequently fail to detect these misalignments.
This presentation introduces a Decompositional-Alignment-Score, which breaks down complex prompts into individual assertions and evaluates their alignment with generated images using a visual question answering (VQA) model. These scores are then combined to produce a final alignment score. Experimental results show this method aligns better with human judgments compared to traditional CLIP and BLIP scores. Moreover, it enables an iterative process that improves text-to-image alignment by 8.7% over previous methods.
This approach not only enhances evaluation but also provides actionable feedback for generating more accurate images from complex textual inputs.
📄 Paper: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 2:00 - 3:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
🔸 Presenter: Amir Kasaei
🌀 Abstract:
Recent advancements in text-conditioned image generation, particularly through latent diffusion models, have achieved significant progress. However, as text complexity increases, these models often struggle to accurately capture the semantics of prompts, and existing tools like CLIP frequently fail to detect these misalignments.
This presentation introduces a Decompositional-Alignment-Score, which breaks down complex prompts into individual assertions and evaluates their alignment with generated images using a visual question answering (VQA) model. These scores are then combined to produce a final alignment score. Experimental results show this method aligns better with human judgments compared to traditional CLIP and BLIP scores. Moreover, it enables an iterative process that improves text-to-image alignment by 8.7% over previous methods.
This approach not only enhances evaluation but also provides actionable feedback for generating more accurate images from complex textual inputs.
📄 Paper: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 2:00 - 3:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
🧠 RL Journal Club: This Week's Session
🤝 We invite you to join us for this week's RL Journal Club session, where we will dive into a minimalist approach to offline reinforcement learning. In this session, we will explore how simplifying algorithms can lead to more robust and efficient models in RL, challenging the necessity of complex modifications commonly seen in recent advancements.
✅ This Week's Presentation:
🔹 Title: Revisiting the Minimalist Approach to Offline Reinforcement Learning
🔸 Presenter: Professor Mohammad Hossein Rohban
🌀 Abstract: This presentation will delve into the trade-offs between simplicity and performance in offline RL algorithms. We will review the minimalist approach proposed in the paper, which re-evaluates core algorithmic features and shows that simpler models can achieve performance on par with more intricate methods. The discussion will include experimental results that demonstrate how stripping away complexity can lead to more effective learning, providing fresh insights into the design of RL systems.
The presentation will be based on the following paper:
▪️ Revisiting the Minimalist Approach to Offline Reinforcement Learning (https://arxiv.org/abs/2305.09836)
Session Details:
📅 Date: Tuesday
🕒 Time: 4:00 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @alirezanobakht78
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 Join us for an insightful session where we rethink how much complexity is truly necessary for effective offline reinforcement learning! Don't miss this chance to deepen your understanding of RL methodologies.
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
🤝 We invite you to join us for this week's RL Journal Club session, where we will dive into a minimalist approach to offline reinforcement learning. In this session, we will explore how simplifying algorithms can lead to more robust and efficient models in RL, challenging the necessity of complex modifications commonly seen in recent advancements.
✅ This Week's Presentation:
🔹 Title: Revisiting the Minimalist Approach to Offline Reinforcement Learning
🔸 Presenter: Professor Mohammad Hossein Rohban
🌀 Abstract: This presentation will delve into the trade-offs between simplicity and performance in offline RL algorithms. We will review the minimalist approach proposed in the paper, which re-evaluates core algorithmic features and shows that simpler models can achieve performance on par with more intricate methods. The discussion will include experimental results that demonstrate how stripping away complexity can lead to more effective learning, providing fresh insights into the design of RL systems.
The presentation will be based on the following paper:
▪️ Revisiting the Minimalist Approach to Offline Reinforcement Learning (https://arxiv.org/abs/2305.09836)
Session Details:
📅 Date: Tuesday
🕒 Time: 4:00 - 5:00 PM
🌐 Location: Online at https://vc.sharif.edu/ch/rohban
📍 For in-person attendance, please message me on Telegram at @alirezanobakht78
☝️ Note: The discussion is open to everyone, but we can only host students of Sharif University of Technology in person.
💯 Join us for an insightful session where we rethink how much complexity is truly necessary for effective offline reinforcement learning! Don't miss this chance to deepen your understanding of RL methodologies.
✌️ We look forward to your participation!
#RLJClub #JClub #RIML #SUT #AI #RL
arXiv.org
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Recent years have witnessed significant advancements in offline reinforcement learning (RL), resulting in the development of numerous algorithms with varying degrees of complexity. While these...
Forwarded from Rayan AI Course
Please open Telegram to view this post
VIEW IN TELEGRAM
💠 Compositional Learning Journal Club
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
🔸 Presenter: Amir Kasaei
🌀 Abstract:
A new methodology for evaluating text-to-image generation models is being proposed, addressing limitations in current evaluation techniques. Existing methods, which use metrics such as fidelity and CLIPScore, often combine criteria like position, action, and photorealism in their assessments. This new approach adapts model analysis from visual semiotics, establishing distinct visual composition criteria. It highlights three key dimensions: plastic categories, multimodal translation, and enunciation, each with specific sub-criteria. The methodology is tested on Midjourney and DALL·E, providing a structured framework that can be used for future quantitative analyses of generated images.
📄 Paper: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 5:00 - 6:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
🔸 Presenter: Amir Kasaei
🌀 Abstract:
A new methodology for evaluating text-to-image generation models is being proposed, addressing limitations in current evaluation techniques. Existing methods, which use metrics such as fidelity and CLIPScore, often combine criteria like position, action, and photorealism in their assessments. This new approach adapts model analysis from visual semiotics, establishing distinct visual composition criteria. It highlights three key dimensions: plastic categories, multimodal translation, and enunciation, each with specific sub-criteria. The methodology is tested on Midjourney and DALL·E, providing a structured framework that can be used for future quantitative analyses of generated images.
📄 Paper: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 5:00 - 6:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
🚨Open Position: Visual Compositional Generation Research 🚨
We are excited to announce an open research position for a project under Dr. Rohban at the RIML Lab (Sharif University of Technology). The project focuses on improving text-to-image generation in diffusion-based models by addressing compositional challenges.
🔍 Project Denoscription:
Large-scale diffusion-based models excel at text-to-image (T2I) synthesis, but still face issues like object missing and improper attribute binding. This project aims to study and resolve these compositional failures to improve the quality of T2I models.
Key Papers:
- T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional T2I Generation
- Attend-and-Excite: Attention-Based Semantic Guidance for T2I Diffusion Models
- If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
- ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
🎯 Requirements:
- Must: PyTorch, Deep Learning,
- Recommended: Transformers and Diffusion Models.
- Able to dedicate significant time to the project.
🗓 Important Dates:
- Application Deadline: 2024/10/12 (23:59 UTC+3:30)
📌 Apply here:
Application Form
For questions:
📧 a.kasaei@me.com
💬 @amirkasaei
@RIMLLab
#research_application
#open_position
We are excited to announce an open research position for a project under Dr. Rohban at the RIML Lab (Sharif University of Technology). The project focuses on improving text-to-image generation in diffusion-based models by addressing compositional challenges.
🔍 Project Denoscription:
Large-scale diffusion-based models excel at text-to-image (T2I) synthesis, but still face issues like object missing and improper attribute binding. This project aims to study and resolve these compositional failures to improve the quality of T2I models.
Key Papers:
- T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional T2I Generation
- Attend-and-Excite: Attention-Based Semantic Guidance for T2I Diffusion Models
- If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
- ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
🎯 Requirements:
- Must: PyTorch, Deep Learning,
- Recommended: Transformers and Diffusion Models.
- Able to dedicate significant time to the project.
🗓 Important Dates:
- Application Deadline: 2024/10/12 (23:59 UTC+3:30)
📌 Apply here:
Application Form
For questions:
📧 a.kasaei@me.com
💬 @amirkasaei
@RIMLLab
#research_application
#open_position
💠 Compositional Learning Journal Club
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
🔸 Presenter: Amir Kasaei
🌀 Abstract:
Recent advancements in diffusion models, like Stable Diffusion, have shown impressive image generation capabilities, but ensuring precise alignment with text prompts remains a challenge. This presentation introduces Initial Noise Optimization (InitNO), a method that refines initial noise to improve semantic accuracy in generated images. By evaluating and guiding the noise using cross-attention and self-attention scores, the approach effectively enhances image-prompt alignment, as demonstrated through rigorous experimentation.
📄 Paper: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 5:00 - 6:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
✅ This Week's Presentation:
🔹 Title: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
🔸 Presenter: Amir Kasaei
🌀 Abstract:
Recent advancements in diffusion models, like Stable Diffusion, have shown impressive image generation capabilities, but ensuring precise alignment with text prompts remains a challenge. This presentation introduces Initial Noise Optimization (InitNO), a method that refines initial noise to improve semantic accuracy in generated images. By evaluating and guiding the noise using cross-attention and self-attention scores, the approach effectively enhances image-prompt alignment, as demonstrated through rigorous experimentation.
📄 Paper: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Session Details:
- 📅 Date: Sunday
- 🕒 Time: 5:00 - 6:00 PM
- 🌐 Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! ✌️
arXiv.org
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise...
Recent strides in the development of diffusion models, exemplified by advancements such as Stable Diffusion, have underscored their remarkable prowess in generating visually compelling images....