Lädt...


🔧 Computer Vision Meetup: Improved Visual Grounding through Self-Consistent Explanations


Nachrichtenbereich: 🔧 Programmierung
🔗 Quelle: dev.to

Vision-and-language models that are trained to associate images with text have shown to be effective for many tasks, including object detection and image segmentation. In this talk, we will discuss how to enhance vision-and-language models’ ability to localize objects in images by fine-tuning them for self-consistent visual explanations. We propose a method that augments text-image datasets with paraphrases using a large language model and employs SelfEQ, a weakly-supervised strategy that promotes self-consistency in visual explanation maps. This approach broadens the model’s working vocabulary and improves object localization accuracy, as demonstrated by performance gains on competitive benchmarks.

About the Speakers

Dr. Paola Cascante-Bonilla received her Ph.D. in Computer Science at Rice University in 2024, advised by Professor Vicente Ordóñez Román, working on Computer Vision, Natural Language Processing, and Machine Learning. She received a Master of Computer Science at the University of Virginia and a B.S. in Engineering at the Tecnológico de Costa Rica. Paola will join Stony Brook University (SUNY) as an Assistant Professor in the Department of Computer Science.

Ruozhen (Catherine) He is a first-year Computer Science PhD student at Rice University, advised by Prof. Vicente Ordóñez, focusing on efficient algorithms in computer vision with less or multimodal supervision. She aims to leverage insights from neuroscience and cognitive psychology to develop interpretable algorithms that achieve human-level intelligence across versatile tasks.

Not a Meetup member? Sign up to attend the next event:

https://voxel51.com/computer-vision-ai-meetups/

Recorded on June 27, 2024 at the AI, Machine Learning and Computer Vision Meetup.

...

🔧 SelfEQ Enhances Visual Grounding with Self-Consistency


📈 36.3 Punkte
🔧 Programmierung

📰 Ferretv2: An Improved Baseline for Referring and Grounding


📈 31.64 Punkte
🔧 AI Nachrichten

🔧 Oct 10: virtual AI, Machine Learning and Computer Vision Meetup!


📈 28.74 Punkte
🔧 Programmierung

🔧 Virtual Meetup: Nov 21 - Best Computer Vision Research of ECCV 2024


📈 28.74 Punkte
🔧 Programmierung

🔧 July 3: Virtual AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Boston AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Virtual Meetup: Nov 22 - Best Computer Vision Research of ECCV 2024


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne


📈 28.74 Punkte
🔧 Programmierung

🔧 Sept 26: virtual AI, Machine Learning and Computer Vision Meetup!


📈 28.74 Punkte
🔧 Programmierung

🔧 Nov 14 - Virtual AI, ML and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Leveraging Pre-trained Text2Image Diffusion Models for Zero-Shot Video Editing


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: It's in the Air Tonight. Sensor Data in RAG


📈 28.74 Punkte
🔧 Programmierung

🔧 Virtual Meetup: Nov 19 - Best Computer Vision Research of ECCV 2024


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Reducing Hallucinations in ChatGPT and Similar AI Systems


📈 28.74 Punkte
🔧 Programmierung

🔧 Nov 14 - AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents


📈 28.74 Punkte
🔧 Programmierung

🔧 Sept 12 - Virtual AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Tomorrow - Oct 24: Virtual AI, ML and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: To Infer or To Defer: Hazy Oracles in Human+AI Collaboration


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Accelerating Multimodal RAG Pipelines with NVIDIA


📈 28.74 Punkte
🔧 Programmierung

🔧 Oct 24: Virtual AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Anomaly Detection with Anomalib and FiftyOne


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: 5 Handy Ways to Use Embeddings, the Swiss Army Knife of AI


📈 28.74 Punkte
🔧 Programmierung

🔧 Oct 24: Virtual AI, Machine Learning and Computer Vision Meetup


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Who needs RLHF When You Have SFT?


📈 28.74 Punkte
🔧 Programmierung

🔧 Computer Vision Meetup: Accelerating Multimodal RAG Pipelines with NVIDIA


📈 28.74 Punkte
🔧 Programmierung

matomo