How to : Foundry - Google Gemini Pro Vision - Text generation

PT · January 19, 2024, 12:18pm

Hello Community,

In this tutorial, we will explore how to use Google’s Gemini Pro Vision, a multimodal model, to build on the object detection techniques covered in our previous How to (linked below).
This approach combines multiple modalities, such as text and images, to give the model additional context and improve the accuracy of object detection.
We will guide you through incorporating this technique into your existing object detection workflow in Foundry to extend what it can do.
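
To make the idea of pairing text with an image more concrete, here is a minimal sketch of turning object detection results into a text prompt that could be sent to a multimodal model together with the image. It does not call Foundry or Gemini; the `Detection` class, the `build_prompt` function, and the sample values are all hypothetical names and data introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """One object detection result (hypothetical shape, not a Foundry type)."""
    label: str
    confidence: float
    box: tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels


def build_prompt(detections: list[Detection], min_confidence: float = 0.5) -> str:
    """Turn detections into a text prompt to send alongside the image."""
    # Keep only detections the detector was reasonably sure about.
    kept = [d for d in detections if d.confidence >= min_confidence]
    if not kept:
        # No usable detections: fall back to a generic instruction.
        return "Describe the objects visible in this image."
    lines = [
        f"- {d.label} at box {d.box} (confidence {d.confidence:.2f})"
        for d in kept
    ]
    return (
        "An object detector found the following objects in this image:\n"
        + "\n".join(lines)
        + "\nDescribe each object and how they relate to one another."
    )


# Made-up sample detections, for illustration only.
detections = [
    Detection("dog", 0.91, (34, 50, 210, 300)),
    Detection("ball", 0.32, (220, 280, 260, 320)),
]
prompt = build_prompt(detections)
print(prompt)
```

The low-confidence "ball" detection is filtered out before the prompt is built, so the text the model receives only mentions objects the detector was fairly confident about. The threshold of 0.5 is an arbitrary choice for this sketch.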

Doc references :

Previous episode : How to : Foundry - Yolov8 - Object detection
