New Clarity built a specialized AI chatbot that transformed a highly technical textbook into an interactive assistant that was able to effectively utilize images to answer queries. This system was able to display the correct technical images alongside its answers due to a secondary RAG system created by analyzing the context around and references to each image. By combining text retrieval with intelligent image selection, the chatbot created a far more effective way for users to engage with dense, visually complex subject matter.
Technical textbooks often rely heavily on detailed images, diagrams, and figures. These images are tied closely to the surrounding text and are critical for understanding the subject matter. Standard LLMs, however, were not able to reliably recognize or connect these images to specific queries because of the specialization of the topic. A solution was needed that could:
New Clarity developed a custom retrieval augmented generation (RAG) system designed to handle both text and images. The solution included several layers of engineering:
The finished chatbot delivered a more advanced learning tool than a simple question-and-answer system:
The success of the project came from treating images as first-class data within the retrieval system. Rather than relying on an LLM alone, New Clarity combined metadata-rich image indexing with advanced text retrieval. This allowed the AI to reason about when an image was necessary and which one best supported the user’s question.
This project demonstrated how advanced retrieval and metadata strategies can extend AI beyond text-only systems. Similar techniques can be applied in fields such as engineering, medicine, or legal research, where diagrams, scans, or exhibits are as important as the text itself. New Clarity continues to help organizations unlock the full value of their technical content by pairing AI agents with both textual and visual intelligence.
Elit facilisis maecenas euismod vulputate. Dignissim natoque nascetur donec urna in vel vitae.