Our original flagship technology is at the core of our offering. Logo Detection and Mark Recognition is one of the most valuable and sought-after visual signals. With VISUA you’ll finally get the scale, precision and flexibility the market demands.
Choose from a vast library of existing logos and marks, or self-activate any new logo or mark in less than a minute, at the highest levels of detection. With VISUA, training data and lead times are a thing of the past.
Logo Detection is a specific implementation of Visual-AI (also known as computer vision or vision-ai) that allows an AI to be trained to recognise logos, product & industry marks, icons, cartoon or popular characters, and other unique design elements, in any visual media (images and videos).
In contrast to other computer vision solutions that provide a one-size-fits-all offering, VISUA does not have a standard price list. This is for a very logical reason – There are many factors and combinations of settings that define the final cost for each customer and our Visual-AI (Computer Vision) solutions are very flexible in this regard so that the final implementation not only meets each customer’s specific technical needs, but also budgetary needs.
For instance, these are just some of the factors that influence the final cost:
All these factors, plus some other more obscure ones allow us to optimise the offering to deliver the very best value for any use case and scale.
Absolutely, we actively encourage and are very happy for our customers to benchmark our Visual-AI (Computer Vision) tech stack against other providers as we typically out-perform them.
However, as we don’t provide a one-size-fits-all system, we like to discuss your specific use case and requirements. Based on the outcome of that discussion, we then set up a live test using your own data. Once complete you receive the results and annotations in whatever format you need and we are available to discuss the specifics with you.
This is completely free, so simply get in touch to set this up
Our Visual-AI (Computer Vision) technologies and API focus on the processing of visual media for the purpose of detecting logos, objects and text within images or indeed visually similar copies of a source image. This is typically carried out for client/partner companies who already have access to their own source data.
For some specific projects we can, and have, assisted in the collection of data for processing. However, this is the exception and there are requirements, such as minimum volume and data licensing requirements.
If this is something you might require, please get in touch.
Our Visual-AI (Computer Vision) technologies and API focus on the processing of visual media to identify and report on key visual signals. The data is fed back to clients for them to make use of in their service or platform.
VISUA deliver data accuracy (precision and recall) in the range of 98.7% average precision and between 90-99% recall, (recall varies based on use-case). For more detail on Precision and Recall see the relevant question below. VISUA also uses humans to constantly sample-check and confirm the accuracy of the AI derived data, so data verification by the customer is rarely necessary.
Moderation of content is different. This is useful where the detection of an element (such as a brand) is correct, but the context of its use may be ambiguous. This is especially the case in use cases around copyright and trademark infringement or product counterfeits, where marginal cases need a human review for final decision.
In most cases, customers have their own Trust and Safety teams to review and moderate content, but for specific projects we can, and have, assisted in this task. However, this is the exception and there are requirements, such as minimum volume and data licensing requirements.
If this is something you might require, please get in touch.
The simple answer is no. We already process billions of images and millions of hours of video per month and have the ability to scale up for heavy demand at almost a moment’s notice.
Many world-leading companies already trust our technology to deliver high-volume processing for them, so if you need computer vision / Visual-AI at scale, you’ve come to the right place.
VISUA’s Visual-AI (computer vision) tech stack is built and optimised to handle massive volumes of data in the millions of media items per customer per month. Lower volumes can be supported, but typically, the lower limits are in the thousands of media files per day.
If your volume requirements are smaller than that then it may be worth reaching out to one of our customers in your specific sector, who will be able to support your needs better.
However, we do understand that some of the largest projects came from small beginnings. Also, some academic studies have relatively small processing requirements. So, if you have a new or academic project that you’d like to discuss, please do reach out. We’d be happy to discuss further.
In real terms the library size is unlimited. This is thanks to our Instant Learning feature that simply requires a single example of a logo/mark/icon. With that said, our current library contains over 100,000+ source objects, and growing daily.
Absolutely! The logo library is dynamic and new logos, marks and icons can be added by simply uploading a single example of what you wish to be added. Within a few hours the new logo can be active in your library.
VISUA’s Visual-AI (computer vision) Logo Detection technology uses a unique and patented feature called ‘Instant Learning. This means that, unlike other similar systems, new logos, marks and icons do not require lengthy training with tens or hundreds of example images. Instead, new source elements can be trained by loading a single example into your library. As such, the training process takes seconds to complete. Thereafter a new logo can be tracked within a few hours of it being added to the library.
VISUA’s Visual-AI (computer vision) Logo Detection technology uses a unique and patented feature called ‘Instant Learning. This means that from your perspective, it takes seconds to train a new logo, mark or icon. Once loaded into the library, the system self learns and is able to track a new logo mere hours later.
No, you can track as many or as few logos as you wish per API call. It should be noted however, that the overall cost will reflect how many logos you wish to detect during media processing.
Great question. This is another quite unique offering from VISUA. Deployment can be implemented in the cloud, on-premise or even a combination of cloud and on-premise if required.
Absolutely! Indeed some of our most interesting and unique applications have been on-device. Of course, every project is different and requirements vary, so if On-Device deployment is critical for your project, please do get in touch to discuss further.
Absolutely. This includes special marks, icons and indeed any unique graphical design elements, such as cartoon characters, emblems, stylized words, etc.
Yes. VISUA’s Logo Detection API works with all popular image and video file formats, including streaming media
VISUA’s Logo Detection API supports the detection of logos in all popular Images and video formats. This includes GIFs and even streaming video formats.
There is no specific minimum resolution as such, however, lower resolution media would also impact on the size and quality of logos contained in the media. This would therefore require specific tuning of the logo detection API in order to maximise the accuracy of logo, mark and icon detections.
With regard to maximum resolution, our resolution can process media files up to 4K resolution.
Yes. Our Logo Detection API is very flexible and can be specifically tuned per customer use. We call this ‘Occlusion Tolerance’ and allows the level of sensitivity to obscured brands/marks to be tuned in a range from zero occlusion (fully visible) to high occlusion (highly obscured or cropped).
It’s usually best to organise a short discussion to determine your specific requirements and from that a live test can be organised
Yes. Our Logo Detection API is very flexible and can be specifically tuned per customer use. We call this Perspective/Distortion Tolerance’ and allows the level of sensitivity to distortion/perspective of brands/marks to be tuned in a range from zero tolerance (zero degrees/zero distortion) to high tolerance (up to 70 degrees perspective and high distortion).
It’s usually best to organise a short discussion to determine your specific requirements and from that a live test can be organised.
Yes. This is another tuning option because the smaller the logo to be detected, the harder the Logo Detection Visual-AI (computer vision) needs to work. However, this is also linked to the resolution of the media because the higher the resolution, the higher the quality of the logo (more pixels), so it is less intensive to accurately detect smaller logos in a very high-quality image/video.
It’s usually best to organise a short discussion to determine your specific requirements and from that a live test can be organised.
Yes, of course. We call this ‘Matching Tolerance’ and it allows you to specify the percentage match that you wish to report on. For instance, you may only want to see detections that are a 100% match of your source logo. However, in most cases, our clients prefer to include close matches that would include modified versions of the logo.
It’s usually best to organise a short discussion to determine your specific requirements and from that a live test can be organised.
We are proud to deliver the industry’s most accurate Visual-AI (computer vision) technology stack, and this is especially true when it comes to Logo Detection. This has been confirmed on many occasions where clients have tested numerous providers against our tech as part of their due diligence testing. In fact, we always encourage prospective clients to run tests against other solutions and compare the results with our Visual-AI.
The main reason for this is the flexibility our API provides and the unique ability to tune the stack to deliver the very best results for each use case.
In real terms VISUA delivers data accuracy (precision and recall) in the range of 98.7% average precision and between 90-99% recall, (recall varies based on use-case). For more detail on Precision and Recall see the relevant question in this FAQ.
Precision and recall are critical terms when it comes to Visual-AI (computer vision) and together equate to the overall accuracy of the detections. Each term relates to either false positives or false negatives as follows:
Precision = False Positives
a test result which wrongly indicates that a particular condition or attribute is present. In other words seeing something that is NOT actually there.
Recall = False Negatives
a test result which wrongly indicates that a particular condition or attribute is absent. In other words NOT seeing something that IS actually there.
Currently VISUA boasts 98.7% average precision and between 90-99% recall, (recall varies based on use-case).
If this is a key KPI for your use case then get in touch to organise a more in depth discussion and a live test.
This is another flexible option when using our Logo Detection API. Our ‘Intelligence’ option allows you to choose whether to receive basic binary present/not present for your detections or advanced intelligence, such as size in frame, position in frame, share of voice, time on screen (for video), etc.
Complete details are available in our API Documentation, but if you would like to discuss this further, please reach out and a call can be organised.
Detection and annotation data are typically provided in JSON, XML or CSV format. Please get in touch if you require an alternative format
Yes. Logo placement detection is fully supported. In some cases this is based on standard object data, such as clothing, buildings and other objects. In other cases, a custom detection model may be required; for instance for specific advertising banners at sports venues/stadia.
If this is a key requirement for you, get in touch to discuss your needs in more detail.
Our logo detection Visual-AI (computer vision) API can process data to virtually any schedule you require. This can include real-time processing (used most often by broadcast and sports sponsorship monitoring platforms) or as long as 24 to 48 hours (as often used by brand monitoring companies). The speed of processing is another factor in cost, so please discuss this with us further.
We like to think that our Visual-AI (computer vision) API is very easy to implement as part of any workflow, in fact, in most cases implementation takes as little as two hours. We have very clear API documentation also. But we are not simply an API provider, so do not hesitate to get in touch with any questions you may have. We also implement a very thorough onboarding process and as a client you will have direct access to our team for any ongoing support questions.
Yes, you can find very clear API documentation for our Logo Detection endpoint, or indeed any of our other technologies. You can find all logo detection documentation here.
Absolutely! Unlike other solutions on the market that charge significant fees for support, or force you to reach out to third-party consultants, VISUA is proud to be much more than simply an API provider. You can get in touch with any questions you may have during your research and feasibility stage. We also implement a very thorough onboarding process and as a client you will have direct access to our team for any ongoing support questions.
For sure! Many of our partner clients came to us with quite unique requirements. A short discussion will allow us to gather your requirements and determine how easily we might support it.
Every offering from each company has a slightly different focus. The differences are too numerous to outline in this FAQ. However, we have developed specific comparison documents, which are available here. Specifically, you can find Google Cloud Vision vs VISUA, Amazon Rekognition vs VISUA and Microsoft Azure Vs VISUA documents. More are being added regularly.
If you have specific questions, please don’t hesitate to get in touch.
For sure! You can combine logo detection with text detection and object and scene detection to begin to understand context and sentiment from visual media.
In fact not only is it technically possible, we have built our API to make this as simple as possible. Our ‘Batch Task Processing’ allows multiple tech stack requests to be made in a single call. See our API Documentation for more details.
Yes, we have specific commercial initiatives to support these types of projects, although there are some qualifying requirements. Please get in touch to see if your project qualifies for support.
Supports the detection of logos in Images, video and GIFs.
Supports any resolution of source media, from low-res to 4K.
Detect logos as small as 0.01% of the full image size.
Choose to detect only exact matches, (ideal for brand monitoring applications) or include close matches and loose lookalikes of brands and marks, (ideal for trademark infringement monitoring).
The level of sensitivity to obscured brands/marks can be chosen. Allows detection from zero occlusion (fully visible) to high occlusion (highly obscured or cropped).
Choose whether to receive basic binary present/not present for your detections or advanced intelligence, such as size in frame, position in frame, share of voice, time on screen (for video), etc.
Allows you to determine the sensitivity to perspective/tilt, from zero tolerance (zero degrees) to high tolerance (up to 70 degrees).
Identify the type of object a brand or mark appears on, such as the type of signage, different product packaging or an item of sportswear.
The processing response time can be set to meet any use case and business need. From batched reports to real-time processing of live streams.
Seamlessly integrating our API is quick and easy, and if you have questions, there are real people here to help. So start today; complete the contact form and our team will get straight back to you.