Extract Text from Images Workflow
This workflow allows you to use Optical Character Recognition (OCR) to extract text from images (including image-only PDFs). The workflow will also allow you to segment the text (if necessary) and add embeddings to ApertureDB using a pre-trained model. You can then use a query text or image to search for similar text segments, and then find the underlying image or PDF document.
For more detailed information about what this workflow is doing, see the ocr-extraction
documentation in GitHub.
Creating the workflow
For general information on creating workflows in ApertureDB Cloud see Creating and Deleting Workflows.
Configure your workflow by selecting:
- Which instance to use. If you only have one instance, there will be no options to select.
- Whether to extract text from images.
- Whether to extract text from PDF documents.
- The OCR model to use (Tesseract or EasyOCR)
- Whether to segment the extracted text and add embeddings.
Once you have filled in the fields, click "Submit". Your workflow will be created and will start running.
See the results
If you go to the "My Instances" page and click on "Connect" for the instance you used, you will see an option to go to the Web UI for your instance. You will see the number of descriptors in the database increase as the workflow runs. Click on the refresh button to update the count.
You can also do a "find similar" search using the "SEMANTIC SEARCH" tab in the WebUI.