Ingest from a Croissant URL Workflow
This workflow ingests a dataset described by a Croissant URL into ApertureDB. Croissant is an MLCommons metadata format for machine-learning datasets.
You can use it with your own existing data, or with a related dataset available on sites such as Hugging Face, Kaggle, and Google Dataset Search. It is an easy way to get started with ApertureDB and to see how it can be used with real data.
Creating the workflow
For general information on creating workflows in ApertureDB Cloud see Creating and Deleting Workflows.
Configure your workflow by selecting:
- Which instance to use. If you only have one instance, there will be no options to select.
- The Croissant URL. On sites that support Croissant, you can copy this URL from the Croissant button on the dataset's page. Here are a few examples.
Getting croissant links for datasets
Once you have copied the URL, paste it into the Croissant URL field in the workflow creation step.
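Before submitting, it can be useful to check what a Croissant document actually describes. The following is a minimal sketch, not part of the workflow itself: it uses only the Python standard library, the helper names `fetch_croissant` and `summarize_croissant` are hypothetical, and the sample document is made up for illustration. It assumes the document uses the usual Croissant JSON-LD keys `distribution`, `recordSet`, and `field`.

```python
import json
import urllib.request

# A trimmed, made-up Croissant document used only for illustration.
SAMPLE_CROISSANT = """
{
  "@context": {"@vocab": "https://schema.org/",
               "cr": "http://mlcommons.org/croissant/"},
  "@type": "sc:Dataset",
  "name": "example_dataset",
  "distribution": [
    {"@type": "cr:FileObject", "@id": "data.csv",
     "name": "data.csv", "encodingFormat": "text/csv"}
  ],
  "recordSet": [
    {"@type": "cr:RecordSet", "@id": "default", "name": "default",
     "field": [
       {"@type": "cr:Field", "name": "default/image"},
       {"@type": "cr:Field", "name": "default/label"}
     ]}
  ]
}
"""


def fetch_croissant(url):
    """Download the text of a Croissant document (hypothetical helper)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


def summarize_croissant(text):
    """Return the dataset name, file names, and fields per record set."""
    doc = json.loads(text)
    return {
        "name": doc.get("name"),
        "files": [f.get("name") for f in doc.get("distribution", [])],
        "record_sets": {
            rs.get("name"): [fld.get("name") for fld in rs.get("field", [])]
            for rs in doc.get("recordSet", [])
        },
    }


# Summarize the inline sample; no network access is needed here.
summary = summarize_croissant(SAMPLE_CROISSANT)
print(summary)
```

To inspect a real dataset, you could pass the URL you copied to `fetch_croissant` and feed the result to `summarize_croissant`. The record sets and fields listed give a rough idea of what the workflow has to work with, although how they map to objects in ApertureDB is up to the workflow.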
Once you have filled in the fields, click "Submit". Your workflow will be created and will start running.
See the results
Go to the "My Instances" page and click "Connect" for the instance you used. You will see an option to open the Web UI for your instance. As the workflow runs, the number of objects in the database increases. Click the refresh button to update the count.