Ingest from a Croissant URL Workflow

This workflow ingests datasets described by a Croissant URL into ApertureDB.

It lets you use your own existing data, or a relevant dataset published on sites such as Hugging Face, Kaggle, and Google Dataset Search. This is an easy way to get started with ApertureDB and to see how it works with real data.

Creating the workflow

For general information on creating workflows in ApertureDB Cloud see Creating and Deleting Workflows.

Configure your workflow with the Croissant link of the dataset you want to ingest. Copy the link from the dataset's page:

  • Hugging Face dataset
    • Hugging Face Croissant link
  • Kaggle dataset
    • Kaggle Croissant link

Once you have copied the URL, paste it into the workflow creation step in ApertureDB Cloud.
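Before pasting a URL, it can help to confirm that it really points at Croissant metadata. The sketch below is only an illustration and is not part of the workflow itself. It assumes the URL patterns that Hugging Face and Kaggle use for Croissant links at the time of writing, and it applies a light sanity check based on the `@type` and `conformsTo` fields of the Croissant format. The function names are hypothetical, and the check is not a full validation.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen


def huggingface_croissant_url(repo_id: str) -> str:
    """Build a Croissant URL for a Hugging Face dataset id such as 'user/name'.

    Assumption: the Hub serves Croissant metadata at /api/datasets/<id>/croissant.
    """
    return f"https://huggingface.co/api/datasets/{quote(repo_id)}/croissant"


def kaggle_croissant_url(owner: str, dataset: str) -> str:
    """Build a Croissant URL for a Kaggle dataset.

    Assumption: Kaggle serves it at /datasets/<owner>/<dataset>/croissant/download.
    """
    return (
        "https://www.kaggle.com/datasets/"
        f"{quote(owner)}/{quote(dataset)}/croissant/download"
    )


def fetch_croissant(url: str, timeout: float = 30.0) -> dict:
    """Download the JSON-LD document behind a Croissant URL (needs network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def summarize_croissant(metadata: dict) -> dict:
    """Light sanity check and summary of a Croissant document (not a validator)."""
    conforms_to = str(metadata.get("conformsTo", ""))
    is_dataset = metadata.get("@type") in (
        "sc:Dataset",
        "Dataset",
        "https://schema.org/Dataset",
    )
    return {
        "is_croissant": is_dataset and "mlcommons.org/croissant" in conforms_to,
        "name": metadata.get("name"),
        "files": len(metadata.get("distribution", [])),
        "record_sets": len(metadata.get("recordSet", [])),
    }
```

For example, `summarize_croissant(fetch_croissant(huggingface_croissant_url("user/name")))` would report whether the document looks like Croissant metadata and how many files and record sets it declares, where `user/name` is a placeholder for a real dataset id.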

Set up your workflow

Once you have filled in the fields, click "Submit". Your workflow will be created and will start running.

See the results

If you go to the "My Instances" page and click on "Connect" for the instance you used, you will see an option to go to the Web UI for your instance. You will see the number of objects in the database increase as the workflow runs. Click on the refresh button to update the count.
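The same progress check can also be sketched in code instead of the Web UI. The example below is a rough illustration, assuming the ApertureDB query language's `FindEntity` command with `"results": {"count": true}` and a response of the form `[{"FindEntity": {"count": N, ...}}]`. The helper names are hypothetical, and the class names created by this workflow are not specified here, so `with_class` is left optional.

```python
from typing import Any, Optional

Query = list[dict[str, Any]]


def count_query(with_class: Optional[str] = None) -> Query:
    """Build a FindEntity query that asks only for a count of matching entities."""
    find: dict[str, Any] = {"results": {"count": True}}
    if with_class is not None:
        # Hypothetical filter: restrict the count to one entity class.
        find["with_class"] = with_class
    return [{"FindEntity": find}]


def extract_count(response: list[dict[str, Any]]) -> int:
    """Read the count from the first command's result in the response."""
    return int(response[0]["FindEntity"]["count"])


# Usage sketch, assuming the aperturedb Python package is installed and
# configured for your instance (not executed here):
#
#   from aperturedb.CommonLibrary import create_connector
#   client = create_connector()
#   response, _ = client.query(count_query())
#   print(extract_count(response))
```

Running the query a few times while the workflow is active should show the count growing, much like refreshing the Web UI.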