Skip to main content

Uploading Data with Pre-Defined Labels

A step-by-step guide for Incorporating Custom Labels into Data Uploads

Allison McIllece-Lasswell avatar
Written by Allison McIllece-Lasswell
Updated this week

Many users want to include their own labels, concepts, or tags alongside their data entries when uploading CSV files for analysis. Beehive AI fully supports this, making it easy to incorporate a coding system directly during the upload process for more customized analysis. Here’s how it works:


Key Features

  • Direct Label Upload: Labels, tags, or concepts can be added directly within CSV files.

  • Multiple Labels: Multiple labels can be included for a single data entry.


Steps to Include Labels in CSV Files

1️⃣ Prepare the CSV File

To include labels with data entries:

  • Add a New Column: Create an additional column in the CSV file specifically for storing the labels.

    • While not strictly required, it is strongly recommended to place this new - Labels column immediately after the Parent column it is linked to.

ID

Date

What would convince you to switch to a plant based burger?

502cjl5…

2020-02-28

I would switch if there are healthier ingredients.

6a1upc…

2020-03-01

Taste

  • Mark the Column: The new column’s header must be exactly the same as the Parent column header, followed by the addition of - Labels
    For example:

    • Parent column header:
      What would convince you to switch to a plant based burger?

    • Labels column header:
      What would convince you to switch to a plant based burger? - Labels

ID

Date

What would convince you to switch to a plant based burger?

What would convince you to switch to a plant based burger? - Labels

502cjl5…

2020-02-28

I would switch if there are healthier ingredients.

6a1upc…

2020-03-01

Taste


2️⃣ Specify the Pre-defined Labels

  • In the new labels column, add the relevant labels for each data entry, separated by commas.

  • One or multiple labels can be added as needed.

  • ⚠️ Important: All labels must be in lower case to ensure accurate processing by the platform.

    • Supported ✅: health,ingredients – lower case

    • Not Supported ❌: Health,Ingredients – capitalized, which does not meet the required format.

    • Not Supported ❌: HEALTH,INGREDIENTS – capitalized, which does not meet the required format.

Example row in the CSV:

ID

Date

What would convince you to switch to a plant based burger?

What would convince you to switch to a plant based burger? - Labels

502cjl5…

2020-02-28

I would switch if there are healthier ingredients.

health,ingredients

6a1upc…

2020-03-01

Taste

taste


3️⃣ Add a Configuration Row

In the CSV file, a new empty row must be added in the second row to accommodate a configuration line that informs the platform about which Parent column the labels in the - Labels column relate to.

ID

Date

What would convince you to switch to a plant based burger?

What would convince you to switch to a plant based burger? - Labels

502cjl5…

2020-02-28

I am concerned that they are not that healthy for you, nor are they cheap.

health,price

6a1upc…

2020-03-01

Taste

taste

In this new second row, insert the following configuration line in the - Labels column using the exact format as shown below:

Example configuration row:

{"col":"What would convince you to switch to a plant based burger?"}

ID

Date

What would convince you to switch to a plant based burger?

What would convince you to switch to a plant based burger? - Labels

{"col": "What would convince you to switch to a plant based burger?"}

502cjl5…

2020-02-28

I am concerned that they are not that healthy for you, nor are they cheap.

health,price

6a1upc…

2020-03-01

Taste

taste

⚠️ Important:
The formatting of the configuration line must be matched exactly—including the spacing, symbols, and overall structure.

  • Only the text within the quotes (in this example: "What would convince you to switch to a plant based burger?") should be swapped out for the Parent column header relevant to your dataset.

  • The text you swap in must match your actual Parent column header exactly—character for character, including any punctuation and spacing.

  • Do not change any of the surrounding structure {"col":"..."}, as the platform uses this precise format to recognize the labels column’s association with the parent data column.

This step is crucial for ensuring the platform accurately links the uploaded labels to the correct data entries.


4️⃣ Repeat for Multiple Columns

Users can repeat Steps 1-3 for each Parent column that has pre-defined labels. Simply ensure that:

  • Each new labels column follows the same naming convention (Parent Column Header - Labels).

  • Each new labels column has its own configuration row entry, matched to the corresponding parent column exactly.

By repeating these steps, users can easily manage and label multiple columns of data within the same upload.


5️⃣ Save & Upload

  • Save the file in CSV format.

  • Upload the CSV file to the platform as usual.

  • The system will automatically detect and apply the labels during analysis.


Example CSV Format

Here’s a snippet of how the final CSV might look:

  • The Parent column appears in blue

  • The new - Labels column appears in green

ID

Date

What would convince you to switch to a plant based burger?

What would convince you to switch to a plant based burger? - Labels

{"col": "What would convince you to switch to a plant based burger?"}

502cjl5…

2020-02-28

I am concerned that they are not that healthy for you, nor are they cheap.

health,price

6a1upc…

2020-03-01

Taste

taste


Need help?

For any questions or help with file setup, our support team is available to assist!

Did this answer your question?