Custom Data Source

Updated by Ole Dallerup

Custom Upload is available to Enterprise customers.
Before getting started, please get in touch with your Dreamdata Customer Success Manager. A custom upload of data almost always requires tech resources at your end, so before you invest time and resources let us help you and ensure a custom upload is needed.

For most Dreamdata customers, the native data sources within Dreamdata solve all their data integration needs. However, in some cases, data exists in other standard or custom-built systems. This article will describe how that is done while covering the different use cases we usually see when needed.

This overview covers the following

  1. Implementation Process
  2. Use Cases
    1. Upload a custom CRM Source
    2. Upload custom Stage Objects
    3. Upload custom Event and Web-tracking Data

Implementation Process

  1. The customer prepares the data to upload.
    1. Schedule a data export
    2. Transform the data into the supported file formats.
    3. Compress the data using gzip.
  2. The customer upload the data.
    1. The data has to be uploaded to Google Cloud Storage or SFTP.
      1. You gain access to upload through your Customer Success Manager.
      2. Here are code samples of how to upload data to Google Cloud Storage.
    2. If incremental uploads, structure the data in folders by type and date
      1. /accounts/2024-01-23/file.json
      2. /{data-type}/{date}/file_{number}.json
    3. If full load, overwrite files at each upload
  3. Dreamdata will pick up the latest version of the data two times daily, at 04:00 AM (UTC) and 03:00 PM (UTC).
    1. Dreamdata recommends uploading the data at least daily unless the data changes infrequently.

File format

Files should be delivered in JSON newline delimited (.ndjson) or parquet.

We recommend compressing the files using gzip to reduce data transfer costs.

Incrementally

It's possible to upload the data incrementally or upload full data batch each time.

We recommend doing what is the easiest for you.

Use Cases

Below are three use cases, each requiring different file types and schemas. All fields within these schemas are necessary, but you may leave some fields empty if you don't have any data or if it doesn't apply to your context.

You should share multiple files or just one, depending on the use case.

Upload a custom CRM Source

Uploading a custom CRM Source is typically used for customers who are in one of the following situations

  • Customer has a CRM not yet supported by Dreamdata (Dreamdata natively supports Salesforce, HubSpot, Pipedrive, and MS Dynamics).
  • Customer has a self-hosted CRM such as Oracle or MS Dynamics (Dreamdata has native support of MS Dynamics Cloud).
  • Customer has a home build CRM
  • Customer wants to avoid hitting CRM API limits (for example, Salesforce)

Here, you can read more about the schema format when loading a custom CRM data source.

Upload custom Stage Objects
Before uploading custom stage objects, it is recommended to consider if you can't add a custom or calculate field on your CRM object and use the stage model builder. That is typically easier and faster, and also available to all plans.

Uploading Custom Stage Objects is typically needed when you have your ERP system with information not else available in your CRM or when you have defined your ways of measurement as joins of multiple objects.

Here, you can read more about the schema format when loading custom Stage Objects.

Upload custom Event and Web-tracking Data
Before uploading custom event data, it is recommended to check out the Google Sheet integration or Servcer-Side tracking. That is typically easier and faster to implement, while also being available to all paid plans.

Uploading custom web-tracking is typically used when uploading data from tracking solutions Dreamdata does not yet support out of the box such as Snowplow, a home build or open-source tracking solution.

Uploading custom events are typically custom events coming from a service you are using, that Dreamdata does not yet support or is from your own build service, such as your product.

Here, you can read more about the schema format when loading events and web-tracking data.


How did we do?