Data-Driven Attribution

Updated 1 year ago by Mikkel Settnes

Data-driven models replace specific business rules with a mathematical algorithm that uses data from all the journeys to dynamically determine which touchpoints influenced the given Stage and should therefore receive credit.

The benefit of rule-based models is their explainability. You can look at a single customer journey, follow the rules of the model, and understand the credit scoring. Data-driven models, on the other hand, consider all the journeys at the same time. This lets the model reason about what the journeys have in common, but at the cost that you can no longer fully understand the attribution by looking at a single journey. Thus, unlike rule-based models such as time decay, which assign more weight to recent interactions based on a fixed decay function, Dreamdata's Data-driven attribution model determines credit dynamically.

When should I use the data-driven attribution model?

Rule-based models are limited to considering a single journey at a time and follow a fixed, pre-defined rule set.

The Data-driven model defines its own rules when looking at all your journeys at the same time.

Use Data-driven attribution if you want to give more weight to touchpoints that typically influence your customer journeys.

Also consider the Data-driven attribution model if:

You have longer journeys where it is not clear that one specific action is the most important, and you do not need to be able to follow the "rules" made by the algorithm.

It is important to note that rule-based models, when summed for all journeys, will also give a picture of what is typically happening.

The difference is that the data-driven attribution model will adjust the weighting of each touchpoint based on how frequently it appears.

Example

A rule-based model will always give the same amount of credit to the first touch, regardless of how frequent this type of touchpoint is.

The data-driven model will give less attribution weight to this touchpoint if it does not appear in journeys that reached the selected Stage.

If you want to follow the rules of the algorithm more closely, or if there are actions along your journey that you care more about, consider combining the Data-driven model with custom rules or setting up exclusions to better guide the algorithm. This is done under Data Hub > Attribution Models inside your Dreamdata instance.

Data-driven models can be combined with custom rules and exclusions to improve the algorithm's understanding of your journeys.

Do I need a specific amount of data?

The Dreamdata Data-driven model can be applied regardless of the size of your data.

When you have a limited amount of data, or journeys with few touchpoints, the Data-driven model will be close to a Linear attribution model.

A limited amount of data makes the Data-driven model similar to the Linear attribution model.

The Linear model represents the fair sharing of credit in the absence of information that makes some touchpoints worth more than others. This is why the Data-driven model moves towards the Linear model when the number of journeys is below 50.

Note that a journey is only useful to the algorithm if it has more than one type of touchpoint. If only a single type of touchpoint exists in the journey, the attribution is not affected by the choice of attribution model.

In such cases, there is no conclusive evidence that credit should be shared in any way other than equally.
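As a minimal illustration of the Linear fallback described above, the sketch below splits one unit of credit equally across the touchpoints of a single journey. The function name `linear_credit` and the channel names are hypothetical and chosen only for this example; this is not Dreamdata's implementation.

```python
from collections import Counter

def linear_credit(journey):
    """Split one unit of credit equally across the touches in a journey.

    `journey` is a list of touchpoint types in the order they occurred.
    (Hypothetical helper for illustration only.)
    """
    share = 1.0 / len(journey)
    credit = Counter()
    for touchpoint in journey:
        credit[touchpoint] += share
    return dict(credit)

# A journey with several touchpoint types: credit is shared between them.
mixed = linear_credit(["Paid Search", "Webinar", "Paid Search", "Email"])

# A journey with a single touchpoint type: that type receives all the
# credit, so the choice of attribution model makes no difference.
single = linear_credit(["Email", "Email", "Email"])
```

In the second journey every model has only one place to put the credit, which is why such journeys carry no information for the algorithm.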

Data-Driven attribution based on Markov chains

The data-driven attribution models within the Dreamdata platform are based on a Markov model.

Markov chains (or Markov models) are a common methodology used in attribution problems. They allow us to switch from heuristic rule-based models to a probabilistic model that takes all journeys into account at the same time to find similarities.

You will also see this method referred to as Chain-Based Models, Funnel Models, or Full-Path Models, essentially covering the same mathematical background - namely Markov Chains.

A Markov Chain describes the customer journey as a series of touchpoints eventually leading to a positive or negative business goal - often referred to as converting (positive) or non-converting (negative) paths. In the Dreamdata platform a business goal is defined through the Stage Model setup.

This aligns with a general picture of the customer journey as a sequence of touchpoints that can come in any order and have varying length. This alignment makes it easier to intuitively understand how the model works, without deep diving into the math.

How it works

We use the historical customer journeys leading up to a Stage to make a graph of all customer journeys. This graph is unique to each Stage model.

A touchpoint that is important for generating Leads might not be equally valuable when the goal is to generate Closed Revenue.

The graph describes how likely an account is to experience a specific touchpoint along the customer journey.
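One way to picture such a graph is as a table of transition probabilities estimated from historical journeys. The sketch below is an assumption-laden illustration, not Dreamdata's actual pipeline: the state names `start`, `conversion` and `null`, the function `transition_probabilities`, and the four sample journeys are all made up for the example.

```python
from collections import defaultdict

def transition_probabilities(journeys):
    """Estimate first-order Markov transition probabilities.

    `journeys` is a list of (touchpoints, converted) pairs. Each journey is
    wrapped in a hypothetical "start" state and ends in either "conversion"
    (the Stage was reached) or "null" (it was not).
    """
    counts = defaultdict(lambda: defaultdict(int))
    for touchpoints, converted in journeys:
        path = ["start", *touchpoints, "conversion" if converted else "null"]
        for src, dst in zip(path, path[1:]):
            counts[src][dst] += 1
    # Normalise the outgoing counts of every state into probabilities.
    return {
        src: {dst: n / sum(dsts.values()) for dst, n in dsts.items()}
        for src, dsts in counts.items()
    }

# Illustrative data only.
journeys = [
    (["Paid Search", "Webinar"], True),
    (["Paid Search"], False),
    (["Webinar", "Email"], True),
    (["Email"], False),
]
graph = transition_probabilities(journeys)
```

Here `graph["start"]` gives the probability that a journey begins with each touchpoint, and `graph["Webinar"]` gives the probabilities of what follows a webinar. Building a separate graph from the journeys of each Stage model is what makes the resulting weights Stage-specific.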

To figure out if a touchpoint is important, we calculate what is commonly referred to as the removal effect. This is the mathematical equivalent of answering: “what would happen if this touchpoint did not exist?”

The rationale of the removal effect is therefore: if the conversion rate changes a lot when we remove touchpoint X from all the journeys, then X must be important and should receive more credit than touchpoints that do not cause large changes when removed.

We calculate the removal effect for all touchpoints, thereby determining a weighting of the different touchpoints.
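The removal effect can be sketched on a toy transition graph as follows. This is a simplified textbook-style calculation under stated assumptions, not Dreamdata's production algorithm: the graph values, the state names `start`, `conversion` and `null`, and the helper names are hypothetical. Removing a touchpoint is modelled by treating every transition into it as a lost journey.

```python
# Hypothetical transition probabilities between states (illustrative only).
graph = {
    "start": {"Paid Search": 0.5, "Webinar": 0.25, "Email": 0.25},
    "Paid Search": {"Webinar": 0.5, "null": 0.5},
    "Webinar": {"conversion": 0.5, "Email": 0.5},
    "Email": {"conversion": 0.5, "null": 0.5},
}

def conversion_probability(graph, removed=None, sweeps=200):
    """Probability of eventually reaching "conversion" from "start".

    Solved by repeated sweeps over the states. A removed touchpoint is
    treated like "null": any transition into it contributes nothing.
    """
    p = {state: 0.0 for state in graph}
    p["conversion"] = 1.0
    p["null"] = 0.0
    for _ in range(sweeps):
        for src, dsts in graph.items():
            if src == removed:
                continue
            p[src] = sum(
                prob * p[dst] for dst, prob in dsts.items() if dst != removed
            )
    return p["start"]

def removal_effects(graph):
    """Relative drop in conversion probability when each touchpoint is removed."""
    base = conversion_probability(graph)
    touchpoints = [s for s in graph if s != "start"]
    return {
        tp: 1.0 - conversion_probability(graph, removed=tp) / base
        for tp in touchpoints
    }

effects = removal_effects(graph)
total = sum(effects.values())
# Normalise the removal effects so the weights sum to 1.
weights = {tp: effect / total for tp, effect in effects.items()}
```

In this toy graph, removing Webinar causes the largest drop in conversion probability, so Webinar receives the largest weight. The normalisation into `weights` is one common convention; how credit is finally distributed per journey is a design choice that this sketch does not cover.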

When we add more data, the model will adjust and adapt to your observed journeys.

Data-driven attribution uses a Markov chain model to consider all customer journeys and find similarities in the paths that lead to the chosen Stage.
Credit is assigned based on which touchpoints cause the biggest change when removed.

The fine print part

All data-driven attribution models have a built-in problem, aside from the fact that they cannot tell you about the things you have no data for.

They look for correlations.

Some methods are more sensitive than others, but the problem is fundamental in any data modelling. In fact, this even goes beyond attribution modelling.

An example to illustrate this:

Many closed deals in B2B will have a meeting somewhere near the end of the journey. This means that most paths leading to a closed deal will have gone through a meeting.

Any model will pick this up and conclude that the meeting must be important. The model is doing what you asked it to: determine what usually happens when you close deals. But we (= the humans) know that the meeting is probably not causing the sale as it is most likely just a meeting to discuss the final terms. The decision was already made. But it correlates with closing.

In this way, any data-driven attribution model is in danger of just telling you what your sales process looks like, because these touchpoints correlate with your sales without causing them.

This problem is far more severe in B2B situations, because of the much longer and non-linear journeys observed in this space, compared to its B2C counterpart.

To alleviate this problem as far as possible, Dreamdata allows detailed customization of the data-driven models by letting the user exclude such touchpoints from the model. This leaves the model to consider only the things that could potentially cause sales.

Furthermore, the data-driven attribution model can be combined with (customized) rule-based models, so that a certain part of the credit is determined by a rule-based model while the remaining part is assigned using the data-driven model. This is done under Data Hub > Attribution Models inside your Dreamdata instance.
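The idea of splitting credit between a rule-based model and the data-driven model can be sketched as a weighted blend. The function `blend_credit`, the `rule_share` parameter and the credit values below are hypothetical; how Dreamdata combines the two models internally is not described here, so treat this only as an illustration of the principle.

```python
def blend_credit(rule_based, data_driven, rule_share):
    """Combine two credit assignments for the same journey.

    `rule_share` is the fraction of credit decided by the rule-based model;
    the remainder comes from the data-driven model. (Hypothetical helper.)
    """
    if not 0.0 <= rule_share <= 1.0:
        raise ValueError("rule_share must be between 0 and 1")
    touchpoints = set(rule_based) | set(data_driven)
    return {
        tp: rule_share * rule_based.get(tp, 0.0)
        + (1.0 - rule_share) * data_driven.get(tp, 0.0)
        for tp in touchpoints
    }

# Illustrative credit for one journey under each model (each sums to 1).
first_touch = {"Paid Search": 1.0}
data_driven = {"Paid Search": 0.2, "Webinar": 0.5, "Email": 0.3}

# 40% of the credit follows the first-touch rule, 60% is data-driven.
blended = blend_credit(first_touch, data_driven, rule_share=0.4)
```

Because both inputs sum to one unit of credit, the blended result does too, so the total value attributed to the journey is unchanged.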
