Compound attributes

Learn from this article:

This article describes how to implement attribute with the compound data type.

What are compound attributes

Imagine use case with flights. Flight can have multiple attributes like origin city, destination city, date of flight and price. With regular data types like string, boolean, date, datetime, int or float, we are able to create separate attributes like:

List of destination cities - London, Prague, …
Last destination city - London
Total revenue from all flights in USD - 1024
Dates of all flights - 2018-12-01, 2019-01-01, …

However, using separate attributes one cannot build a segment containing “all customers who flew to London in December 2018”, because we do not know which date of flight belongs to which destination. That is why we need the attributes of the compound data type.

The goal is to create 1 attribute with the origin, destination, date and price as sub-attributes (dimensions) of the attribute.

How compound attributes appear

This is done by creating a new, independent, isolated attribute with the compound data type (sub-attributes can have any of standard scalar data types) in the attributes table. Then, calculating values for the attribute.

In attributes table create a new attribute (row).

Fill in all standard values like ID, name, description, all flags, etc.

Then, for the data type, fill compound like:

compound([["origin","Origin","string"],["destination","Destination","string"],["date","Date","date"],["price","Price","float"]])

To see, what is going on, let’s break it down with some whitespace:

compound(
	[
		["origin","Origin","string"],
		["destination","Destination","string"],
		["date","Date","date"],
		["price","Price","float"]
	]
)

As you can see, the compound() is the name of the data type. Inside parentheses, there is a JSON array or arrays. The outer array is the array of sub-attributes. Each nested array is one sub-attribute.

The meaning of elements in the nested array is based on order. The first value is ID of sub-attribute (needs to be unique on sub-attribute level - no 2 sub-attributes in one attribute can have the same ID). The second value is the name of the sub-attribute (shown in UI). The third value is the data type of the sub-attribute, which can be any attribute scalar type: string, date, datetime, boolean, int or float. Sub-attribute can not be of type compound (no nested compound attributes). We are done with the attribute definition.

The value for such attribute will look like this:

["Prague","London","2018-12-01","128"]

Note: If it's a "List" type attribute, it will be multiple rows of values in the same format as the above JSON list.

The value is, again, a JSON array of 4 elements. First element is the origin, second destination, third date and fourth price. As you can see - the values follow the order of the sub-attributes defined in the attributes table’s column data_type. Note that all elements in the array must have double-quotes - be strings. The array must be homogenous.

There are multiple ways on how to calculate this attribute, here is one suggestion on how to do it.

SELECT
	json_build_array(
		"origin"::text, 
		"destination"::text, 
		"date"::text, 
		"price"::text
	) as "value"
FROM (
	SELECT
		payload->>'origin_city' as "origin",
		payload->>'destination_city' as "destination",
		payload->>'flight_date' as "date",
		payload->>'flight_price' as "price"
	FROM
		base
	WHERE
		"source_id"='flights_system'
		 AND "version" IN ('1-0-0')
		 AND "type" IN ('flight')
		 AND "customer_entity_id" IS NOT null
	) AS prep

The whole idea behind this query is that one takes values of interest from the base and combine it into the JSON array using json_build_array function.

Meiro Implementation Training

SOPs for Project Implementation

Meiro Customer Data Platform - Software Limits & Requirements

How to Request for FBM Access to Clients' Accounts

How to set up MI and ME API connection in CDP

Data Audit Checklist

Format events

Transform events (CR)

Load events to CDP

Load events to CDP (CR)

Customer Entity Retention Principles

How to Implement Meiro Events in MI and CDP and set up alerts for Meiro Events

Identity Stitching Setup

Identity Stitching rules

How to do Quality Checking on Identity Stitching

Identity Stitching edge cases and solutions

How to rerun identity stitching for all/some entities/events and recalculate attributes

Problems with large customer entities and how to investigate and solve them

Description

Attribute data types and sample values

Compound attributes

How to create context queries that require information of every customer entity

How to troubleshoot when there are no attribute values

How to delete attributes correctly

How to set up attributes in Meiro Business Explorer

How to generate simulated attributes values for demo purpose

Tutorial: How to set up RFM attributes (WIP)

Set export destinations

Facebook Custom Audience destination - upsert

The Trade Desk export destination

Google Analytics Destination

Criteo Destination

Twitter Ads Audience Destination

Google Analytics 4 Destination

Facebook Conversion API Implementation

How to find credentials for GA4 destination

Step-by-step guide for creating reports in Metabase

Set Up Business Explorer Instance & QA

Workspace documentation

Workspaces Monitoring Alert & Analyst Duty

Best Practices for Insights Creation

Component Testing and Documentation guide

Best Practices for Attributes Creation

Best Practices when setting up Destinations

Tutorial: How to implement product feed

Project Handover Procedures

Project Management Handbook

Project Management FAQ

How PMs work with Jira

Compound attributes

What are compound attributes

How compound attributes appear

No Comments