<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Pipeline Archives - Reflective Data</title>
	<atom:link href="https://www.reflectivedata.com/category/data-pipeline/feed/" rel="self" type="application/rss+xml" />
	<link>https://reflectivedata.com/category/data-pipeline/</link>
	<description></description>
	<lastBuildDate>Fri, 09 Aug 2024 12:09:25 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://reflectivedata.com/wp-content/uploads/2016/09/cropped-new-favicon-2-32x32.png</url>
	<title>Data Pipeline Archives - Reflective Data</title>
	<link>https://reflectivedata.com/category/data-pipeline/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Case Study: Overcoming The 1 Million Event Limit in GA4 Without Upgrading to GA360</title>
		<link>https://www.reflectivedata.com/case-study-overcome-1m-event-limit-ga4-without-ga360</link>
					<comments>https://www.reflectivedata.com/case-study-overcome-1m-event-limit-ga4-without-ga360#respond</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Tue, 08 Aug 2023 12:22:08 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Case Study]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[GA4]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=23650</guid>

					<description><![CDATA[<p>GA360 is a great tool, and for the enterprises that can afford and justify the cost, probably the best analytics tool they can invest in. At the same time, many companies don't have the budget to pay upwards of $150k for an analytics tool.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-overcome-1m-event-limit-ga4-without-ga360">Case Study: Overcoming The 1 Million Event Limit in GA4 Without Upgrading to GA360</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote><p><span style="font-size: 23px;"><em>GA360 is expensive and we felt like, for our use case, it would be too much. I was delighted to learn that Reflective Data can help us collect over 1M events per day in our BigQuery dataset for a fraction of the cost.</em></span></p>
<p style="text-align: right;">Jurgita, Digital Data Analyst, Vilnius</p>
</blockquote>
<p>GA360 is a great tool, and for the enterprises that can afford and justify the cost, probably the best analytics tool they can invest in. At the same time, many companies don&#8217;t have the budget to pay upwards of $150k for an analytics tool.</p>
<h2>The challenge</h2>
<p>One of the best features of GA4 over the previous version is that even the free version has the BigQuery export pipeline available.</p>
<p>The free version of GA4, though, has a limit on how many events can be exported each day. More specifically, this limit is set at 1M events per day. Barbora, like many companies with large traffic numbers, was reaching this limit in GA4 but at the same time wasn&#8217;t able to justify the cost of GA360.</p>
<h2>The solution</h2>
<p>At Reflective Data, we built a solution called Parallel Tracking for GA4.</p>
<p>Here&#8217;s how it works.</p>
<p>Parallel Tracking duplicates all HTTP requests going from your site or app to GA4 and sends them to Reflective Data&#8217;s processing endpoint.</p>
<p>Data is then processed just like GA4 does it and inserted into your BigQuery dataset. There is no limit on the daily event count with Parallel Tracking.</p>
<figure id="attachment_23652" aria-describedby="caption-attachment-23652" style="width: 745px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2023/08/Screenshot-2023-08-08-at-15.07.44.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img fetchpriority="high" decoding="async" class="size-full wp-image-23652" src="http://reflectivedata.com/wp-content/uploads/2023/08/Screenshot-2023-08-08-at-15.07.44.png" alt="GA4 to BigQuery 1m event limit" width="745" height="594" srcset="https://reflectivedata.com/wp-content/uploads/2023/08/Screenshot-2023-08-08-at-15.07.44.png 745w, https://reflectivedata.com/wp-content/uploads/2023/08/Screenshot-2023-08-08-at-15.07.44-700x558.png 700w" sizes="(max-width: 745px) 100vw, 745px" /></a><figcaption id="caption-attachment-23652" class="wp-caption-text">Schema for avoiding 1m event limit in GA4</figcaption></figure>
<p>&nbsp;</p>
<p>Getting started with Parallel Tracking is easy as 1, 2, 3.</p>
<h3>Step 1: Tracking code modification</h3>
<p>After signing up for <a href="http://reflectivedata.com/services/google-analytics-4-parallel-tracking/">Parallel Tracking</a>, you&#8217;ll have to enable it by modifying your GA4 implementation. We have plug-and-play solutions for Google Tag Manager, gtag.js, Measurement Protocol and most other implementation methods that GA4 has.</p>
<h3>Step 2: Pipeline activation</h3>
<p>In order to activate the pipeline, all you need to do is provide Reflective Data with a Google Cloud Platform Service Account key file. After providing the key file, GA4 data will start flowing into your BigQuery dataset.</p>
<h3>Step 3: Querying the data</h3>
<p>Once the pipeline is active, you can start querying your GA4 data just like with the default GA4 to BigQuery data export.</p>
<h2>Conclusion</h2>
<p>If your daily event volume in GA4 is close to or exceeds 1 million then you have two options. Upgrade to GA360 or implement <a href="http://reflectivedata.com/services/google-analytics-4-parallel-tracking/">Parallel Tracking</a>. If the event volume exceeding 1 million is the only reason you&#8217;re looking to upgrade, Parallel Tracking is likely a better option as it is several times more affordable compared to GA360.</p>
<p>Ending with another quote from the customer here.</p>
<blockquote><p><em>I guess you could say we were your average enterprise with LOTS of legacy data infrastructure that had been built over many years. This system was extremely complex and very expensive to maintain. When Reflective Data came in, they acted as true professionals, worked very closely with our IT and came up with the plan that pleased everyone.</em></p>
<p><em>Today, being on the new cloud-based data infrastructure for almost a year now, I can say with all certainty that this project was a success. Not only are we saving tens of thousands of dollars every month on the infrastructure alone, the amount of hours it takes to maintain the system has gone from hundreds down to a ten or so. This has huge impact on our business. Implementing anything new would&#8217;ve taken at least 6 months with the old system, now it&#8217;s a matter of week or two to get everything up and running.</em></p>
<p>Last but not the least, Reflective Data saved us another $150k year as we no longer needed to use GA360 and instead implemented their Parallel Tracking solution for GA4.</p>
<p style="text-align: right;">Daria, VP of Marketing, Warsaw</p>
</blockquote>
<p>It&#8217;s feedback like this that makes us love the work we do even more! <a href="http://reflectivedata.com/services/google-analytics-4-parallel-tracking/">Get in touch</a> and learn how we can help you, too.</p>
<p>For more case studies, <a href="http://reflectivedata.com/case-studies/">see here</a>.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-overcome-1m-event-limit-ga4-without-ga360">Case Study: Overcoming The 1 Million Event Limit in GA4 Without Upgrading to GA360</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/case-study-overcome-1m-event-limit-ga4-without-ga360/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Export Experiment Data From Google Optimize &#8211; While You Still Can</title>
		<link>https://www.reflectivedata.com/export-experiment-data-from-google-optimize</link>
					<comments>https://www.reflectivedata.com/export-experiment-data-from-google-optimize#respond</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Tue, 24 Jan 2023 10:43:12 +0000</pubDate>
				<category><![CDATA[A/B testing]]></category>
		<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Optimize]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=20215</guid>

					<description><![CDATA[<p>Google Optimize and Optimize 360 will no longer be available after September 30, 2023. Your experiments and personalizations can continue to run until that date. Any experiments and personalizations still active on that date will end.</p>
<p>In order to not lose your data, you should act on exporting it now!</p>
<p>The post <a href="https://www.reflectivedata.com/export-experiment-data-from-google-optimize">Export Experiment Data From Google Optimize &#8211; While You Still Can</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Google Optimize and Optimize 360 will no longer be available after <strong>September 30, 2023</strong>. Your experiments and personalizations can continue to run until that date. Any experiments and personalizations still active on that date will end.</p>
<p>This came as an unwelcome surprise for anyone working in the experimentation industry. Even if you didn&#8217;t use the tool itself, it was the first tool most newcomers used to get themselves into experimenting on their websites.</p>
<p>For a while, Google Optimize going away was everything people talked about on Twitter and LinkedIn.</p>
<figure id="attachment_20216" aria-describedby="caption-attachment-20216" style="width: 1120px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img decoding="async" class="size-full wp-image-20216" src="http://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06.png" alt="Google Optimize Data Export" width="1120" height="422" srcset="https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06.png 1120w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06-700x264.png 700w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06-1024x386.png 1024w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-14.21.06-768x289.png 768w" sizes="(max-width: 1120px) 100vw, 1120px" /></a><figcaption id="caption-attachment-20216" class="wp-caption-text"><a href="https://twitter.com/SimoAhava/status/1616660321346658306">Source</a></figcaption></figure>
<h2>Exporting Experiment Data From Google Optimize</h2>
<p>Since several of our existing clients asked for it, we built <a href="http://reflectivedata.com/services/google-optimize-data-export">Google Optimize Data Exporter</a> to store your experiment data for as long as you need it. It supports almost any data destination, including popular ones like Google BigQuery, Amazon S3 and Snowflake.</p>
<p>Google Optimize Data Exporter runs on the same robust and scalable <a href="http://reflectivedata.com/analytics-data-pipeline/integrations">Reflective Data Infrastructure</a> that you hopefully already know and love.</p>
<figure id="attachment_20201" aria-describedby="caption-attachment-20201" style="width: 2220px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img decoding="async" class="size-full wp-image-20201" src="http://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20.png" alt="Google Optimize Data Export" width="2220" height="286" srcset="https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20.png 2220w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20-700x90.png 700w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20-1024x132.png 1024w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20-768x99.png 768w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20-1536x198.png 1536w, https://reflectivedata.com/wp-content/uploads/2023/01/Screenshot-2023-01-24-at-10.52.20-2048x264.png 2048w" sizes="(max-width: 2220px) 100vw, 2220px" /></a><figcaption id="caption-attachment-20201" class="wp-caption-text">Google Optimize Data Export</figcaption></figure>
<p>We&#8217;ve made the process of exporting your Google Optimize data as simple as possible. Here&#8217;s a quick overview.</p>
<h3>1. Planning and scoping</h3>
<p><a href="http://reflectivedata.com/services/google-optimize-data-export#services-contact-section" target="_blank" rel="noopener">Get in touch</a> with one of our data analysts to plan your Google Optimize Data export. The main questions to answer are the list of experiments, dimensions, metrics, time frames and the data destination you wish to use for your Optimize data export.</p>
<h3>2. Data export and storage</h3>
<p>Executing the plan. Our data analyst will configure Reflective Data Export System to pull the requested data from your Google Optimize instance and store at your chosen data storage destination.</p>
<p>Most exports use Google BigQuery as a data destination.</p>
<h3>3. Reporting and consultation</h3>
<div>Need help accessing or using the data? Reflective Data experts are happy to assist you with everything ranging from configuring interactive reports to consulting you on maximising insights you can draw from this dataset.</div>
<div>
<section class="elementor-section elementor-top-section elementor-element elementor-element-9407817 elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="9407817" data-element_type="section">
<div class="elementor-container elementor-column-gap-default">
<div class="elementor-row">
<div class="elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-2bcdcea" data-id="2bcdcea" data-element_type="column">
<div class="elementor-column-wrap elementor-element-populated">
<div class="elementor-widget-wrap">
<div class="elementor-element elementor-element-5f9b35c elementor-widget elementor-widget-heading" data-id="5f9b35c" data-element_type="widget" data-widget_type="heading.default">
<div class="elementor-widget-container">
<h2 class="elementor-heading-title elementor-size-default">Why is Google Optimize being sunset?</h2>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</section>
<section class="elementor-section elementor-top-section elementor-element elementor-element-adf23db elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="adf23db" data-element_type="section">
<div class="elementor-container elementor-column-gap-default">
<div class="elementor-row">
<div class="elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-fe4e504" data-id="fe4e504" data-element_type="column">
<div class="elementor-column-wrap elementor-element-populated">
<div class="elementor-widget-wrap">
<div class="elementor-element elementor-element-ae4d3b9 elementor-widget elementor-widget-text-editor" data-id="ae4d3b9" data-element_type="widget" data-widget_type="text-editor.default">
<div class="elementor-widget-container">
<div class="elementor-text-editor elementor-clearfix">
<h3><strong>Official statement</strong></h3>
<blockquote>
<div>
<p><em>We remain committed to enabling businesses of all sizes to improve your user experiences and are investing in A/B testing in Google Analytics 4. We are focused on bringing the most effective solutions and integrations to our customers, especially as we look toward the future with Google Analytics 4.</em></p>
<p><em>Optimize, though a longstanding product, does not have many of the features and services that our customers request and need for experimentation testing. We therefore have decided to invest in solutions that will be more effective for our customers.</em></p>
</div>
</blockquote>
<p>At Reflective Data, we’re sad to see Google Optimize go. Especially because it enabled so many smaller teams to get started with experimenting with their site.</p>
<p>On the other hand, this will create a big opportunity for the other, dedicated experimentation vendors, to fill this cap in the market.</p>
<p>We’re quite sure, GA4 will improve its experimentation reporting capabilities but running the experiments themselves will likely stay outside of Google’s ecosystem.</p>
<p>Either way, if you have run experiments on Google Optimize, you should export your data ASAP. If you need help, <a href="http://reflectivedata.com/services/google-optimize-data-export">we’ve got you covered</a>.</p>
<section class="elementor-section elementor-top-section elementor-element elementor-element-aba2aff elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="aba2aff" data-element_type="section">
<div class="elementor-container elementor-column-gap-default">
<div class="elementor-row">
<div class="elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-a9cb833" data-id="a9cb833" data-element_type="column">
<div class="elementor-column-wrap elementor-element-populated">
<div class="elementor-widget-wrap">
<div class="elementor-element elementor-element-e01131c elementor-widget elementor-widget-heading" data-id="e01131c" data-element_type="widget" data-widget_type="heading.default">
<div class="elementor-widget-container">
<h2 class="elementor-heading-title elementor-size-default">Google Optimize Alternatives</h2>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</section>
<section class="elementor-section elementor-top-section elementor-element elementor-element-f5dfbc5 elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="f5dfbc5" data-element_type="section">
<div class="elementor-container elementor-column-gap-default">
<div class="elementor-row">
<div class="elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-7fcce66" data-id="7fcce66" data-element_type="column">
<div class="elementor-column-wrap elementor-element-populated">
<div class="elementor-widget-wrap">
<div class="elementor-element elementor-element-21a5e55 elementor-widget elementor-widget-text-editor" data-id="21a5e55" data-element_type="widget" data-widget_type="text-editor.default">
<div class="elementor-widget-container">
<div class="elementor-text-editor elementor-clearfix">
<p>While we don’t directly partner with any of the testing tools vendors, we do have extensive experience using most of them. Including Optimizely, VWO, Convert, Adobe Target, Sitespect, AB Tasty and Mutiny – to name a few.</p>
<p>Choosing your alternative to Google Optimize depends and various factors like your company’s experimentation maturity, budget and tech stack.</p>
<p>Instead of promoting any of the more traditional testing tools, we would like to encourage you to learn more about an open-source alternative <a href="https://www.growthbook.io/">GrowthBook</a>.</p>
<p>We’ve helped several companies implement GrowthBook and would be happy to discuss this option with you, too. Below are some of the reasons why you might want to consider GrowthBook as your Google Optimize alternative.</p>
<ul>
<li>Free and open-source</li>
<li>Full data ownership</li>
<li>Sits on top of your data warehouse (i.e. BigQuery)</li>
<li>Supports both client-side and server-side testing</li>
</ul>
<h2>Conclusion</h2>
<p>Google Optimize as we knew and loved it is going away on September 30, 2023.</p>
<p>If you ever used Google Optimize to run experiments on your website, you should export this data for future reference.</p>
<p>While you can attempt exporting Optimize data manually using Google Analytics Reporting API but it&#8217;s much easier done using Reflective Data&#8217;s <a href="http://reflectivedata.com/services/google-optimize-data-export">Google Optimize Data Exporter</a>.</p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</section>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</section>
</div>
<p>The post <a href="https://www.reflectivedata.com/export-experiment-data-from-google-optimize">Export Experiment Data From Google Optimize &#8211; While You Still Can</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/export-experiment-data-from-google-optimize/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Increase Your Company&#8217;s Profits by Getting and Leveraging a Data Warehouse</title>
		<link>https://www.reflectivedata.com/increase-your-companys-profits-by-getting-and-leveraging-a-data-warehouse/</link>
					<comments>https://www.reflectivedata.com/increase-your-companys-profits-by-getting-and-leveraging-a-data-warehouse/#respond</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Thu, 11 Aug 2022 11:21:15 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=14731</guid>

					<description><![CDATA[<p>Possible use cases for a data warehouse are virtually limitless and depend on what kind of business you run. In this article, I'm providing some of the more common ways together with examples of how to benefit from having a data warehouse.</p>
<p>The post <a href="https://www.reflectivedata.com/increase-your-companys-profits-by-getting-and-leveraging-a-data-warehouse/">Increase Your Company&#8217;s Profits by Getting and Leveraging a Data Warehouse</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Most companies that contact us (Reflective Data) for data services already have specific use cases for their data infrastructure (mainly data pipelines and a data warehouse). Nevertheless, we occasionally get approached by businesses that kind of know they&#8217;d need a data warehouse but aren&#8217;t entirely sure how to benefit from having one.</p>
<p>Possible use cases are virtually limitless and depend on what kind of business you run. In this article, I&#8217;m providing some of the more common ways together with examples of how to benefit from having a data warehouse.</p>
<p>In no specific order, here we go.</p>
<h3>Better attribution leads to more optimized marketing and ad spend</h3>
<p>It&#8217;s no secret that some analytics tools and most ad platforms tend to default to attribution models that work best for <em>them</em>. I mean, they use models that show as if they&#8217;re bringing you the most traffic and conversions. Some tools let you modify the model or attribution windows to some extent but generally, it&#8217;s quite limited.</p>
<blockquote><p>Let&#8217;s say someone scrolled past your ad on Facebook without clicking on it. Later that day they receive an email from you with a nice offer – they end up visiting your suite and making a purchase. Guess what Facebook will tell where this purchase came from? You guessed it, they will attribute it to seeing the ad on Facebook.</p></blockquote>
<p>Now imagine having all this attribution data available in your data warehouse in a raw format. This will give you full flexibility over which attribution model and window to use. No longer &#8220;apples to oranges&#8221; comparisons when it comes to your traffic sources.</p>
<p>Combine this with data from other sources like your CRM or backend and you&#8217;ll have the ultimate attribution machine in your hands. Use attribution models like <a href="https://towardsdatascience.com/into-to-markov-chain-multi-touch-attribution-bb1968ff1f54" target="_blank" rel="noopener">Markov Chain</a> or even AI/ML-based models and use metrics like LTV or churn for long-term analysis.</p>
<p>Proper attribution with an &#8220;apples to apples&#8221; comparison is the key to optimizing your marketing and ad spend.</p>
<p>Perhaps the best way to start collecting this kind of data in your data warehouse would be <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a>.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-14688" src="http://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png" alt="Reflective Data Pipeline" width="1633" height="612" srcset="https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png 1633w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-700x262.png 700w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-1024x384.png 1024w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-768x288.png 768w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-1536x576.png 1536w" sizes="(max-width: 1633px) 100vw, 1633px" /></a></p>
<h3>More accurate audiences lead to better targeting and savings in ad spend</h3>
<p>What if I told you there was a way to evaluate visitors or groups of visitors based on their likelihood of becoming a customer or even a high-value long-term loyal customer? All before they even land on your website or make their first purchase. Imagine building your advertising audiences based on those predictions while constantly improving the model. Well, the big players have been doing this for years but today the barriers have lowered and most businesses could leverage this kind of technology. Trust me, it&#8217;s easier than you might think.</p>
<p>There are three main components to getting started with AI/ML-based audience future value predictions.</p>
<ol>
<li>Set up a data warehouse &#8211; build yourself or hire <a href="http://reflectivedata.com/services/analytics-services/">Reflective Data</a> to consult and help with the technical setup</li>
<li>Configure data pipelines (the systems that transfer data from various sources into your data warehouse) &#8211; <a href="http://reflectivedata.com/analytics-data-pipeline/">We&#8217;ve got you covered</a> if you need help</li>
<li>Leverage a system similar to <a href="https://cloud.google.com/blog/products/data-analytics/predictive-marketing-analytics-using-bigquery-ml-machine-learning-templates" target="_blank" rel="noopener">BigQuery ML</a> for the predictive models &#8211; Yes, <a href="http://reflectivedata.com/services/analytics-services/">we can help</a> with the ML part, too</li>
</ol>
<p>Instead of spending your precious marketing budget on people that&#8217;ll never buy from you, focus on those that have the highest potential of becoming high LTV customers.</p>
<p>$$$</p>
<h3>Data-driven product recommendations lead to higher AOV</h3>
<p>Product recommendations, if done right, can provide enormous value for both the customers and merchants. Now that we have tools like <a href="https://cloud.google.com/recommendations" target="_blank" rel="noopener">Recommendations AI</a> from Google, getting started with your own recommendations engine has never been easier. Or cheaper.</p>
<p>All you need is data, preferably in a data warehouse like BigQuery. For this use case, I&#8217;d recommend using <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a> to get your data into your data warehouse.</p>
<p>Case study: How IKEA managed to increase their Click Through Rates by 30% and AOV by 2% by leveraging the Recommendations AI. <a href="https://www.youtube.com/watch?v=PyjC0wRRtBg" target="_blank" rel="noopener">LINK</a></p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1.jpg" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-16395" src="http://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1.jpg" alt="ikea data warehouse product recommendations" width="2200" height="917" srcset="https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1.jpg 2200w, https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1-700x292.jpg 700w, https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1-1024x427.jpg 1024w, https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1-768x320.jpg 768w, https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1-1536x640.jpg 1536w, https://reflectivedata.com/wp-content/uploads/2022/08/ikea_retail_3.max-2200x2200-1-2048x854.jpg 2048w" sizes="(max-width: 2200px) 100vw, 2200px" /></a></p>
<h3>Data-driven marketing automation drives more sales</h3>
<p>Still sending generic emails to all customers or manually building audiences? Welcome to the modern days where we can leverage the rich customer data available in our data warehouses. Use it to automate many critical parts of our marketing. Including emails.</p>
<p>A few scenarios to consider.</p>
<ul>
<li>Predictive model described above detects people with the highest future customer lifetime value (LTV). Nudge those people a bit by sending them an offer that&#8217;s a bit more generous than what you&#8217;d send to everyone. Believe me, it&#8217;s worth it.</li>
<li>Include personal product recommendations in your marketing emails.</li>
</ul>
<p>To get high-quality data into your data warehouse, you need a <a href="http://reflectivedata.com/analytics-data-pipeline/">data pipeline</a> that connects with your existing tools.</p>
<h3>Data-driven personalization drives more sales and higher AOV</h3>
<p>There are many ways how you can introduce personalization to your customers but one aspect is true for all of them – they all need good quality data as input.</p>
<p>Below are a few common personalization solutions our clients have implemented.</p>
<ul>
<li>Customize the front page of your e-commerce (or any other) website based on each customer&#8217;s previous browsing and shopping behavior. This can include banners, offers, discounts, product recommendations and more.</li>
<li>Customize the website experience based on the type of customer that has landed on your site. Depends on your business type and target audience but generally individuals, people representing small businesses and those shopping for a large enterprise all expect a somewhat different experience.</li>
<li>Personalize overall messaging based on visitors&#8217; behavior. Let&#8217;s say you&#8217;re an insurance company and someone landed on your blog article comparing different car insurance options, then they move on and read a few more related articles. Now, when they land on your main website, it&#8217;d be good to welcome them with relevant car insurance options as opposed to generic content.</li>
</ul>
<p>To get high-quality data into your data warehouse, you need a <a href="http://reflectivedata.com/analytics-data-pipeline/">data pipeline</a> that connects with your existing tools.</p>
<h3>Analyzing user behavior can lead to savings in customer support costs</h3>
<p>Maintaining a proper support team can be costly. Many companies that we&#8217;ve worked with could&#8217;ve (and eventually have) saved anywhere from 5% to 60% in support-related costs by having relevant data available in their data warehouse, analyzing it and, of course, making the changes based on insights found.</p>
<p>Data points to collect here include.</p>
<ul>
<li>Live chat messages</li>
<li>Call transcripts</li>
<li>Site search behavior</li>
<li>Help center behavior</li>
<li>Social media, Reddit and other platforms</li>
</ul>
<p>To collect this kind of data into your data warehouse, you need a <a href="http://reflectivedata.com/analytics-data-pipeline/">data pipeline</a> that connects with the tools necessary.</p>
<p>Potential ways to execute those insights include.</p>
<ul>
<li>A/B testing</li>
<li>Chat bot</li>
<li>Re-arranging help center</li>
<li>Train support personnel</li>
<li>Improve your product UI/UX</li>
<li>Better onboarding guides</li>
</ul>
<h3>Running an experimentation program on top of a data warehouse will save $ on tool vendor costs</h3>
<p>Most conventional experimentation (A/B testing) tools like Optimizely, Google Optimize and others operate in silos and keep their own copy of all the data they need to operate. In reality, though, this dataset is very similar to many other tools that you may already use, including Google Analytics. Having another tool to operate in a silo isn&#8217;t (cost) effective and can often lead to data discrepancies.</p>
<p>Modern experimentation tools like <a href="https://www.geteppo.com/">Eppo</a>, <a href="https://www.growthbook.io/">Growthbook</a> and others don&#8217;t create another siloed copy of your data. Instead, they will sit on top of your data warehouse and use data you already have – nicely integrated with various sources. Not only is this much more cost-effective but it also helps you avoid creating yet another source for data discrepancies. Not to mention that you don&#8217;t have to define all the goals beforehand as you can simply use SQL to define those after the fact. Or while the test is already running.</p>
<p>A simple <a href="http://reflectivedata.com/analytics-data-pipeline/">data pipeline</a> and an open-source tool like Growthbook is all you need to get started. Save tons compared to tools like Optimizely and get a more robust solution that enables you to test across all parts of your tech stack. Front end, back end, APIs, databases and more.</p>
<figure id="attachment_16226" aria-describedby="caption-attachment-16226" style="width: 2526px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42.jpg" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-16226" src="http://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42.jpg" alt="Growthbook open-source A/B testing" width="2526" height="952" srcset="https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42.jpg 2526w, https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42-700x264.jpg 700w, https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42-1024x386.jpg 1024w, https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42-768x289.jpg 768w, https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42-1536x579.jpg 1536w, https://reflectivedata.com/wp-content/uploads/2022/08/Screenshot-2022-08-03-at-23.00.42-2048x772.jpg 2048w" sizes="(max-width: 2526px) 100vw, 2526px" /></a><figcaption id="caption-attachment-16226" class="wp-caption-text">Growthbook open-source A/B testing</figcaption></figure>
<h3>Using a data warehouse in favor of a conventional analytics tool will save $ on tool vendor costs</h3>
<p>There are free analytics tools out there but they all come with quite serious limitations which leads most bigger companies to look for a paid solution. What most tools really do is provide you with a tracker, data processing endpoint and a few shiny reports – oh, and ask a fortune for doing so.</p>
<p>In reality, all you really need is a tool like <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a> that connects with your existing Google Analytics (both UA and GA4) trackers while duplicating hits to another processing endpoint and then sending them straight into your data warehouse. Free from data collection, cardinality or other limitations which are often seen even in the paid analytics platforms. From there, connect your BI tool and build all the reports you might ever need. All while saving like 10x compared to something like GA360.</p>
<p>As a bonus, you&#8217;ll likely have data from other sources in your data warehouse, too. This unlocks the potential to do even more advanced analysis that&#8217;s not possible with any one analytics tool alone.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-14688" src="http://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png" alt="Reflective Data Pipeline" width="1633" height="612" srcset="https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline.png 1633w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-700x262.png 700w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-1024x384.png 1024w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-768x288.png 768w, https://reflectivedata.com/wp-content/uploads/2022/06/reflective-data-pipeline-1536x576.png 1536w" sizes="(max-width: 1633px) 100vw, 1633px" /></a></p>
<h2>Conclusion</h2>
<p>It&#8217;s not a question of whether you need a data warehouse or even whether the ROI is there. It&#8217;s more about the specific tools and tech you&#8217;re going to implement from the get-go – switching a vendor later is a lot of effort and costly, too.</p>
<p>Our recommendation is to avoid CDPs and other fancy buzzword tools and instead get something more robust – BigQuery as your data warehouse, a set of <a href="http://reflectivedata.com/analytics-data-pipeline/">data pipelines</a>, a BI tool for reporting and then go from there.</p>
<p>Data engineers at Reflective Data would be more than happy to answer your questions in the comments below or hop on a quick <a href="http://reflectivedata.com/analytics-data-pipeline/#services-contact-section">free consultation session</a>.</p>
<p>The post <a href="https://www.reflectivedata.com/increase-your-companys-profits-by-getting-and-leveraging-a-data-warehouse/">Increase Your Company&#8217;s Profits by Getting and Leveraging a Data Warehouse</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/increase-your-companys-profits-by-getting-and-leveraging-a-data-warehouse/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Case Study: Building and maintaining a Data Pipeline and Data Warehouse for the Enterprise</title>
		<link>https://www.reflectivedata.com/case-study-building-and-maintaining-a-data-pipeline-and-data-warehouse-for-the-enterprise/</link>
					<comments>https://www.reflectivedata.com/case-study-building-and-maintaining-a-data-pipeline-and-data-warehouse-for-the-enterprise/#respond</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Mon, 20 Sep 2021 10:35:22 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Case Study]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=11562</guid>

					<description><![CDATA[<p>At Reflective Data, we've worked with companies big and small. This means we have seen all levels of maturity when it comes to the infrastructure and knowledge around data pipelines and data warehouses.</p>
<p>Some of the most challenging projects have been enterprises with quite some infrastructure, legacy pipelines, and of course, opinions. Smaller businesses are just starting to adopt the concept of having all of their data stored in a data warehouse but many enterprises have been doing this for a decade!</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-building-and-maintaining-a-data-pipeline-and-data-warehouse-for-the-enterprise/">Case Study: Building and maintaining a Data Pipeline and Data Warehouse for the Enterprise</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote><p><span style="font-size: 23px;"><em>The amount of load that Reflective Data was able to lift from the shoulders of my team was unbeliavable. Without their help, we&#8217;d still be building out the pipelines and never gotten to the level of advanced analysis and ML that we&#8217;re able to do now. They let us focus on what brings the most value to our business.</em></span></p>
<p style="text-align: right;">Melanie, Director of Data Analytics, Frankfurt</p>
</blockquote>
<p>At Reflective Data, we&#8217;ve worked with companies big and small. This means we have seen all levels of maturity when it comes to the infrastructure and knowledge around data pipelines and data warehouses.</p>
<p>Some of the most challenging projects have been enterprises with quite some infrastructure, legacy pipelines, and of course, opinions. Smaller businesses are just starting to adopt the concept of having all of their data stored in a data warehouse but many enterprises have been doing this for a decade!</p>
<h2>The challenge</h2>
<p>When many of the enterprises that we&#8217;ve worked with started building their data pipelines, they didn&#8217;t have tools like Airflow, BigQuery etc. that we use and love today. This means the bulk of it was built in-house. Even the concept of cloud computing was in its early days and most operations were kept on-premise.</p>
<p>The challenge with this kind of setup starts by understanding the existing setup. In some cases, the documentation is close to none and the people that built it are no longer with the company. This alone can take a month or so – mapping everything out, understanding the structure, creating the plan for moving forward.</p>
<p>Another challenge is getting everyone on the team on board. More often than not there are people who value the work that has been put into the old system over the years so much that it blinds them from seeing the obvious benefits of moving to a much more modern infrastructure.</p>
<h2>The solution</h2>
<p>When working with the enterprise and legacy infrastructure, nothing happens overnight. Below are the phases of a typical project of getting an enterprise client onto a modern cloud-based data infrastructure.</p>
<h3>Phase 1: understanding and mapping the existing situation</h3>
<p>With most enterprises, it&#8217;s not just one team or system that depends on the data infrastructure. More often than not, this is the backbone of the entire business. This means we need to make sure we understand every aspect of the current system, where it gets the data, how it&#8217;s being processed and what processes depend on this data.</p>
<h3>Phase 2: planning the infrastructure</h3>
<p>We do our best to work closely with all teams involved to make sure their needs are taken into account. This means a series of on-hands meetings where we learn about their use cases and problems they&#8217;re having with the existing setup. The output of this phase is a clear plan for moving forward, including the tool stack, reporting mechanisms and several feedback rounds to make sure everyones&#8217; needs are taken into account.</p>
<h3>Phase 3: implementation</h3>
<p>Depending on the in-house knowledge, resources and other aspects, a company can decide to implement the plan themselves and continue using Reflective Data as a consultant or hire us to handle the technical execution as well. By far, the most effective arrangement in our experience has been where we do the bulk of the work while including a few technical people from the client&#8217;s side in every step of the process. In some cases, those people is hired specifically for this purpose.</p>
<h3>Phase 4: monitoring, reporting and integrations</h3>
<p>The whole point of having high-quality data is to make it actionable. Of course, we handle core integrations within the implementation phase but in a sense, data infrastructure is a growing organism that needs constant attention. Reflective Data is here to build long-term relationships with its clients, ready to help whenever there&#8217;s a new data source to be added, a report to be built or if a new team member needs training.</p>
<h2>Conclusion</h2>
<p>Moving away from a legacy data infrastructure is one of the best actions an enterprise can take towards being more data-driven, more effective in managing the infrastructure and, in all reality, keeping up with the competition.</p>
<p>Ending with another quote from the customer here.</p>
<blockquote><p><em>I guess you could say we were your average enterprise with LOTS of legacy data infrastructure that had been built over many years. This system was extremely complex and very expensive to maintain. When Reflective Data came in, they acted as true professionals, worked very closely with our IT and came up with the plan that pleased everyone.</em></p>
<p><em>Today, being on the new cloud-based data infrastructure for almost a year now, I can say with all certainty that this project was a success. Not only are we saving tens of thousands of dollars every month on the infrastructure alone, the amount of hours it takes to maintain the system has gone from hundreds down to a ten or so. This has huge impact on our business. Implementing anything new would&#8217;ve taken at least 6 months with the old system, now it&#8217;s a matter of week or two to get everything up and running.</em></p>
<p style="text-align: right;">Julien, VP of Marketing, Austin</p>
</blockquote>
<p>It&#8217;s feedback like this that makes us love the work we do even more! <a href="http://reflectivedata.com/services/analytics-services/">Get in touch</a> and learn how we can help you, too.</p>
<p>For more case studies, <a href="http://reflectivedata.com/case-studies/">see here</a>.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-building-and-maintaining-a-data-pipeline-and-data-warehouse-for-the-enterprise/">Case Study: Building and maintaining a Data Pipeline and Data Warehouse for the Enterprise</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/case-study-building-and-maintaining-a-data-pipeline-and-data-warehouse-for-the-enterprise/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Measure Long-Term Metrics Like Customer Lifetime Value (LTV) Using Google Analytics</title>
		<link>https://www.reflectivedata.com/measure-long-term-metrics-like-customer-lifetime-value-ltv-using-google-analytics/</link>
					<comments>https://www.reflectivedata.com/measure-long-term-metrics-like-customer-lifetime-value-ltv-using-google-analytics/#comments</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Tue, 25 May 2021 13:43:07 +0000</pubDate>
				<category><![CDATA[A/B testing]]></category>
		<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=9865</guid>

					<description><![CDATA[<p>Long-term metrics like customer lifetime value (LTV) and churn can be so much more insightful and lead to better results when optimized for when compared to the more basic metrics like transactions or revenue. Yet, these metrics are often ignored or at least not involved in the analysis and optimization processes enough. One of the reasons is that it's quite difficult to track them using common analytics and testing tools like Google Analytics and Optimize.</p>
<p>In this article, we are going to explore some of the ways we can leverage Google Analytics to track churn, LTV and other really useful metrics.</p>
<p>The post <a href="https://www.reflectivedata.com/measure-long-term-metrics-like-customer-lifetime-value-ltv-using-google-analytics/">Measure Long-Term Metrics Like Customer Lifetime Value (LTV) Using Google Analytics</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Long-term metrics like customer lifetime value (LTV) and churn can be so much more insightful and lead to better results when optimized for when compared to the more basic metrics like transactions or revenue. Yet, these metrics are often ignored or at least not involved in the analysis and optimization processes enough. One of the reasons is that it&#8217;s quite difficult to track them using common analytics and testing tools like Google Analytics and Optimize.</p>
<p>In this article, we are going to explore some of the ways we can leverage Google Analytics to track churn, LTV and other really useful metrics.</p>
<p>Depending on the software you&#8217;re using, there may be some off-the-shelf solutions that you can install. For example, if you&#8217;re on Shopify then you can use something like <a href="https://www.littledata.io/">Littledata</a> to send a more accurate LTV value into a custom dimension in Google Analytics. More often than not, though, there is no good solution available or you just need more control over the setup.</p>
<p>One common misconception is that such long-term retention metrics are relevant for a few specific business types only. Yes, metrics like churn are vital for SaaS and subscription products but any company that gets return business should have their long-term KPIs in place. And I don&#8217;t mean simply tracking them but actually analyzing them and optimizing the business with those metrics in mind.</p>
<blockquote><p><em>Acquiring a new customer is anywhere from five to 25 times more expensive than retaining an existing one. It makes sense: you don’t have to spend time and resources going out and finding a new client — you just have to keep the one you have happy.</em></p>
<p style="text-align: right;"><a href="https://hbr.org/2014/10/the-value-of-keeping-the-right-customers#:~:text=Depending%20on%20which%20study%20you,the%20one%20you%20have%20happy.">Harvard Business Review</a></p>
</blockquote>
<p>So, if you&#8217;ve been focusing on getting new customers and metrics like revenue or transactions, this article is just for you!</p>
<h2>How to measure retention metrics like LTV and churn</h2>
<p>The long-term retention metrics most relevant to you depend on the type of business you&#8217;re working with but the most common ones are customer lifetime value (LTV) and churn. Below is a list of other popular retention KPIs. Think about the ones that would be relevant for your business.</p>
<p>Common Customer Retention Metrics</p>
<ol>
<li>Customer Churn</li>
<li>Revenue Churn</li>
<li>Existing Customer Growth Rate</li>
<li>Repeat Purchase Ratio</li>
<li>Product Return Rate</li>
<li>Days Sales Outstanding</li>
<li>Net Promoter Score</li>
<li>Time Between Purchases</li>
<li>Loyal Customer Rate</li>
<li>Customer Lifetime Value</li>
</ol>
<p style="text-align: right;"><a href="https://blog.hubspot.com/service/customer-retention-metrics">Source</a></p>
<p>Almost all retention metrics require a proper <a href="http://reflectivedata.com/everything-need-know-google-analytics-user-id/">User ID implementation</a>. This means you&#8217;d have to identify the user over time and even if they&#8217;re using multiple devices or browsers. Luckily, in most cases, actions like completing a purchase or signing up for a subscription do involve some kind of authentication.</p>
<p>While it&#8217;s possible to track retention metrics with Google Analytics alone, in most cases you&#8217;d get much better (more accurate) results when combining it with some other technology. Let&#8217;s explore two of the more popular options.</p>
<h3>Sending retention data into Google Analytics</h3>
<p>This solution involves sending retention data into a <a href="http://reflectivedata.com/ideas-for-google-analytics-custom-dimensions-and-metrics/">custom dimension or a custom metric</a> in Google Analytics.</p>
<p>The exact workflow depends on the software (CRM, CMS, database etc.) your site uses but the general process would look something like this.</p>
<ol>
<li>Create a custom dimension in Google Analytics (should be user-scoped)<br />
<a  href="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9893" src="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39.png" alt="Retention related custom metrics in Google Analytics" width="1168" height="546" srcset="https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39.png 1168w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39-700x327.png 700w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39-1024x479.png 1024w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.18-21_42_39-768x359.png 768w" sizes="(max-width: 1168px) 100vw, 1168px" /></a></li>
<li>For logged-in/identified users, pull/calculate the values for the relevant retention metrics from a database or other system (CRM, CMS etc.)<br />
Something like this if your order data is stored in BigQuery.<br />
<a  href="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-nimbusweb.me-2021.05.18-22_00_07.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9894" src="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-nimbusweb.me-2021.05.18-22_00_07.png" alt="Query retention metrics from BigQuery" width="687" height="392" /></a></li>
<li>Make the retention metrics available in the <a href="https://developers.google.com/tag-manager/devguide">data layer</a><br />
<a  href="http://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.06.56.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9895" src="http://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.06.56.png" alt="Retention metrics in the dashboard" width="310" height="132" /></a></li>
<li>Use Google Tag Manager to send your retention metrics to Google Analytics, using the custom dimension or metrics slots/indices according to how you configured them in step #1<a  href="http://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.10.04.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9896" src="http://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.10.04.png" alt="Sending retention metrics to Google Analytics" width="796" height="217" srcset="https://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.10.04.png 796w, https://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.10.04-700x191.png 700w, https://reflectivedata.com/wp-content/uploads/2021/05/Screenshot-2021-05-18-at-22.10.04-768x209.png 768w" sizes="(max-width: 796px) 100vw, 796px" /></a></li>
</ol>
<p>Now, having this data available in Google Analytics, you do whatever you want with it. Here are a few examples.</p>
<p><strong>Using LTV in a Google Analytics custom report</strong></p>
<p>&nbsp;</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_06_39.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9931" src="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_06_39.png" alt="Using LTV in a Google Analytics custom report" width="977" height="520" srcset="https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_06_39.png 977w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_06_39-700x373.png 700w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_06_39-768x409.png 768w" sizes="(max-width: 977px) 100vw, 977px" /></a></p>
<p>&nbsp;</p>
<p><strong>LTV in the Google Analytics user explorer report</strong></p>
<p>&nbsp;</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_13_08.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-9932" src="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_13_08.png" alt="LTV in the Google Analytics user explorer report" width="987" height="843" srcset="https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_13_08.png 987w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_13_08-700x598.png 700w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-analytics.google.com-2021.05.21-16_13_08-768x656.png 768w" sizes="(max-width: 987px) 100vw, 987px" /></a></p>
<p>Notice the difference between LTV that Google Analytics is reporting by default ($439) and the value we see in our custom dimension ($2,016). This is because Google Analytics can&#8217;t keep a track of the user as accurately as your backend system or an e-commerce platform you&#8217;re using. The same goes with other retention metrics, getting accurate metrics requires some custom work.</p>
<p>The list of possible use cases for this kind of data is unlimited. For example, think about creating custom segments in Google Analytics for customers that are in the top 10% in terms of LTV and see what differentiates them from the rest of the visitors. Besides making more/larger purchases, of course. Things like their traffic source, what pages they landed on, what A/B test variants they saw etc. can be quite insightful.</p>
<p>Talking about ways you can analyze your data and the insights it will give you. Let&#8217;s take measuring retention metrics to a whole new level by sending data into a data warehouse.</p>
<h3>Storing data in a data warehouse</h3>
<p>If you&#8217;re just getting started with retention metrics and you still mostly optimize for generic metrics like leads, total transactions and total revenue, then you&#8217;ll still be better off with having them in Google Analytics. Compared to not having them at all, that is. If you&#8217;re serious about analyzing and optimizing for retention and customer lifetime value then you need a data warehouse.</p>
<p>Here&#8217;s a quick step-by-step guide that will lead you in the right direction.</p>
<ol>
<li>Send all Google Analytics data into a data warehouse (i.e. <a href="https://cloud.google.com/bigquery">BigQuery</a>). Tools using the <a href="https://developers.google.com/analytics/devguides/reporting/core/v4">Reporting API</a> (most of them) can get you started but for true unsampled hit-level data you need something like <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a>.</li>
<li>Send, pull, push data from other relevant sources into your data warehouse. This should include your database, CRM, CMS, marketing tools, ads platforms, customer support, live chat and every other tool that has data about your customers and their interactions with your brand. Self-service tools like <a href="https://www.stitchdata.com/">Stitch</a> will get you started but we&#8217;d recommend <a href="http://reflectivedata.com/analytics-data-pipeline/">more flexible managed solutions</a>.</li>
<li>Solution for accessing data stored in your data warehouse. You&#8217;d need something (could be separate tools) that can handle ad-hoc queries, dashboarding, automated reports, and building data models. Tools like <a href="https://datastudio.google.com/">Google Data Studio</a> will get you started. Something like <a href="https://looker.com/">Looker</a> or <a href="https://www.tableau.com/">Tableau</a> would be better. Our recommendation is to go with a <a href="http://reflectivedata.com/services/analytics-services/">managed service</a> that will put together the best set of tools for you and configure the rest as well.</li>
</ol>
<p>If having retention metrics in Google Analytics enabled you to do all sorts of useful new reports and analysis then your options with the setup above are truly limitless.</p>
<p>Having a proper data warehouse will be your competitive advantage. Not only does it give you an option to get a very good overview of the current status of your business and your customers, but it will also allow for truly optimizing the user experience and user journey. Leading to a better user experience and improved retention metrics. Remember, acquiring a new customer is anywhere from five to 25 times more expensive than retaining an existing one!</p>
<p>One way to describe the usefulness of a data warehouse is by giving you a few sample questions. Questions that would be very difficult to answer without a data warehouse.</p>
<ul>
<li>Purchases from which traffic channels are most likely to be refunded at some point in the future? Might lead to revising your marketing budget.</li>
<li>Which traffic sources have the highest retention/LTV?</li>
<li>What is the correlation between subscription value ($) and churn rate?</li>
<li>What is the long-term impact of your campaigns or A/B experiments? Do quick wins lead to higher churn or lower LTV?</li>
<li>Does data from different sources add up? Maybe Google Analytics is missing some transactions that are in Shopify or perhaps some of them are duplicates?</li>
</ul>
<p>Here&#8217;s an example of the last one on the list above.</p>
<figure id="attachment_10018" aria-describedby="caption-attachment-10018" style="width: 624px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-console.cloud_.google.com-2021.05.25-15_10_28.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="wp-image-10018 size-full" src="http://reflectivedata.com/wp-content/uploads/2021/05/screenshot-console.cloud_.google.com-2021.05.25-15_10_28.png" alt="Google Analytics vs Shopify order count" width="624" height="896" srcset="https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-console.cloud_.google.com-2021.05.25-15_10_28.png 624w, https://reflectivedata.com/wp-content/uploads/2021/05/screenshot-console.cloud_.google.com-2021.05.25-15_10_28-488x700.png 488w" sizes="(max-width: 624px) 100vw, 624px" /></a><figcaption id="caption-attachment-10018" class="wp-caption-text">Google Analytics vs Shopify order count</figcaption></figure>
<p>As you can see, Google Analytics is missing a good amount of transactions and this requires further investigation. Definitely something you should include in your Google Analytics dashbaord.</p>
<p>This was just a short list of ideas to get you thinking about what is possible with a proper data warehouse. You can trust me when I say that I&#8217;ve seen companies become truly data-driven after they&#8217;ve implemented a tailor-made data warehouse and started digging for insights they couldn&#8217;t before.</p>
<h3>Working with automatically recurring events</h3>
<p>It is important to keep in mind that some retention metrics can change without the user themselves taking any action. You need to make sure that those cases are being tracked and taken into account. Here are a few examples.</p>
<ul>
<li>Recurring orders/payments</li>
<li>Subscription expirations</li>
<li>Payment method expirations</li>
<li>Orders being changed/cancelled (i.e. due to missing item)</li>
</ul>
<p>If your data warehouse was configured properly, you should have this data already available. Just make sure to include it in your analysis and reports.</p>
<p>In case you don&#8217;t have a data warehouse and you&#8217;re trying to solve this with Google Analytics alone then you need to use the <a href="https://developers.google.com/analytics/devguides/collection/protocol/v1">Measurement Protocol</a>. Some of the more common subscription platforms like ReCharge for Shopify have this built-in or solvable with some 3-rd party solutions but quite often custom development is required. In which case, you should think about implementing a data warehouse instead.</p>
<h2>Conclusion</h2>
<p>If you&#8217;re in a business where customers are expected to generate value more than once (repeat purchase, subscription etc.) then you need to start focusing on your retention metrics.</p>
<p>Google Analytics can get you started with the basic metrics and limited accuracy. A much better setup would be Google Analytics combined with <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a> and if you&#8217;re serious about optimizing for those metrics then you need a <a href="http://reflectivedata.com/analytics-data-pipeline/">custom-built data warehouse</a> where all marketing data is pulled together.</p>
<p>Feel free to post your ideas and questions in the comments below. If you&#8217;d like to get some consultation and discuss your ideas further, <a href="http://reflectivedata.com/services/analytics-services/">get in touch with us</a>.</p>
<p>The post <a href="https://www.reflectivedata.com/measure-long-term-metrics-like-customer-lifetime-value-ltv-using-google-analytics/">Measure Long-Term Metrics Like Customer Lifetime Value (LTV) Using Google Analytics</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/measure-long-term-metrics-like-customer-lifetime-value-ltv-using-google-analytics/feed/</wfw:commentRss>
			<slash:comments>3</slash:comments>
		
		
			</item>
		<item>
		<title>Case Study: Storing Google Analytics Data Within The European Union or Locally</title>
		<link>https://www.reflectivedata.com/case-study-storing-google-analytics-data-within-european-union-or-locally/</link>
					<comments>https://www.reflectivedata.com/case-study-storing-google-analytics-data-within-european-union-or-locally/#respond</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Fri, 12 Mar 2021 12:06:18 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Case Study]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=8887</guid>

					<description><![CDATA[<p>Data protection and privacy rules are getting tougher all over the world. This is especially true for the European Union and even more so for some specific industries. Including finance, medical and others that handle sensitive information about their users.</p>
<p>While Google Analytics has been making some improvements in the privacy area and is GDPR compliant, this is not enough for many businesses and industries.</p>
<p>At Reflective Data, we're often working with companies that are under close monitoring of their regulators. To help them out, we've built custom solutions that allow storing Google Analytics within the European Union or sometimes even completely locally.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-storing-google-analytics-data-within-european-union-or-locally/">Case Study: Storing Google Analytics Data Within The European Union or Locally</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote><p><span style="font-size: 23px;"><em>With the help from Reflective Data and using their custom data pipeline for Google Analytics, we were able to avoid sending data to Google Analytics, store everything within the EU and still benefit from the rich and actionable dataset you get with Google Analytics.</em></span></p>
<p style="text-align: right;">Aaron, Chief Data Officer</p>
</blockquote>
<p>&nbsp;</p>
<blockquote><p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4e2.png" alt="📢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Everything mentioned in this article applies to both Google Analytics Universal Analytics and GA4, including the built solutions.</p></blockquote>
<p>Data protection and privacy rules are getting tougher all over the world. This is especially true for the European Union and even more so for some specific industries. Including finance, medical and others that handle sensitive information about their users.</p>
<p>While Google Analytics has been making some <a href="https://support.google.com/analytics/answer/9019185?hl=en">improvements in the privacy area</a> and is GDPR compliant, this is not enough for many businesses and industries.</p>
<p>At Reflective Data, we&#8217;re often working with companies that are under close monitoring of their regulators. To help them out, we&#8217;ve built custom solutions that allow storing Google Analytics data within the European Union or sometimes even completely locally, on the servers of our client.</p>
<h2>The challenge</h2>
<p>Do we want it or not, the regulations for collecting, processing and storing data are getting stiffer each year. European Union, with rules like <a href="https://gdpr-info.eu/">GDPR</a>, is a trendsetter in data protection.</p>
<p>While it&#8217;s good from a customers&#8217; perspective, following all those rules and regulations can be quite a challenge for businesses that have to follow them. Especially, as new rules are introduced and the penalties for not following them <a href="https://www.itgovernance.co.uk/dpa-and-gdpr-penalties">can reach millions</a>.</p>
<p>More often than not, the regulations are being launched faster than companies can adopt. Even if the company themselves are doing everything right, it&#8217;s likely that some third-party software they&#8217;re using is lacking behind.</p>
<p>At Reflective Data, we have been contacted by numerous companies that, for a reason or another, aren&#8217;t able to use Google Analytics but would still require a similar dataset and comparable functionality. Let&#8217;s be honest, Google Analytics and its Universal Analytics data model is still one of the best ones available for marketing attribution and website user behavior analysis.</p>
<p><em>Yes, <a href="https://support.google.com/analytics/answer/10089681">Google Analytics 4</a> will be the future of tracking but it&#8217;s nowhere near being ready for production for most businesses due to lacking several critical features.</em></p>
<h2>The solution</h2>
<p>One of our most popular tools is the <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking for Google Analytics</a>. It&#8217;s a system that allows sending raw hit-level Google Analytics data into a data warehouse. Most commonly, our clients choose this solution for the following reasons.</p>
<ul>
<li>They hit the 10M/hits/mo limit</li>
<li>Problems with data sampling</li>
<li>Looking for alternatives for Google Analytics 360</li>
<li>Need access to raw hit-level data</li>
</ul>
<p>These are all nice features that our clients love but there&#8217;s one more – Parallel Tracking can work without sending data to Google Analytics at all. This is thanks to how our technology was built. Instead of pulling data from Google Analytics using the <a href="https://developers.google.com/analytics/devguides/reporting/core/v4">Reporting API</a> (as many solutions do), we are duplicating all hits that are sent to Google&#8217;s data processing endpoint and sending them to our engine as well. This means, with a simple tracking code modification, we can disable hits being sent to Google Analytics.</p>
<p>Our data processing engine works very similarly to what&#8217;s happening in Google Analytics. Hits are processed and enriched using the same methodology and sessions are processed just like in Google Analytics.</p>
<p>Ultimately, this means you&#8217;ll get the best from both worlds. The rich and actionable dataset of Google Analytics and full ownership of your data. So, whatever regulations apply to how you can store this kind of data, now you have a solution to comply with them.</p>
<blockquote><p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4e2.png" alt="📢" class="wp-smiley" style="height: 1em; max-height: 1em;" /> Everything mentioned in this article applies to both Google Analytics Universal Analytics and GA4, including the built solutions.</p></blockquote>
<h2>Conclusion</h2>
<p>There are many reasons why you&#8217;d want to use Google Analytics as your analytics platform. Including the rich data model that works for both marketing attribution and user behavior analysis, (almost) unlimited amount of resources, great community, ease of finding people that can work with this model and more.</p>
<p>If, for some reason, you can&#8217;t use Google Analytics. Could be because they&#8217;re storing data outside of the EU or that you need to store everything locally. Then we have a solution for you!</p>
<p>For details and contact information, check out the <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">product page for Parallel Tracking</a>.</p>
<p>Ending with another quote from a customer here.</p>
<blockquote><p><em>Before joining the financial institution that I&#8217;m currently working for, I had been a marketing director for various e-commerce companies. This meant that I was really used to tools like Google Analytics. Now, working in finance, I learned that using Google Analytics was not an option due to several regulations and we had to store all data locally on our own servers.</em></p>
<p><em>We had some kind of an in-house solution but that was nowhere near the capabilites and functionality I was used to. I was really lucky to learn about Reflective Data and their Parallel Tracking solution! They were really helpful, worked closely with our team to learn about our needs and finally built a solution that exceeded our expectations. 10/10 would recommend to anyone that for some reason isn&#8217;t allowed to use Google Analytics.</em></p>
<p style="text-align: right;">Michelle, Director of Online Marketing, Frankfurt</p>
</blockquote>
<p>It&#8217;s feedback like this that makes us love the work we do even more! <a href="http://reflectivedata.com/services/analytics-services/">Get in touch</a> and learn how we can help you, too.</p>
<p>For more case studies, <a href="http://reflectivedata.com/case-studies/">see here</a>.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-storing-google-analytics-data-within-european-union-or-locally/">Case Study: Storing Google Analytics Data Within The European Union or Locally</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/case-study-storing-google-analytics-data-within-european-union-or-locally/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Case Study: Sending Data From Shopify and Google Analytics to BigQuery for More Advanced Analysis</title>
		<link>https://www.reflectivedata.com/case-study-sending-data-from-shopify-and-google-analytics-to-bigquery-for-more-advanced-analysis/</link>
					<comments>https://www.reflectivedata.com/case-study-sending-data-from-shopify-and-google-analytics-to-bigquery-for-more-advanced-analysis/#comments</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Mon, 25 Jan 2021 15:11:20 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Case Study]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=8090</guid>

					<description><![CDATA[<p>Using Google Analytics Parallel Tracking and a custom data pipeline for Shopify, we managed to get all necessary data in BigQuery for more advanced analysis and reporting.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-sending-data-from-shopify-and-google-analytics-to-bigquery-for-more-advanced-analysis/">Case Study: Sending Data From Shopify and Google Analytics to BigQuery for More Advanced Analysis</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><span style="font-size: 21pt;">Using Google Analytics Parallel Tracking and a custom data pipeline for Shopify, we managed to get all necessary data in BigQuery for more advanced analysis and reporting.</span></p>
<blockquote><p><strong>About the client</strong></p>
<p>Our client is a successful e-commerce business, headquartered in the UK.</p>
<p>Their focus is on food supplements, nutritional products, and diet plans.</p>
<p>Depending on a season, their monthly online revenue is 1-1.5 million GBP (1.3 &#8211; 2 million USD). This comes from around 14k orders every month.</p></blockquote>
<p><em>Due to NDAs, we are not allowed to disclose the name of the client. Everything else in this case study remains unchanged.</em></p>
<h2>The challenge</h2>
<p>When the company in the focus of this case study first contacted us, their main concerns were about properly tracking customer lifetime value (LTV) and churn. Being in the business of food supplements where, naturally, a big chunk of revenue is coming through subscriptions (automatically recurring orders), keeping track of these metrics is extremely important.</p>
<h3>Tech stack</h3>
<p>The main tech tool stack for this client was a fairly standard one for an e-commerce business of this size. <a href="https://www.shopify.com/plus">Shopify Plus</a> for website and order processing, <a href="https://rechargepayments.com/">ReCharge</a> for handling subscriptions, <a href="https://analytics.google.com/analytics/web/">Google Analytics</a> for general reporting, and <a href="https://www.shipstation.com/">ShipStation</a> for managing shipments.</p>
<p>***</p>
<p>While Shopify can provide you with reports for both LTV and churn (via ReCharge analytics), it is rather limited when it comes to attributing those numbers to your marketing and other efforts like <a href="https://www.optimizely.com/optimization-glossary/conversion-rate-optimization/">CRO</a> or A/B testing.</p>
<p>The client pointed out three main areas where they need a more detailed look into LTV and churn-related metrics:</p>
<ul>
<li>Ad Spend (mostly Google Ads and Facebook)</li>
<li>E-mail campaigns</li>
<li>CRO and A/B testing</li>
</ul>
<h2>The solution</h2>
<p>After having several meetings with various teams from the client&#8217;s side, we had a quite good understanding of what kind of reports they&#8217;re trying to build and what kind of data it requires.</p>
<p>In order to accommodate all of their needs, we had to involve data from the following sources:</p>
<ul>
<li>Shopify</li>
<li>ReCharge</li>
<li>Google Analytics</li>
<li>Google Ads</li>
<li>Facebook Ads</li>
<li>ShipStation</li>
<li>Google Optimize</li>
</ul>
<p>As our solution involved data from more than three sources, it was quite obvious that setting up a data warehouse is required. The client decided to go with our default recommendation for data warehouse software — <a href="https://cloud.google.com/bigquery/">BigQuery</a>.</p>
<h3>Getting data in BigQuery</h3>
<p>As LTV and churn had always been such essential metrics for the client, they had already tried solving this in-house. Their existing tool stack consisted of <a href="https://www.stitchdata.com/">Stitch</a>, <a href="https://supermetrics.com/">Supermetrics</a>, <a href="https://www.mysql.com/">MySQL</a> database and <a href="https://www.google.com/sheets/about/">Google Sheets</a>. While this could&#8217;ve worked in theory, the reality was a really slow system that constantly failed and produced inaccurate numbers.</p>
<p>Our solution involved using <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/">Parallel Tracking</a> technology for Google Analytics and a <a href="http://reflectivedata.com/analytics-data-pipeline/">custom data pipeline</a> for the rest of the sources. This enabled us to modify the ETL for maximum efficiency while meeting all the data requirements.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/03/google-analytics-parallel-tracking_1.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-3746" src="http://reflectivedata.com/wp-content/uploads/2020/03/google-analytics-parallel-tracking_1.png" alt="Marketing Data Warehouse" width="900" height="259" srcset="https://reflectivedata.com/wp-content/uploads/2020/03/google-analytics-parallel-tracking_1.png 900w, https://reflectivedata.com/wp-content/uploads/2020/03/google-analytics-parallel-tracking_1-700x201.png 700w, https://reflectivedata.com/wp-content/uploads/2020/03/google-analytics-parallel-tracking_1-768x221.png 768w" sizes="(max-width: 900px) 100vw, 900px" /></a></p>
<h3>Building the reports</h3>
<p>Since our client was already familiar with the rest of Google&#8217;s Marketing Suite and it works really well with BigQuery, using <a href="https://datastudio.google.com/u/0/navigation/reporting">Data Studio</a> for building the reports was an easy choice. Besides being a free (!) tool, by leveraging the <a href="https://cloud.google.com/bigquery/docs/visualize-jupyter">BigQuery BI Engine</a>, Data Studio can provide you with the best performance as well!</p>
<p>After understanding the reports that would be used most frequently, we built a set of 2 main dashboards:</p>
<ul>
<li><strong>LTV</strong> per source/medium/country/product/campaign/age</li>
<li><strong>Churn</strong> per source/medium/country/product/campaign/age</li>
</ul>
<figure id="attachment_8099" aria-describedby="caption-attachment-8099" style="width: 1286px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-8099" src="http://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44.png" alt="Example of an LTV report" width="1286" height="436" srcset="https://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44.png 1286w, https://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44-700x237.png 700w, https://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44-1024x347.png 1024w, https://reflectivedata.com/wp-content/uploads/2021/01/screenshot-datastudio.google.com-2021.01.18-17_55_44-768x260.png 768w" sizes="(max-width: 1286px) 100vw, 1286px" /></a><figcaption id="caption-attachment-8099" class="wp-caption-text">Example of an LTV report</figcaption></figure>
<p>These dashboards provide the client with a good overview of their key metrics and how they correlate with some of the more common dimensions/attributes. This is a good stepping stone for a deeper analysis and digging for insights.</p>
<p>Luckily, the client&#8217;s marketing team already had a few people that were comfortable working with SQL, Jupyter Notebooks etc. This meant that by pointing them in the right direction and providing a few guidelines, they had everything needed for digging deeper into data stored in BigQuery.</p>
<figure id="attachment_8098" aria-describedby="caption-attachment-8098" style="width: 1284px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-8098" src="http://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01.png" alt="Example of a churn report" width="1284" height="563" srcset="https://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01.png 1284w, https://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01-700x307.png 700w, https://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01-1024x449.png 1024w, https://reflectivedata.com/wp-content/uploads/2021/01/Screenshot-2021-01-18-at-17.53.01-768x337.png 768w" sizes="(max-width: 1284px) 100vw, 1284px" /></a><figcaption id="caption-attachment-8098" class="wp-caption-text">Example of a churn report</figcaption></figure>
<p>Besides dashboards and enabling ad-hoc analysis, a few teams also wished for a periodic automated report in their inbox. This was also something we were happy to set up using some custom software on <a href="https://cloud.google.com/">Google Cloud</a>.</p>
<h2>Conclusion</h2>
<p>While this project started as a rather standard one for us, it turned out to include several custom parts that made it fun for us and enabled maximum effectiveness for the client.</p>
<p>What better way to take it all together than a few words from the team we worked with on the client&#8217;s side.</p>
<blockquote><p>We turned to Reflective Data with a rather concrete goal in mind — to fix our LTV and churn tracking and reporting. Being in the business of  food supplements, a large portion of our revenues come from recurring orders. This makes keeping track of those metrics extremely important for us. Not only was Reflective Data able to set up a proper tracking system, they helped us figure out the dashboards, email reports and helped our analysts on SQL analysis! Couldn&#8217;t be happier about this!</p></blockquote>
<p>Nothing&#8217;s better than such warm words from a happy client!</p>
<p>If you, too, need help with your analytics &amp; data setup, <a href="http://reflectivedata.com/services/analytics-services/">get in touch with us</a> for a free consultation session. I&#8217;m sure we can find a solution that suits both of us.</p>
<p>The post <a href="https://www.reflectivedata.com/case-study-sending-data-from-shopify-and-google-analytics-to-bigquery-for-more-advanced-analysis/">Case Study: Sending Data From Shopify and Google Analytics to BigQuery for More Advanced Analysis</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/case-study-sending-data-from-shopify-and-google-analytics-to-bigquery-for-more-advanced-analysis/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>[Medium] Hit-Level Unsampled Google Analytics to BigQuery Without 360</title>
		<link>https://www.reflectivedata.com/medium-hit-level-unsampled-google-analytics-data-to-bigquery-without-360/</link>
					<comments>https://www.reflectivedata.com/medium-hit-level-unsampled-google-analytics-data-to-bigquery-without-360/#respond</comments>
		
		<dc:creator><![CDATA[Silver Ringvee]]></dc:creator>
		<pubDate>Wed, 30 Dec 2020 11:11:11 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Medium]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=7809</guid>

					<description><![CDATA[<p>Google Analytics and BigQuery, two tools that both the major players in their respective segments. Yet, there is no way to easily send raw hit-level data from one to another.</p>
<p>In this article, originally posted on medium.com, we're going to walk through the reasons you might want to access raw Google Analytics data in BigQuery and a few solutions that will get you there.</p>
<p>The post <a href="https://www.reflectivedata.com/medium-hit-level-unsampled-google-analytics-data-to-bigquery-without-360/">[Medium] Hit-Level Unsampled Google Analytics to BigQuery Without 360</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Google Analytics and BigQuery, two tools that are both major players in their respective segments. Yet, there is no way to easily send raw hit-level data from one to another.</p>
<p>In this article, originally posted on <a href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3">medium.com</a>, we&#8217;re going to walk through the reasons you might want to access raw Google Analytics data in BigQuery and a few solutions that will get you there.</p>
<p>***</p>
<p id="baed" class="mq mr le vn b mt mu mv mw mx my mz na nb nc nd ne nf ng nh ni nj nk nl nm nn hw ci" data-selectable-paragraph="">Google Analytics is, without a doubt, the most popular tool in the digital analytics segment. Besides being the best tool for tracking e-commerce, content and lead-gen websites, with some modification you can use Google Analytics for <a class="co no" href="http://reflectivedata.com/using-google-analytics-for-tracking-saas/" rel="noopener nofollow">tracking SaaS</a> and other web apps as well.</p>
<p id="c028" class="mq mr le vn b mt mu mv mw mx my mz na nb nc nd ne nf ng nh ni nj nk nl nm nn hw ci" data-selectable-paragraph="">While being a really powerful and flexible tool, Google Analytics, too, has a few shortcomings. Perhaps the biggest problem every advanced user faces is the fact that there is no way to access raw hit-level data. The second major issue with Google Analytics is related to sampling and data collection limits.</p>
<p id="3c30" class="mq mr le vn b mt mu mv mw mx my mz na nb nc nd ne nf ng nh ni nj nk nl nm nn hw ci" data-selectable-paragraph="">Now, let’s take a look at some of the ways you can overcome these shortcomings without spending a ton on Google Analytics 360 — a premium version of Google Analytics that, to some extent, also eliminates these issues.</p>
<p id="d41b" class="mq mr le vn b mt mu mv mw mx my mz na nb nc nd ne nf ng nh ni nj nk nl nm nn hw ci" data-selectable-paragraph=""><em class="np">PS! If you already have GA 360, these techniques will give you an even more robust and flexible tracking solution.</em></p>
<h2 id="1778" class="nq nr le aw ep vo nt vp vq vr nv vs vt vu nx vv vw vx nz vy vz wa ob wb wc oc ci" data-selectable-paragraph="">Table of Contents</h2>
<p id="30c4" class="mq mr le vn b mt od mv mw mx oe mz na nb of nd ne nf og nh ni nj oh nl nm nn hw ci" data-selectable-paragraph="">– <a class="co no" href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3#84b5" rel="noopener">Sampling in Google Analytics</a><br />
– <a class="co no" href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3#166b" rel="noopener">Ways to avoid sampling in Google Analytics</a><br />
– <a class="co no" href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3#98f4" rel="noopener">Why do you even need the raw data</a><br />
– <a class="co no" href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3#633a" rel="noopener">How to get access to unsampled hit-level raw data</a><br />
– <a class="co no" href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3#11ea" rel="noopener">What about the price</a></p>
<p data-selectable-paragraph="">***</p>
<p><a href="https://medium.com/reflective-data/hit-level-unsampled-google-analytics-to-bigquery-without-360-a6a1477a5d3" target="_blank" rel="noopener">Read the full post on medium.com</a></p>
<p>The post <a href="https://www.reflectivedata.com/medium-hit-level-unsampled-google-analytics-data-to-bigquery-without-360/">[Medium] Hit-Level Unsampled Google Analytics to BigQuery Without 360</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/medium-hit-level-unsampled-google-analytics-data-to-bigquery-without-360/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>How to Query Google Analytics Data Using SQL</title>
		<link>https://www.reflectivedata.com/how-to-query-google-analytics-data-using-sql/</link>
					<comments>https://www.reflectivedata.com/how-to-query-google-analytics-data-using-sql/#comments</comments>
		
		<dc:creator><![CDATA[Jason Dolan]]></dc:creator>
		<pubDate>Fri, 30 Oct 2020 14:48:47 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Technical]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=6985</guid>

					<description><![CDATA[<p>SQL is the most popular language for professionals to communicate with databases and query data. Google Analytics is the most popular tool for digital analytics. How come there's no way to query Google Analytics data using SQL? In this article, we'll explore the solutions.</p>
<p>The post <a href="https://www.reflectivedata.com/how-to-query-google-analytics-data-using-sql/">How to Query Google Analytics Data Using SQL</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>SQL (pronounced &#8220;ess-que-el&#8221;) stands for Structured Query Language. SQL is a language used to communicate with databases. According to ANSI (American National Standards Institute), it is the standard language for relational database management systems. Most data analysts, data scientists and data engineers use SQL on daily basis to complete tasks related to ad-hoc queries, reporting and data visualization.</p>
<p>Google Analytics, on the other hand, is the most popular tool in the digital analytics world. It is used to keep track of marketing efforts, user behavior, traffic sources and more. The most common way to access data from Google Analytics is the web user interface. Alternative options include the <a href="https://developers.google.com/analytics/devguides/reporting/core/v4" target="_blank" rel="noopener noreferrer">Reporting API</a> or external tools like <a href="https://looker.com/" target="_blank" rel="noopener noreferrer">Looker</a> or <a href="https://datastudio.google.com/" target="_blank" rel="noopener noreferrer">Data Studio</a>.</p>
<p>What&#8217;s interesting, though, is that there is no way to use the most popular query language to query data from the most popular analytics platform. That&#8217;s right, you cannot query Google Analytics data using SQL.</p>
<p>This is no big deal for the more basic users just checking the built-in reports in the Google Analytics UI. It is, for sure, a limitation for power users working with tools like Python and use data in custom models or even feed it into machine learning algorithms and recommendation engines.</p>
<p>In this article, we&#8217;re going to cover the solutions that will enable you to query your Google Analytics data using SQL.</p>
<h2>Step 1 &#8211; Getting data into a database/data warehouse</h2>
<p>As mentioned above, there is, unfortunately, no way to query data directly from Google Analytics using SQL. This means that the first step is to get data into some sort of a relational database or a data warehouse that support SQL queries.</p>
<p>There are three options for sending Google Analytics data into some external data storage.</p>
<h3>1. Google Analytics 360 to BigQuery export</h3>
<p>If your company has the 360 (premium) version of Google Analytics then you can use its native BigQuery export feature. <a href="https://support.google.com/analytics/answer/3437618?hl=en" target="_blank" rel="noopener noreferrer">Here are the details</a> for setting this up. Keep in mind, though, that this solution will not work with the standard (free) version of Google Analytics.</p>
<h3>2. Google Analytics Parallel Tracking</h3>
<p>Google Analytics Parallel Tracking is a third party service that sends all of the raw hits into a data warehouse of your choice (i.e. BigQuery). These hits are then processed into sessions to provide a dataset similar to Google Analytics 360 export.</p>
<p>While this solution is not free, it will cost you only a small fraction of the cost of Google Analytics 360.</p>
<p><a href="http://reflectivedata.com/analytics-data-pipeline/from-google-analytics-to-bigquery/" target="_blank" rel="noopener noreferrer">Getting started with Google Analytics Parallel Tracking</a>.</p>
<h3>3. Export data using the API</h3>
<p>This is the most technical solution of the three. It leverages the<a href="https://developers.google.com/analytics/devguides/reporting/core/v4" target="_blank" rel="noopener noreferrer"> Google Analytics Reporting API</a> to pull data from Google Analytics and into your database or data warehouse.</p>
<p>One way to get started with this solution is to use the <a href="https://code.markedmondson.me/gago/" target="_blank" rel="noopener noreferrer">gago library</a> in Go language to communicate with the Reporting API. Then you need the code for writing this data into your database.</p>
<p>The limitations with data export solution are that you can&#8217;t query all of the metrics and dimensions at the same time, you&#8217;re still affected by the data collection and sampling limits of Google Analytics and a modification in your tracking system is required (custom dimensions for hit timestamp, hit type, sessions ID, client ID and more).</p>
<h2>Step 2 &#8211; Query Google Analytics data using SQL</h2>
<p>This depends a bit on the setup you used to send data into a database/data warehouse and the type of data warehouse being used. Since the most common option is <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">Parallel Tracking</a> and <a href="https://cloud.google.com/bigquery" target="_blank" rel="noopener noreferrer">Google BigQuery</a>, we are going to use these in our examples as well.</p>
<p>The simplest way you can run your first SQL query against your Google Analytics data stored in BigQuery is to go into the <a href="https://console.cloud.google.com/bigquery" target="_blank" rel="noopener noreferrer">BigQuery user interface</a> and choose the right dataset containing your Google Analytics data.</p>
<p>With parallel tracking, you will have your Google Analytics data stored in three separate tables.</p>
<ul>
<li>raw_hits</li>
<li>processed_hits</li>
<li>processed_sessions</li>
</ul>
<p>Depending on the type of query you want to run, choose the right table. Keep in mind that some of the information (bounce rate, geo-location etc.) isn&#8217;t available before hits are processed into sessions.</p>
<p>For example, we could write a simple SQL query like this and see the top 10 countries by the number of Google Analytics sessions.</p>
<figure id="attachment_7056" aria-describedby="caption-attachment-7056" style="width: 588px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_16_01.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7056" src="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_16_01.png" alt="Query Google Analytics data using SQL" width="588" height="669" /></a><figcaption id="caption-attachment-7056" class="wp-caption-text">Query Google Analytics data using SQL</figcaption></figure>
<p>Now, to make this query a bit more interesting, we might add in metrics like users and bounce rate. All doable with SQL, of course.</p>
<p>Notice how you can define your own rules for things like &#8220;bounce&#8221; and &#8220;bounce rate&#8221;.</p>
<figure id="attachment_7058" aria-describedby="caption-attachment-7058" style="width: 678px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_31_40.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7058" src="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_31_40.png" alt="Query Google Analytics sessions, users and bounce rate using SQL" width="678" height="813" srcset="https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_31_40.png 678w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.27-17_31_40-584x700.png 584w" sizes="(max-width: 678px) 100vw, 678px" /></a><figcaption id="caption-attachment-7058" class="wp-caption-text">Query Google Analytics sessions, users and bounce rate using SQL</figcaption></figure>
<p>Pretty neat, right?</p>
<p>Using the BigQuery UI is good for testing your setup and quick ad-hoc queries. In most cases, though, you&#8217;ll probably use some sort of a BI tool. One of the popular options these days is Google Data Studio. So, let&#8217;s use this in our examples a well.</p>
<p>Google Data Studio is free and probably the easiest to get started with. That being said, it is still packed with useful features and makes creating interactive dashboards a fun and enjoyable process.</p>
<h3>Connecting BigQuery with Google Data Studio</h3>
<p>Google Data Studio, like many other BI tools, has a native integration for BigQuery as a data source.</p>
<figure id="attachment_7103" aria-describedby="caption-attachment-7103" style="width: 1063px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7103" src="http://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02.png" alt="Connect BigQuery to Google Data Studio" width="1063" height="824" srcset="https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02.png 1063w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02-700x543.png 700w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02-1024x794.png 1024w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-15.02.02-768x595.png 768w" sizes="(max-width: 1063px) 100vw, 1063px" /></a><figcaption id="caption-attachment-7103" class="wp-caption-text">Connect BigQuery to Google Data Studio</figcaption></figure>
<p>While you can automatically pull all data in a certain table, in most cases it&#8217;s a better idea to use a custom SQL query.</p>
<p>Data Studio has this feature built-in as well.</p>
<figure id="attachment_7106" aria-describedby="caption-attachment-7106" style="width: 1128px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7106" src="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45.png" alt="Custom SQL query in Data Studio" width="1128" height="721" srcset="https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45.png 1128w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45-700x447.png 700w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45-1024x655.png 1024w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-datastudio.google.com-2020.10.30-15_09_45-768x491.png 768w" sizes="(max-width: 1128px) 100vw, 1128px" /></a><figcaption id="caption-attachment-7106" class="wp-caption-text">Custom SQL query in Data Studio</figcaption></figure>
<p>In our example, we&#8217;re going to analyze the results of an A/B test run using <a href="https://vwo.com/" target="_blank" rel="noopener noreferrer">VWO</a>. We&#8217;re using VWO&#8217;s <a href="https://help.vwo.com/hc/en-us/articles/360021308973-Integrating-VWO-with-Universal-Analytics-by-Using-Google-Tag-Manager-Custom-Events-">custom-event-based integration</a> to send experiment data into Google Analytics.</p>
<p>It&#8217;s always a good idea to try your query in the BigQuery&#8217;s query editor before implementing it in Data Studio.</p>
<figure id="attachment_7107" aria-describedby="caption-attachment-7107" style="width: 1134px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7107" src="http://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28.png" alt="Google Analytics VWO BigQuery" width="1134" height="982" srcset="https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28.png 1134w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28-700x606.png 700w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28-1024x887.png 1024w, https://reflectivedata.com/wp-content/uploads/2020/10/screenshot-console.cloud_.google.com-2020.10.30-15_18_28-768x665.png 768w" sizes="(max-width: 1134px) 100vw, 1134px" /></a><figcaption id="caption-attachment-7107" class="wp-caption-text">Google Analytics VWO BigQuery</figcaption></figure>
<p>Once we&#8217;re happy with the query and the results it returned, it&#8217;s time to put the query in Data Studio.</p>
<p>In Data Studio, we can start building all sorts of cool visualizations. For example this one for visualizing the funnels for Control and Variant 1 of our A/B test.</p>
<figure id="attachment_7110" aria-describedby="caption-attachment-7110" style="width: 1237px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-7110" src="http://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53.png" alt="A/B test visualization Google Data Studio" width="1237" height="499" srcset="https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53.png 1237w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53-700x282.png 700w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53-1024x413.png 1024w, https://reflectivedata.com/wp-content/uploads/2020/10/Screenshot-2020-10-30-at-16.02.53-768x310.png 768w" sizes="(max-width: 1237px) 100vw, 1237px" /></a><figcaption id="caption-attachment-7110" class="wp-caption-text">A/B test visualization Google Data Studio</figcaption></figure>
<p>Data Studio is really flexible when it comes to building custom reports, interactive dashboards and quick visualizations. If you can&#8217;t find the right chart from the <a href="https://michaelhoweely.com/2019/04/14/an-overview-of-all-google-data-studio-chart-types-in-2019/" target="_blank" rel="noopener noreferrer">default chart types</a>, check out the <a href="https://developers.google.com/datastudio/visualization" target="_blank" rel="noopener noreferrer">community visualizations</a> or go ahead and <a href="https://developers.google.com/datastudio/visualization/define-config" target="_blank" rel="noopener noreferrer">build one</a> yourself.</p>
<h2>Conclusion</h2>
<p>Working with your Google Analytics data using SQL opens up a whole new world of analysis opportunities for your digital analytics data. You no longer have to use the metrics and concepts defined and pre-calculated by Google. Feel free to come up with your own rules for things like calculating bounce rate or defining a session.</p>
<p>To get started, you first need to transfer your Google Analytics data into a data warehouse that supports SQL queries. For example, Google BigQuery. There are a few ways you can send Google Analytics data into BigQuery, our recommended method is <a href="http://reflectivedata.com/analytics-data-pipeline/from-google-analytics-to-bigquery/" target="_blank" rel="noopener noreferrer">Parallel Tracking</a>. This will provide you with the most complete dataset, delivered in near-real-time and for a fraction of the cost of GA 360.</p>
<p>Once you have data in your database or data warehouse, you can start writing ad-hoc SQL queries right away. To take your productivity to a next level, we recommend running your queries and reports through a BI platform (i.e. Data Studio).</p>
<p>If you have any questions about analyzing your Google Analytics data using SQL or sending data into a data warehouse, feel free to ask them in the comments below.</p>
<p>The post <a href="https://www.reflectivedata.com/how-to-query-google-analytics-data-using-sql/">How to Query Google Analytics Data Using SQL</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/how-to-query-google-analytics-data-using-sql/feed/</wfw:commentRss>
			<slash:comments>6</slash:comments>
		
		
			</item>
		<item>
		<title>How to Avoid Google Analytics Sampling and Data Limits?</title>
		<link>https://www.reflectivedata.com/how-to-avoid-google-analytics-sampling-and-data-limits/</link>
					<comments>https://www.reflectivedata.com/how-to-avoid-google-analytics-sampling-and-data-limits/#comments</comments>
		
		<dc:creator><![CDATA[Silver Ringvee]]></dc:creator>
		<pubDate>Tue, 25 Aug 2020 13:27:30 +0000</pubDate>
				<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Data Pipeline]]></category>
		<category><![CDATA[Google Analytics]]></category>
		<category><![CDATA[Tools]]></category>
		<guid isPermaLink="false">http://reflectivedata.com/?p=4677</guid>

					<description><![CDATA[<p>Google Analytics, while being by far the most popular tool in its segment, does have a few limitations that can make this, otherwise nearly perfect tool, unsuitable for a large number of companies.</p>
<p>The main limitations of Google Analytics are related to sampling and data collection limits. Most affected are companies that can't afford the premium 360 version of Google Analytics (~150k/year) but still have a good amount of traffic visiting their websites. In general, Google Analytics properties with >1M sessions/month or >10M hits/month are being affected by some heavy sampling and data collection limits.</p>
<p>In this article, we're going to cover the different types of limitations present in the free version of Google Analytics and provide solutions/workarounds to all of them. Oh, and the solution, in most cases, does not include buying the 360 version.</p>
<p>The post <a href="https://www.reflectivedata.com/how-to-avoid-google-analytics-sampling-and-data-limits/">How to Avoid Google Analytics Sampling and Data Limits?</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Google Analytics, while being by far the most popular tool in its segment, does have a few limitations that can make this, otherwise nearly perfect tool, unsuitable for a large number of companies.</p>
<p>The main limitations of Google Analytics are related to sampling and data collection limits. Most affected are companies that can&#8217;t afford the premium 360 version of Google Analytics (<a href="https://www.thirdandgrove.com/insights/is-google-analytics-360-worth-price-tag/" target="_blank" rel="noopener noreferrer">~150k/year</a>) but still have a good amount of traffic visiting their websites. In general, Google Analytics properties with &gt;1M sessions/month or &gt;10M hits/month are being affected by some heavy sampling and data collection limits.</p>
<p>In this article, we&#8217;re going to cover the different types of limitations present in the free version of Google Analytics and provide solutions/workarounds to all of them. Oh, and the solution, in most cases, does not include buying the 360 version.</p>
<h2>Types of limitations in Google Analytics</h2>
<p>Here&#8217;s a quick overview of the different types of limitations that come with the free version of Google Analytics.</p>
<h3>Sampling in reports</h3>
<p>Sampling on the reporting level means that, even if Google Analytics collected 100% of the hits, the reports you are seeing in the user interface are not based on 100% of those hits/sessions. In general, sampling starts if the period you are looking at <a href="https://support.google.com/analytics/answer/2637192" target="_blank" rel="noopener noreferrer">contains more than 500k sessions</a> in total. It can start a lot sooner, though, when more advanced custom segments are in use.</p>
<figure id="attachment_4693" aria-describedby="caption-attachment-4693" style="width: 499px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/sampling-in-google-analytics-reporting.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-4693" src="http://reflectivedata.com/wp-content/uploads/2020/08/sampling-in-google-analytics-reporting.png" alt="Sampling in Google Analytics reports" width="499" height="133" /></a><figcaption id="caption-attachment-4693" class="wp-caption-text">Sampling in Google Analytics reports</figcaption></figure>
<h3>Data collection limits</h3>
<p>From the official <a href="https://support.google.com/analytics/answer/1070983" target="_blank" rel="noopener noreferrer">documentation</a>.</p>
<blockquote><p>If a property sends more hits per month to Analytics than allowed by the <a href="http://www.google.com/analytics/terms/us.html" target="_blank" rel="noopener noreferrer">Analytics Terms of Service</a>, there is no assurance that the excess hits will be processed. If the property&#8217;s hit volume exceeds this limit, a warning may be displayed in the user interface and you may be prevented from accessing reports.</p></blockquote>
<p>Data collection limits that apply to all free Google Analytics accounts.</p>
<ul>
<li>up to 10M hits total per month</li>
<li>200,000 hits per user per day</li>
<li>500 hits per session</li>
</ul>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.36.03.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5557" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.36.03.png" alt="Google Analytics Data Collection Limits" width="202" height="136" /></a></p>
<h3>Data processing latency</h3>
<p>Processing latency in Google Analytics is <a href="https://support.google.com/analytics/answer/1070983" target="_blank" rel="noopener noreferrer">up to 24-48 hours</a>. Standard accounts that send more than 200,000 sessions per day to Analytics will result in the reports being refreshed only once a day. This can delay updates to reports and metrics for up to two days.</p>
<p>This makes Google Analytics quite useless for things like monitoring real-time performance, detecting usability issues, automated reports on an hourly basis, real-time product recommendations, and other use cases that require fresh data (i.e. machine learning, marketing automation).</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.31.08.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5551" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.31.08.png" alt="Google Analytics data processing latency" width="872" height="201" srcset="https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.31.08.png 872w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.31.08-700x161.png 700w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.31.08-768x177.png 768w" sizes="(max-width: 872px) 100vw, 872px" /></a></p>
<h3>Custom dimensions and metrics</h3>
<p>There are <a href="https://support.google.com/analytics/answer/2709828?hl=en" target="_blank" rel="noopener noreferrer">20 indices available</a> for different custom dimensions and 20 indices for custom metrics in each free Google Analytics property. 360 accounts have 200 indices available for custom dimensions and 200 for custom metrics.</p>
<p>While 20 is plenty for a small business with a simple website, this number can become quite limiting if you need custom dimensions for things like A/B testing data (one for each live experiment), e-commerce variables, user-specific information, and more.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.27.43.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5550" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.27.43.png" alt="Google Analytics custom dimensions and metrics limit" width="777" height="166" srcset="https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.27.43.png 777w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.27.43-700x150.png 700w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-12.27.43-768x164.png 768w" sizes="(max-width: 777px) 100vw, 777px" /></a></p>
<h3>Aggregated metrics and no access to raw data</h3>
<p>Unfortunately, in the free version of Google Analytics, there is no way to access the raw hit-level data. This means that if the way it calculates certain metrics or aggregates hits into sessions doesn&#8217;t suit your business requirements then you&#8217;re out of luck – without access to raw data, you can&#8217;t define your own calculations or aggregations.</p>
<p>Also, in case you would like to analyze the user journey of a specific visitor, there is no way to easily query and analyze all hits from one visitor in a timely order.</p>
<p>Google Analytics 360 does have a BigQuery export but, by default, the metrics and sessions are still defined and pre-aggregated just like in the reports.</p>
<h3>Hit payload limited to 8000 bytes</h3>
<p>Depending on your analytics setup, some of your hits may include a lot data – ecommerce product impressions, several custom dimensions etc. The Google Analytics hit payload size is <a href="https://developers.google.com/analytics/devguides/collection/protocol/v1/reference" target="_blank" rel="noopener noreferrer">limited to 8000 bytes</a>, this means that all hits greater than this will be ignored by Google Analytics&#8217; data processing engine and your dataset ends up being incomplete. What&#8217;s worse, Google Analytics doesn&#8217;t alert you when this is happening!</p>
<p>This affects both the free and 360 versions of Google Analytics.</p>
<figure id="attachment_5548" aria-describedby="caption-attachment-5548" style="width: 1608px" class="wp-caption aligncenter"><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size.jpg" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="size-full wp-image-5548" src="http://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size.jpg" alt="Google Analytics hit payload length limit 8000 bytes" width="1608" height="428" srcset="https://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size.jpg 1608w, https://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size-700x186.jpg 700w, https://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size-1024x273.jpg 1024w, https://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size-768x204.jpg 768w, https://reflectivedata.com/wp-content/uploads/2020/08/ga-maximum-payload-size-1536x409.jpg 1536w" sizes="(max-width: 1608px) 100vw, 1608px" /></a><figcaption id="caption-attachment-5548" class="wp-caption-text"><a href="https://www.simoahava.com/analytics/send-google-analytics-payload-length-as-custom-dimension/" target="_blank" rel="noopener noreferrer">Source</a></figcaption></figure>
<h2>Workarounds to sampling and other limitations</h2>
<p>Now, let&#8217;s take a look at the same list of limitations and provide a solution/workaround to each of them.</p>
<h3>Sampling in reports</h3>
<p><strong>Option 1</strong> &#8211; Divide your queries into smaller chunks and join data using another tool (Excel, Pandas, R). This can be done manually in the UI, using the Reporting API, or by using a library for your favorite programming language (i.e. <a href="https://code.markedmondson.me/gago/" target="_blank" rel="noopener noreferrer">gago</a> for Go) that has an &#8220;anti sampler&#8221; feature built-in.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.40.20.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5558" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.40.20.png" alt="Google Analytics Sampling Workaround" width="399" height="191" /></a></p>
<p><strong>Option 2</strong> &#8211; Collect raw hit-level Google Analytics data in your data warehouse (i.e. BigQuery). This process is known as <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a> and functions by duplicating all hits that are sent to your Google Analytics property into your data warehouse. This allows you to write ad-hoc queries in standard SQL without any sampling.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.45.25.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5559" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.45.25.png" alt="Google Analytics Data in BigQuery" width="567" height="471" /></a></p>
<h3>Data collection limits</h3>
<p><strong>Option 1</strong> &#8211; To get past the 10M hits/mo, 200,000 hits per user per day and 500 hits per session limits, the same Google Analytics <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a> solution can be used. This means that even if Google Analytics starts to skip some hits, they&#8217;re still available in your data warehouse and ready to be queried.</p>
<p><strong>Option 2</strong> &#8211; Set up multiple Google Analytics properties. This can be a little tricky but it is possible to divide your Google Analytics integration into multiple properties. Depending on how much traffic your site gets, you might divide all hits from the first week of the month into one property, hits from the second week into another property etc. Alternatively, you could divide hits based on the Client ID into as many groups/properties as needed. Then, you can use the Reporting API to pull data from all properties and join them using Excel, Pandas or some other tool.</p>
<h3>Data processing latency</h3>
<p>If you need to monitor the site&#8217;s performance in real-time or need data for product recommendations or other machine-learning efforts, the 24-48 hour delay is probably way too long. Your only option to get access to near-real-time Google Analytics data is to use the <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a> technology. With parallel tracking, processed hit data will be available in ~5 seconds after it was collected from your site. This allows you to build alert systems, monitor performance in real-time or feed your machine learning algorithms with fresh data at all times.</p>
<h3>Custom dimensions and metrics</h3>
<p>As mentioned earlier, Google Analytics free version limits you to <a href="https://support.google.com/analytics/answer/2709828?hl=en" target="_blank" rel="noopener noreferrer">20 indices of custom dimensions and metrics</a>. If you hit the limit and need to add more, your first action should be to check whether there&#8217;s a dimension or metric that you no longer need. If that&#8217;s not the case, your only solution is to either upgrade to Google Analytics 360 (to get 200 metrics and dimensions) or to use <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a> and get access to unlimited custom metrics and dimensions. When using parallel tracking, dimensions and metrics with indices 20+ will not show up in Google Analytics reports but are available in your data warehouse.</p>
<h3>Aggregated metrics and no access to raw data</h3>
<p>If you&#8217;re not happy with how Google Analytics aggregates data into sessions or calculates certain metrics, you need access to the raw hit-level data that allows you to define your own rules for almost any kind of aggregation or calculation. There are two options you can sue to get access to raw hit-level Google Analytics data.</p>
<p><strong>Option 1</strong> &#8211; You guessed it! <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">Parallel tracking</a> sends the raw hit-level Google Analytics data into your data warehouse (with no sampling) and you can use standard SQL to query your data and tools like Excel, R or Pandas to make all sorts of calculations and aggregations. BigQuery has a native connection with most BI and data visualisation tools so reporting is easy and flexible.</p>
<p><strong>Option 2</strong> &#8211; If your Google Analytics property is not affected by the data collection limits (it gets &lt;10M hits/mo), you could configure your custom dimensions to collect data like Client ID, Session ID, Hit Timestamp and Hit Type. With all these dimensions available in your dataset, you can use the Reporting API to pull the raw hit-level data into your local machine or data warehouse. If your property receives more than 10M hits a month, this option is only available if you split your data between multiple properties.</p>
<p><a  href="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.47.27.png" data-rel="lightbox-gallery-0" data-rl_title="" data-rl_caption="" title=""><img loading="lazy" decoding="async" class="aligncenter size-full wp-image-5562" src="http://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.47.27.png" alt="Google Analytics Custom Dimensions for Unsampled Data" width="801" height="207" srcset="https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.47.27.png 801w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.47.27-700x181.png 700w, https://reflectivedata.com/wp-content/uploads/2020/08/Screenshot-2020-08-25-at-13.47.27-768x198.png 768w" sizes="(max-width: 801px) 100vw, 801px" /></a></p>
<h3>Hit payload limited to 8000 bytes</h3>
<p>If your <a href="https://developers.google.com/analytics/devguides/collection/analyticsjs/enhanced-ecommerce" target="_blank" rel="noopener noreferrer">enhanced ecommerce</a> transactions include many products or you have lots of custom dimensions or for some other reason some of your hits exceed the 8000-byte limit then you are losing data. Google Analytics will, without any warnings, simply skip those hits. You have three options for solving this issue.</p>
<p><strong>Option 1</strong> &#8211; Send less data by removing some data points. Simo Ahava has written a great <a href="https://www.simoahava.com/analytics/automatically-reduce-google-analytics-payload-length/" target="_blank" rel="noopener noreferrer">blog post</a> on automatically reducing the Google Analytics payload length by removing some of the unnecessary data points and creating an order for removing other data points until the required payload size is reached.</p>
<p><strong>Option 2</strong> &#8211; Use the data import feature in Google Analytics to import extra information after the data has been sent to Google Analytics. This way you can send only the most important information (Client ID, transaction ID etc.) with the main hit and import other details (product information, custom dimensions etc.) later using the import feature. Check out <a href="https://www.bounteous.com/insights/2017/02/16/tracking-large-transactions-w-google-analytics-google-tag-manager/" target="_blank" rel="noopener noreferrer">this post</a> by Dan Wilkerson for a detailed guide for doing this with Google Tag Manager.</p>
<p><strong>Option 3</strong> &#8211; With <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a> your hits are limited to 16 000 bytes (double from Google Analytics) and this can be raised even more on request.</p>
<h2>Key takeaways</h2>
<p>Now that you have a pretty good overview of the limitations that come with the free version of Google Analytics and some that come with the 360 (premium) version, it&#8217;s a good time to assess your own Google Analytics implementation(s). Does any of the limitations affect your data and its accuracy? May some of the limitations become an issue in the future?</p>
<p>Knowing the limitations of a tool is extremely important. You may be investing a lot of time and money into a setup that may not be sufficient anymore as your business grows.</p>
<p>With all its limitations, Google Analytics is still an extremely powerful tool that has an awesome community. If sampling or some of the other limitations are an issue for you, I believe you can fix them with some of the solutions suggested in this blog post.</p>
<p>Most of the sampling and data collection limitations of Google Analytics are solvable with <a href="http://reflectivedata.com/services/google-analytics-parallel-tracking/" target="_blank" rel="noopener noreferrer">parallel tracking</a>, a system that sends raw hit-level Google Analytics data into BigQuery (or any other data warehouse) in real-time.</p>
<p>***</p>
<p>Should you have any other issues with Google Analytics or have an alternative solution to any of the problems mentioned in this post, please share your thoughts in the comments below.</p>
<p>The post <a href="https://www.reflectivedata.com/how-to-avoid-google-analytics-sampling-and-data-limits/">How to Avoid Google Analytics Sampling and Data Limits?</a> appeared first on <a href="https://www.reflectivedata.com">Reflective Data</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.reflectivedata.com/how-to-avoid-google-analytics-sampling-and-data-limits/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
	</channel>
</rss>
