Back to articles

Data minimization in GA4 and ad pixels

Data minimization is a GDPR principle—and it also improves data quality. Less noise, clearer events, fewer compliance surprises, and faster debugging when something breaks in production.

Minimization does not mean “track nothing.” It means collecting only what you need for defined purposes, keeping it only as long as needed, and restricting access to teams that require it.

GA4: what to trim

Redundant parameters

  • Custom dimensions duplicating built-in fields (page_location, session_id, device category)
  • Multiple event names for the same user action (form_submit vs contact_click vs lead)

PII and sensitive fields

Never send to GA4 (even hashed without legal review):

  • Email, phone, name, address in event parameters
  • Free-text form fields, medical/financial details
  • Government IDs or full credit card data

Use hashed identifiers for Ads enhanced conversions only with explicit consent and documented process.

User-ID scope

Enable User-ID only when:

  • You have lawful basis and disclosure
  • Users understand cross-device linking
  • Internal access controls protect joined profiles

Avoid default User-ID on all logged-in users “because GA4 docs suggest it.”

Ad pixels: what to avoid

  • Advanced matching with raw PII from checkout fields
  • Automatic event capture pulling query strings with email tokens (?email=)
  • Duplicate events from browser pixel + CAPI without event_id deduplication
  • Over-broad custom events sending entire product catalog objects on every page

Retention and access controls

GA4 Admin:

  • Set event data retention to match policy (2 or 14 months—justify 14)
  • Disable Google Signals if not used (reduces cross-device complexity)
  • Review connected Google Ads / BigQuery links regularly

BigQuery export:

  • Column-level access in warehouse; no open “analytics” group with all PII joins
  • Scheduled deletion jobs for expired partitions

Looker Studio:

  • Row-level filters for regional teams; no public dashboards with sensitive dimensions

Event design workshop (recommended)

Gather marketing, product, data, legal for 90 minutes:

  1. List decisions each KPI drives
  2. Map minimum events/parameters required
  3. Kill “nice to have” fields inherited from UA habits
  4. Publish event dictionary v1 with owners

Minimization checklist before launch

  • Every parameter has a documented business purpose
  • No string fields accept unbounded user-generated content
  • Consent gates match data sensitivity (marketing vs analytics)
  • Retention configured in GA4 and warehouse
  • DSAR/erasure process covers analytics IDs where feasible

Benefits beyond compliance

  • Smaller BigQuery bills and faster queries
  • Cleaner funnel reports (fewer duplicate events)
  • Easier agency onboarding with clear schema docs
  • Lower risk in enterprise security reviews

Minimization pairs well with consent: fewer sensitive fields mean fewer debates about whether a tag should fire at all—and faster answers when users ask “what do you hold about me?”