Managing Data Quality

Neelesh Shastry
Neelesh Shastry
  • Updated

Overview

The Data Quality (DQ) tab on the pipeline page is the command‑center for authoring, organizing, and evaluating rules that measure the health of each entity in your Syncari hub.

Key capabilities

CapabilityWhat it lets you do
Rule authoringCreate granular checks—ranging from simple field validations to complex multi‑field logic—that score records as Pass or Fail.
Category managementGroup rules into logical buckets (e.g., Completeness, Conformity). Syncari ships several starter categories; you can add your own to match internal standards.
Pipeline integrationLeverage temporary variables computed in pipelines to express rich, reusable logic.
 Data Quality DashboardThe Data Quality Dashboard in Insights Studio visualizes scores by entity, category, and individual rule over time, with built‑in DQ trends.

 

Authoring Data Quality Rules

Navigate to the Data Quality Tab

From Sync Studio → Your Entity → Data Quality, you’ll see the existing rule grid (above). Use the Create Rule button to open the rule designer.

Create a Rule

  1. Click Create Rule.
  2. Fill out:
    • Name – clear, action‑oriented (Is Unique Record).
    • Policy – choose Report, Warn, or Fail.
    • Scope – select the fields or records the rule evaluates.
    • Category – pick an existing category or Add Category.
    • Condition – combine fields, TVs, and operators.
  3. Validate to preview hit counts → Save.

Need advanced operators? Open the operator dropdown for options such as Is PhoneNumber, In Reference Data, Not Equals, and more.

Leverage Pipeline‑Derived Variables

Complex logic is often easier to compute upstream in a pipeline and stored in a temporary variable (TV).

  1. In Pipelines → Your Pipeline, add relevant functions and actions that evaluate your DQ rule.
  2. Output the result to a temporary variable via the Set Value function.
  3. Save  the pipeline.
  4. Now you can use this temporary variable in DQ Rules

 

 

Managing Categories

Categories help stakeholders filter dashboards and bulk‑toggle rule groups.

Syncari ships the following starters:

Starter CategoryPurpose
CompletenessMandatory field presence
ConformityFormat and pattern checks
UniquenessDuplicate detection
ValidityField‑to‑field cross checks

Add or edit categories via Manage Categories. Re‑assign rules at any time without losing history.

 

 

Monitoring Quality in Insights Studio

 

Open the Dashboard

  • Insights Studio → Dashboards → Data Quality Dashboard presents entity‑level and category‑level scorecards.
  • Hover over any bar or line to view exact scores.

Built‑in Reports

ReportWhat it showsDrill‑down
Current Score by EntityWeighted score per entity.Entity → Record list
Overall Score by CategoryPass/Fail counts per category.Category → Rule list
Score Over TimeTrendline per entity or category.Point → Daily variance

Filtering & Time Windows

Use the filter bar to adjust Date Range, Entity, Category, or Severity. Trendlines adjust in real time for pre/post remediation comparison.

 

 

 

Best Practices

  1. Start small—enable a narrow set of critical rules to avoid alert fatigue.
  2. Compute once, reuse everywhere—centralize complex formulas in pipelines, exposing TVs to rules.
  3. Version intentionally—clone rules before major logic changes to keep historical comparability.
  4. Align categories to owners—map each category to a steward (e.g., Finance owns Billing Integrity).
  5. Schedule reviews—monitor dashboard trendlines weekly; investigate spikes quickly.

 

Troubleshooting & FAQs

SymptomResolution
Rule not evaluatingConfirm pipeline temporary variable is populated before DQ runs; check node execution order.
Dashboard missing entityEnsure at least one active rule exists for that entity.
Performance lag on large entitiesReduce the number of rules

 

Appendix: Regular Expression Cookbook

Regular expressions are very powerful tools to match patterns. Here are some common regular expressions you can use in DQ rules

Common Regular Expressions for Data Quality

#PurposeRegular Expression
1Email address (RFC 5322 trimmed)^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,}$ (use case‑insensitive flag)
2US phone number (E.164)^\+1\d{10}$
3Generic international phone (E.164, 8–15 digits)^\+\d{8,15}$
4US ZIP or ZIP+4^\d{5}(-\d{4})?$
5ISO‑3166‑1 alpha‑2 country code^[A-Z]{2}$
6State/Province 2‑letter code (US/CA)^[A-Z]{2}$
7UUID v4^[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-4[0-9a-fA-F]{3}-[89abAB][0-9a-fA-F]{3}-[0-9a-fA-F]{12}$
8Date YYYY‑MM‑DD^\d{4}-\d{2}-\d{2}$
9URL (HTTP/HTTPS)^(https?):\/\/[\w.-]+(?:\/[\w\-._~:/?#\[\]@!$&'()*+,;=]*)?$
10Alphabetic name with spaces & punctuation^[A-Za-z][A-Za-z \-']*$

Related to

Share this

Was this article helpful?

0 out of 0 found this helpful