Data Snacks

Data Snack: Opportunities in generating synthetic financial data

  • Collecting, organizing and sharing data can be a costly and risky process.
  • FIs and Big Tech are looking into generating 'synthetic data'. Which mimics real data but removes the risk of privacy breaches occurring due to exposure of Personally Identifiable Information.

Email a Friend

Data Snack: Opportunities in generating synthetic financial data

While data can be exceptionally useful for analytics and strategizing, mismanaging access to it can lead to significant security risks for both organizations and consumers. Personally Identifiable Information  poses a challenge for organizations, who generally want to retain as much detail as they can, without exposing customers to privacy risks.

One solution is synthetically generated data, which mimics real data sets but does not hold any PII. Moreover, synthetic data circumvents the labor and costs attached to data collection and organization, allowing teams to develop algorithms faster and with less red tape.

In the past year companies like Microsoft, Google, and Amazon have all spoken to the importance of synthetic data and its use in their current architecture. San-Diego based startup and synthetic data creator closed a $50 million Series B funding round in October, led by Anthos Capital. Their products, such as a privacy toolkit, safeguard synthetic data from adversarial attacks and also enables teams to de-bias and anonymize their data sets, while also allowing for the sharing of data among teams more securely.

JP Morgan’s AI research has developed the following model for generating synthetic data sets: 

Source: JP Morgan

The flow diagram is explained by JP Morgan as follows: 

Step 1:  Compute metrics for the real data
Step 2:  Develop a Generator (may be statistical methods or an agent-based simulation)
Step 3:  (Optional) Calibrate the Generator using the real data
Step 4:  Run the Generator to generate synthetic data
Step 5:  Compute metrics for the synthetic data
Step 6:  Compare the metrics of the real data and synthetic data
Step 7:  (Optional) Refine the Generator to improve against comparison metrics

In their research on the subject, JP Morgan found that tabular data in retail banking and time series of market microstructure data are the most in need of protection by financial institutions.

Tune into our Data Day Conference on the 21st of June to find out more about how data is changing the fintech landscape. 

0 comments on “Data Snack: Opportunities in generating synthetic financial data”

Data Snacks, Member Exclusive

Data Snack: Millennial retail traders rise during the bear market

  • Millennials opened 46% of all new retail trading accounts in the first half of 2022, according to DriveWealth’s Global Investor Report.
  • The ability to invest with small dollar amounts was the most frequently cited motivator to start investing.
Lindi Miti | December 01, 2022
Data Snacks

Data Snack: Digital authentication is a hard nut to crack for banks

  • The age of passwords seems to be giving way to more frictionless mechanisms like biometrics or behavioral analysis.
  • Passwords, biometrics and Face ID are the computer engineer’s answer to traditional and real-world authentication processes.
Rabab Ahsan | November 08, 2022
Data Snacks

Data Snack: PayPal continues to report slow user growth, focusing on monetization

  • PayPal reported its third-quarter results last week, beating market expectations on both top and bottom lines.
  • However, much of the focus went towards its stagnant user growth figures, which left the market pondering over the firm's strategy through the market downturn.
Iulia Ciutina | November 07, 2022
Data Snacks, Member Exclusive

Data Snack: Cross-border payments prepare to digitize; cards and ATMs lose their sheen

  • Consumers and businesses alike now demand fast cross-border payments, but current systems can only go so far.
  • Newer technologies like APIs, tokenization, and blockchain come with the promise of secure lines and instant settlements.
Subboh Jaffery | October 31, 2022
Data Snacks

Data Snack: Crypto apps earn the most revenue per user, Cash App ranks third

  • A recent study explored the revenue made by finance apps to find out how much, on average, each app is making per user.
  • Results showed that crypto companies take the top two spots, followed by Cash App, which ranks third on the list.
Lindi Miti | October 28, 2022
More Articles