Public Money Beneficiaries

Finding Beneficiaries of Public Money

By Alex Yeung.

Part 1: The Matching Process

Linking entities is a common theme in procurement data, and it is a significant problem in the UK. The main concern with entity matching is the cost of false positives, which is particularly stark when finding the beneficiaries of public money. The core challenge is that name matching alone is largely insufficient for people or companies, and here’s why.

Take, for example, the “John Smith” conundrum.

“John Smith” matches “John Smith”, but a name on its own is insufficient information to be useful.

“John Smith, Born: 1971/01/21, Reigate, Surrey” matches “Jon Smith, DoB: 21/01/1971, Surrey, UK”

This match is more useful because other pieces of data corroborate it, such as the date of birth or a string of text from the address. Even then, for common names such as John Smith, it is possible that two people were born on the same date and reside in the same town.
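As a rough illustration of corroboration, the sketch below accepts a match only when the name is close, the date of birth agrees, and at least one address token overlaps. The record layout, the function names and the 0.85 threshold are assumptions made for this example, not a description of our production matching; ambiguous dates are assumed to be day-first.

```python
from datetime import date, datetime
from difflib import SequenceMatcher


def parse_dob(text: str) -> date:
    """Parse a date of birth written as YYYY/MM/DD or DD/MM/YYYY (day-first assumed)."""
    for fmt in ("%Y/%m/%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    raise ValueError(f"Unrecognised date format: {text!r}")


def name_similarity(a: str, b: str) -> float:
    """Similarity ratio between 0 and 1, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def address_tokens(address: str) -> set:
    """Lower-cased words of an address with surrounding punctuation removed."""
    return {token.strip(",.").lower() for token in address.split()}


def corroborated_match(rec_a: dict, rec_b: dict, name_threshold: float = 0.85) -> bool:
    """True only if name, date of birth and address all support the match."""
    return (
        name_similarity(rec_a["name"], rec_b["name"]) >= name_threshold
        and parse_dob(rec_a["dob"]) == parse_dob(rec_b["dob"])
        and bool(address_tokens(rec_a["address"]) & address_tokens(rec_b["address"]))
    )


record_1 = {"name": "John Smith", "dob": "1971/01/21", "address": "Reigate, Surrey"}
record_2 = {"name": "Jon Smith", "dob": "21/01/1971", "address": "Surrey, UK"}
print(corroborated_match(record_1, record_2))
```

Even a positive result here is only a candidate: as noted above, two people with a common name can share a birth date and a town, so a human check is still needed.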

Lost In Translation

This problem is exacerbated by some names, especially those from the Far East, which are character-based. There’s often little consistency in how they’re represented in other languages. Take the author’s Chinese name: it can be anglicised to Saiman, Sai mun, or even Simon. The last of these might not even reflect the native-language name, as many people simply adopt a name like ‘Thomas’.

Therefore, for John Smith and other names, much of the battle for matching is finding supporting data that allows the match to be made. This includes investigation using publicly available data to corroborate matches.

Matching Bias

Of course, the reverse is also possible. Many cultures and countries have naming systems that produce longer or more distinctive names. Take the name of the UK chancellor at the time of writing, Rishi Sunak. On Companies House there are only three entries for company officers of this name, two of which share the same month of birth. This narrows the scope for verification somewhat. Contrast this with 375,407 entries for a search for John Smith! (https://find-and-update.company-information.service.gov.uk/search/officers?q=john+smith#)

Batch Matching Names

Of course, when it comes to Companies House, a more efficient approach is needed. Even generously assuming that it would take 5 minutes on average to verify each name, the 375,407 entries come to roughly 31,300 hours of work. That is over 3,900 eight-hour working days, or about 15 years at roughly 260 working days a year, to go through John Smith alone.

We Just Use A Script.

Using our data infrastructure and our algorithms, we can compare two lists within days, not years. Of course, we add our own investigatory magic to verify our matches. We have to. To give an example, here is what a machine might see:

Name list 1: glmb zimrh

Name list 2: glmb qznvh zimrh

Judging solely by what can be seen, a match is not immediately obvious in these two strings. In fact, they are the following two names with the alphabet simply inverted (a becomes z, b becomes y, and so on):

Tony Arnis

Tony James Arnis

Note: ‘Tony Arnis’ and ‘Tony James Arnis’ are both names made up for this article.
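For the curious, the inversion can be reproduced in a few lines. This is only a sketch of the substitution used to disguise the two example names; the function name `invert_alphabet` is made up for this illustration.

```python
import string

# Translation table mapping a->z, b->y, ... and A->Z, B->Y, ...
_LOWER = string.ascii_lowercase
_UPPER = string.ascii_uppercase
_INVERSION = str.maketrans(_LOWER + _UPPER, _LOWER[::-1] + _UPPER[::-1])


def invert_alphabet(text: str) -> str:
    """Swap each letter for its mirror in the alphabet; other characters are left alone."""
    return text.translate(_INVERSION)


print(invert_alphabet("tony arnis"))        # glmb zimrh
print(invert_alphabet("tony james arnis"))  # glmb qznvh zimrh
```

Applying the function twice returns the original text, which is why the disguised strings can be decoded with the same function.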

To the human eye, it’s pretty obvious that this is a match. This is because an Anglophone reader has a priori knowledge of which surnames are common, and Arnis is not one of them (https://find-and-update.company-information.service.gov.uk/search/officers?q=arnis). The script does not have this a priori knowledge and would likely reject the match. Of course, it could be given it: a more advanced algorithm trained on substantial amounts of prepared data might well address this, but this takes time and resources to develop and train.

Stuck In The Middle

The ‘Tony James Arnis’ issue is representative of a broader challenge: middle names are really annoying for name matches because they are so inconsistently applied. One list of names might include middle names; another might not. Even personal use of middle names is not consistent. Incidentally, the UK passport only has so much space. If a person has too many middle names, one or more might be cut off partway. Again, this requires investigatory work to get right.
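One way a script can be made more tolerant of middle names is sketched below. It treats two names as a candidate match when the first and last names agree and any middle names present on both sides are compatible, allowing for initials and truncated names. The function names and the rules themselves are assumptions for illustration; a candidate still needs verifying.

```python
def name_tokens(name: str) -> list:
    """Lower-case a name and split it into words, treating full stops as spaces."""
    return name.lower().replace(".", " ").split()


def is_candidate_match(name_a: str, name_b: str) -> bool:
    """True if the outer names agree and the middle names do not conflict."""
    tokens_a, tokens_b = name_tokens(name_a), name_tokens(name_b)
    if len(tokens_a) < 2 or len(tokens_b) < 2:
        return False  # a single word is not enough to compare
    if tokens_a[0] != tokens_b[0] or tokens_a[-1] != tokens_b[-1]:
        return False
    # Compare the shorter list of middle names against the longer one.
    shorter, longer = sorted((tokens_a[1:-1], tokens_b[1:-1]), key=len)
    # An initial or a truncated name counts as compatible with its full form.
    return all(
        any(full.startswith(part) or part.startswith(full) for full in longer)
        for part in shorter
    )


print(is_candidate_match("Tony Arnis", "Tony James Arnis"))       # True
print(is_candidate_match("Tony J. Arnis", "Tony James Arnis"))    # True
print(is_candidate_match("Tony John Arnis", "Tony James Arnis"))  # False
```

The rule is deliberately permissive when one side has no middle name at all, which is exactly the case where supporting data such as a date of birth has to do the rest of the work.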

A Titular Distinction

Titles pose a similar problem to middle names. Titles can appear anywhere within a name, which means that script matching can reject what might appear to be perfectly good matches. Mrs, Dr., Prof., Professor, LLM, MRCVS, Eng, FBCS: these are but a few titles that can confound simpler scripts. This is especially troublesome when there are no consistent conventions for naming within any lists of names: some lists might put Dr. at the front of the name, others at the back. There are ways around this, however, through better scripts, but that’s a story for another time.
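As a small taste of what a better script might do, the sketch below removes known titles wherever they appear before names are compared. The `TITLES` set is an illustrative, incomplete list taken from the examples above, and `strip_titles` is a hypothetical helper rather than our actual code.

```python
import re

# Illustrative and incomplete; a real list would be far longer.
TITLES = {"mrs", "dr", "prof", "professor", "llm", "eng", "fbcs"}


def strip_titles(name: str) -> str:
    """Remove known titles from anywhere in a name, ignoring case and trailing full stops."""
    tokens = re.split(r"[\s,]+", name.strip())
    kept = [t for t in tokens if t and t.rstrip(".").lower() not in TITLES]
    return " ".join(kept)


print(strip_titles("Dr. Tony Arnis"))            # Tony Arnis
print(strip_titles("Tony Arnis, FBCS"))          # Tony Arnis
print(strip_titles("Professor Tony Arnis LLM"))  # Tony Arnis
```

A blunt list like this has its own false positives: Eng, for example, is also a surname, so stripping it blindly would damage some genuine names.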

The challenges are many but not insurmountable. Matching is critical in bringing beneficial ownership data and procurement data together. With our colleagues at Open Ownership, we will continue to think about how we will do this in the future, both with the tools at our disposal and the tools we can create.
