<?xml version="1.0" encoding="UTF-8"?><ns2:project xmlns:ns1="http://gtr.rcuk.ac.uk/gtr/api" xmlns:ns2="http://gtr.rcuk.ac.uk/gtr/api/project" xmlns:ns3="http://gtr.rcuk.ac.uk/gtr/api/fund" xmlns:ns4="http://gtr.rcuk.ac.uk/gtr/api/person" xmlns:ns5="http://gtr.rcuk.ac.uk/gtr/api/project/outcome" xmlns:ns6="http://gtr.rcuk.ac.uk/gtr/api/organisation" ns1:created="2026-06-03T15:52:43Z" ns1:href="http://gtr.ukri.org/gtr/api/projects/88FF8361-D6C8-42AA-A2AD-080ABDA12240" ns1:id="88FF8361-D6C8-42AA-A2AD-080ABDA12240"><ns1:links><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/persons/FA4ED564-4534-4CEE-9701-435C037DAAD6" ns1:rel="PM_PER"/><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/organisations/2F7F2F8D-EEF6-4DEB-8825-AA72B9EFB51F" ns1:rel="LEAD_ORG"/><ns1:link ns1:href="http://gtr.ukri.org/gtr/api/organisations/2F7F2F8D-EEF6-4DEB-8825-AA72B9EFB51F" ns1:rel="PARTICIPANT_ORG"/><ns1:link ns1:end="2016-04-29T23:00:00Z" ns1:href="http://gtr.ukri.org/gtr/api/funds/B69B2557-0FD7-4B34-91F7-9934E7A1C9CE" ns1:rel="FUND" ns1:start="2015-04-30T23:00:00Z"/></ns1:links><ns2:identifiers><ns2:identifier ns2:type="RCUK">710710</ns2:identifier></ns2:identifiers><ns2:title>PeopleGraph</ns2:title><ns2:status>Closed</ns2:status><ns2:grantCategory>GRD Proof of Concept</ns2:grantCategory><ns2:leadFunder>Innovate UK</ns2:leadFunder><ns2:abstractText>3Desk Ltd has discovered that there is no available comprehensive internet search tool for
people. Some general search engines and specific membership networking and social media
platforms (e.g. LinkedIn, Facebook) exist, but none work like a ‘Google for people’. The
PeopleGraph Project objectives are to further investigate and prove the techniques necessary
to build a viable people search engine that operates at the scale, speed and accuracy required
and to further investigate the market potential.
The top 300 most used platforms and sites on the web currently contain over 20bn people
profiles, projected to grow to 50bn within 2 yrs. ~300 additional such platforms identified also
contain significant data. The search process being developed involves data gathering (profile
discovery) followed by information normalisation and decoration, resulting in identity
resolution and matching.
3Desk’s techniques to process and handle the huge volumes of data required are in initial
development but many critical issues of speed and accuracy remain to be solved.
Key challenges in profile discovery are the quality and depth of returned data, crawler
performance and optimal strategies for regular dataset refresh. Challenges in identity
matching are improving the speed, accuracy and quality of outputs. Common techniques and
algorithmic approaches have been largely addressed so further developments will involve
innovative ground-breaking work in the following:
Web crawler efficiency, Natural Language Processing (NLP), Graphing and linkage
resolution on 100 billion nodes with a short, efficient processing time, Single and Relational
entity matching algorithms, Data indexing, rating and ranking.
Expected benefits will be the creation of new business models and services in multiple
sectors, e.g. identity verification, recruitment, improved government data and services,
criminal investigation, enhanced personalised marketing and market analysis, to name a few.</ns2:abstractText></ns2:project>