SocialLinks database - is a graph database, in which we have already uploaded about 7 billion. (6.4 bln. at the end of 2017) records about people, companies, places and their connections.
Most of the data is obtained by parsing a variety of white and yellow pages, company registers, business directories, social networks and other open online sources.
We ensure that fields with the same entities are always called the same, i.e. you can be sure that in the phone field-always there will be only phone numbers, in the field alias - nicknames or usernames, in ip - ip-addresses, etc. This allows searching immediately in the whole database, as well as to supplement the output of the found results with related records.
We also do normalization and cleanup of basic fields .
In building relationships, we use both natural links by clear identifiers (mail, phone, social network ID, IP), and links from the original data source. For example, when one record contains information about a person and company or a list of employees in a company.
The following is the statistics for the most common fields:
SL DB contains data from all over the world and serves as a great addition to our online sources.
Based on data from SL DB we made several separate transformations:
- [SL DB] IP to Emails and [SL DB] Email to IPs
- [SL DB] IP to Phones and [SL DB] Phone to IPs
- [SL DB] To Emails @domain