I am more of a passive user on LinkedIn. I hardly like, share or post any status on LinkedIn. However, just out of curiosity I downloaded the data which LinkedIn stored for me and decided to analyze it. To get your data it is quite easy. Go to your LinkedIn account, go to settings, and select Privacy and go to download your Data. It will ask you to enter your LinkedIn password for security reasons.
After that it will notify that they will notify you once the data is ready to download. Then immediately within 10-15 minutes I received an email from LinkedIn saying that Part 1 of your data is ready for download. It also says that the second part of your data would be ready within 24 hours. It will redirect to your LinkedIn profile again to download the data from where you can download the zip file.
After unzipping the file, this is how the folder structure looks:
1) Videos.csv contained just one line https://www.linkedin.com/psettings/member-data/videos and when I went to that link it said “You haven’t uploaded any videos L “which is true. I haven’t uploaded any videos on LinkedIn.
2) Skills.csv contained all the skills which I had mentioned on the profile. Some of them were R, Data Analysis, Statistics, Data Science etc. The skills which other people endorse us for (even if we don’t have it 😛 ). I actually even expected them to store the count of how many people have endorsed the skill because that is an important number to keep. Anyway, if they are showing it on the profile they must be definitely be storing it somewhere.
3) Registration.csv had the details when I registered for the website, I suppose. I don’t actually clearly recall the date and time I signed up for LinkedIn but I am assuming this is correct. The other columns were blank.
From the IP address, I checked what details I can find out. A basic search reveals this details from IP.
So, I was at home when I signed up for LinkedIn.
4) Projects.csv includes the projects which you have added on LinkedIn along with its description, URL (if any), its start and end date as well.
5) Profile.csv maintains details of your profile which you have shared. Your name, address, birth date etc.
6) Positions.csv keeps a record of all your employment details. The organizations you have worked with, your title there, the duration for which you worked etc.
7) Phone Numbers.csv contained only my phone number in it.
8) Messages.csv had all the conversation/messages I had over LinkedIn. All the messages sent/received. One thing worth noticing was it had 1074 messages in total and the last message was from November 2014. It is hard to believe that I did not receive any messages from the time I signed up on LinkedIn from 2011 to 2014. Or do they show only top 1000 messages or so?
9) Languages.csv contains languages and their proficiency.
10) Invitations.csv contains information about all the invitations sent by you and received by you. The time the invitation was send and if any invitation message was sent along with it. Also this had around 2k rows which is way less than my total connections and my oldest connection was from 2017 so I believe even this has some filter in it like messages.csv.
11) Imported Contacts.csv has all the contacts which you have imported from your personal email address. First and Last Name of imported contact, their email address, when was the contact imported , their phone number (if any).
12) Email Addresses.csv includes your email address. I had two email address, one was primary and the other one secondary. It also has a flag if the email addresses are verified or not.
13) Education.csv like the Positions.csv has details of your education whatever you have uploaded.
14) Courses.csv includes all the courses you have taken which you have included on the platform.
15) Connections.csv Now this is I think the most important csv of all. This has got list of all your connections along with their email addresses, the company they work for, their position and the time when you were connected with them. So one thing we need to keep in mind is that when we are connecting with anybody we are giving away our email address to them.
16) Certifications.csv has the certificates which you have included.
17) Cause you Care About.csv This too is straight forward info which you have shared.
Media Files – has the media files that you have shared on the platform. Any images / document that is uploaded.
Jobs – This gets divided into two parts. One csv is for our job preference (Job Seeker Preference.csv) which even I don’t remember when I set it up. Says I am looking for job casually and ready to join in 4 to 6 months. The other csv which is Job Applications.csv has all the details of all the job you have applied it till now. The time when you applied, the title for which you applied, the company name and the resume name which you uploaded.