Feeling duped by Facebook?

ao link

Members

Contact

Free AI assessment

New to DataIQ?

Take our FREE data literacy indicator now

Unlock the power of data - take our FREE data literacy indicator now

So now we know the truth about our Facebook friends - one in twelve may not be real friends. In fact, they may not even be real. The social network has revealed that 8.5 per cent of its 955 million accounts are not necessarily created and run by actual individuals.

As part of becoming a publicly-listed company, Facebook has been forced to open itself up to scrutiny for the first time in its short, eight-year life. Investors already disappointed with the losses they have made in its shares will have been keen to understand more about the active user base and how it will be monetised.

From that point of view, the news is not good for two reasons. The first is what Facebook reported about those user accounts: 4.8 per cent are duplicates (run in parallel to a main account), 2.4 per cent are misclassified (created on behalf of a pet or a business) and 1.5 per cent are “undesirable” (created for spamming purposes).

Each of those types of account breach Facebook’s terms of service and the company says that it makes efforts to identify and suppress such behaviour. In the case of pets and bots, it seems likely that big data is being analysed to spot behaviours that are not real.

Which brings us to the second reason why this revelation is not good news for investors. According to Facebook’s filing, “we are continually seeking to improve our ability to identify duplicate or false accounts and estimate the total number of such accounts, and such estimates may be affected by improvements or changes in our methodology.” Evidence of these improvements can be found in its announcement that a flaw was discovered in its geo-location attribution algorithm in June and that it is now identifying where users are more accurately.

But while this points to a positive approach to ensuring all users are genuine, another statement gives less confidence: “These estimates are based on an internal review of a limited sample of accounts and we apply significant judgment in making this determination, such as identifying names that appear to be fake or other behavior that appears inauthentic to the reviewers. As such, our estimation of duplicate or false accounts may not accurately represent the actual number of such accounts.”

Take a gulp and read that again. The world’s largest social network is awash with data, yet it is still using sampling and estimates to deal with the most basic issue - user ID. In my opinion, the reason is simple - the site has grown from a walled garden digital proposition to a Big Data giant without passing through any interim stages of data management and data quality.

Signing up for Facebook is easy and that is its problem. Without validation and matching routines right at the entry point, the network can not hope to maintain the credibility of its user profiles. Internal data cleansing and deduplication are standard practices at almost every other business, but not here, apparently.

With its IPO, Facebook raised $16 billion. It is a shame none of that appears to have been spent on installing data quality measures that are widespread and proven elsewhere. (And as I mentioned in a previous blog, other back office processes seem under-invested, too.)

Until it fixes that, returning some of that value to investors remains a distant prospect.

Log in to read the entire article

Gain access to the entire article by logging in or registering for a free account here.

Did you find this content useful?

Thank you for your input

Thank you for your feedback

Next read

Starting a data academy programme: A blueprint for success

Organisations need to implement their own data academies to prepare for long-term success as data’s place in the business world continues to rapidly evolve.

Next read

A case of the AI biter bit?

23 Apr 2024by David Reed

DataIQ’s Chief Knowledge Officer and Evangelist, David Reed, examines the hype cycle around generative AI and the actual speed of transformation being seen.

Pioneering AI initiatives revealed: DataIQ Announces 2024 AI Awards Shortlist

15 Apr 2024by Alex Roberts

The shortlist for the 2024 DataIQ AI Awards has been unveiled, with the winners to be announced at the DataIQ Summit on May 21.

Final chance to enter the 2024 DataIQ Awards and demonstrate your team’s prowess

08 Apr 2024by Alex Roberts

The final deadline for submissions to the 2024 DataIQ Awards – 26 April – is rapidly approaching, so make sure you have entered to clinch a title.

You may also be interested in

CDO Challenges – Stressing the importance of a data strategy

DataIQ is a trading name of IQ Data Group Limited
10 York Road, London, SE1 7ND

We use cookies so we can provide you with the best online experience. By continuing to browse this site you are agreeing to our use of cookies. Click on the banner to find out more.

Cookie Settings