How accurate are online hate speech detection tools?

Mar 12th '24

Automated classifiers are tools used to detect harmful content, such as hate speech. These safety measures can be used to significantly reduce people’s experiences of harm online. Researchers also use these tools to identify how a change to a platform (for example, when it changes its rules or removes certain content or users) impact the frequency of hate speech.


However, according to Ofcom analysis, it is important for researchers to indicate which classifiers they have used and how they have performed. This is because the performance of classifiers may vary substantially. For example, widely-used classifiers may perform poorly in relation to some datasets.


Ofcom has analysed the performance of two hate speech classifiers: Perspective API – the most commonly used ‘off-the-shelf’ classifier – and HateXplain, which was trained on similar data to the test dataset used for this assessment. The purpose was to explore how these different classifiers perform and, then, the implications for research on the effectiveness of these types of safety measures.


Ofcom found that Perspective API identified 13% of all hate speech in the test dataset, compared to 78% with HateXplain. This highlights that accuracy is significantly improved when using a classifier trained on a dataset from the same platform and user base.


They also found the performance of the classifiers varied depending on the target of the hate speech. The volume of errors made by the Perspective API classifier in identifying hate speech targeted at certain ethnic groups, in the dataset Ofcom used, renders it no better than random guessing.


The results suggest that, in relation to certain datasets and in comparison to classifiers which have been developed using similar datasets, Perspective API may make biased errors when predicting hate speech targeted at certain protected characteristics. Ofcom used Perspective API because it easily available and widely used. The purpose of this analysis is not to say that Perspective API is generally a poor performing classifier. Rather, that it may sometimes perform poorly, or poorly in comparison with other automated classifiers.


Based on this analysis, Ofcom believe it is important that, when researchers use hate speech classifiers to identify the frequency of hate speech, they also report how well the classifier performed and how it performs relative to other available classifiers. Otherwise, the results they present may not be robust.


To implement the UK’s online safety laws, Ofcom must produce Codes of Practice and Guidance that set out safety measures online services can adopt to protect their users and comply with their new duties.


The study forms part of a substantial programme of research to inform Ofcom’s regulatory approach. They will update the Codes over time as evidence base improves and as technology and harms evolve.


Source:  Office of Communication (Ofcom)


About Ofcom

Ofcom is the regulator for the communications services that we use and rely on each day.



About us

LS Consultancy are experts in Marketing and Compliance, and work with a range of firms to assist with improving their documents, processes and systems to mitigate any risk.


We provide a cost-effective and timely bespoke copy advice and copy development services to make sure all your advertising and campaigns are compliant, clear and suitable for their purpose.


Our range of innovative solutions can be tailored to suit your unique requirements, no matter whether you’re currently working from home, or are continuing to go into the office. Our services can be deployed individually or combined to form a broader solution to release your energies and focus on your clients.


Contact us today for a chat or send us an email to find out how we can support you in meeting your current and future challenges with confidence.


Explore our full range today.


Need A Regulatory Marketing Compliance Consultant? A Bit More About Us


Contact us


Why Not Download our FREE Brochures! Click here.


Call Us Today on 020 8087 2377 or send us an email.



Connect with us via social media and drop us a message from there. We’d love to hear from you and discuss how we can help.


Facebook | Instagram | LinkedIn | X (formally Twitter) | YouTube


Contact us