In our SAMbot project, we use machine learning models to assess abusive language. These models are ever-evolving, which means that during each deployment, our data is more accurate and informed.
Please note that compared to our previous SAMbot deployments, we have evaluated abusive tweets significantly differently in this election.
Our machine learning model makes a confidence prediction to assess whether a tweet should be considered “abusive.” When measuring abuse, our model gives each tweet a score from 0% to 100% for each category, based on how confident it is that the tweet is abusive in nature.
Previously with SAMbot deployments, we used a 51% confidence prediction to evaluate abuse; we have changed to now use a 70% confidence prediction. This change means that our analysis will be more accurate, and that cumulative results will paint a better picture of how abuse is distributed across the entire election and across all candidates.
Simultaneously, this change also means that some nuanced and subtle forms of abusive language may be missed by our machine learning model, and will make it appear at first glance as if there is comparatively less abusive content present, which is not necessarily the case. Machine learning models will never be able to monitor all abusive language across an election, as the subjective nature of what constitutes “abuse” does not permit the possibility of 100% accuracy. This methodological change allows us to more accurately represent how abuse is distributed overall.
We have made this change as part of our ever-evolving intention to strive for more accurate and ethical methodological practices within the field of social media and machine learning research. Using confidence intervals in this way is in line with recommendations for social science research.
This change makes abuse volumes look considerably lower than in previous elections we have tracked (however, SAMbot data should never be compared across elections regardless). Please consider these significant methodological changes while interpreting the data in this report.