Monday 9 September 2024

Illegal content judgements - fitting the Illegal Harms consultation to the Online Safety Act

This is Part 5 of a series of reflections on Ofcom’s Illegal Harms Consultation under the Online Safety Act (OSA). Ofcom’s consultation (which closed in February 2024) ran to a mammoth 1728 pages, plus an additional 77 pages in its recent further consultation on torture and animal cruelty. The results of its consultation are expected in December.

For readers not fully conversant with the OSA, the reason why Ofcom has to consult at all is that the OSA sets out most of the illegal content service provider duties in stratospherically high-level terms, anticipating that Ofcom will bring the obligations down to earth by means of concretely articulated Codes of Practice and Guidance.  If the Act were an algorithm, this would be a non-deterministic process: there is no single answer to the question of how the high-level duties should be translated into detailed measures. The number and range of possibilities are as good as infinite.

The main contributor to this state of affairs is the way in which the Act frames the service providers’ duties as requirements to put in place “proportionate” systems and processes designed to achieve stipulated aims.  That leaves tremendous latitude for debate and judgement. In simple terms, Ofcom’s task is to settle on a set of systems and processes that it considers to be proportionate, then embody them in concrete Codes of Practice, recommended measures and guidance.  Those proposed documents, among other things, are what Ofcom has been consulting on.

Of course Ofcom does have to work within the statutory constraints of the Act. It cannot recommend measures that stray outside the boundaries of the Act.  The measures that it does recommend should interact sensibly with the duties defined in the Act. For abstractly expressed duties, that presents little problem. However, a tightly drawn statutory duty could have the potential to collide with specific measures recommended by Ofcom.

Awareness of illegality

One such duty is Section 10(3)(b). This requires a U2U service provider to have proportionate systems and processes in place designed swiftly to take down illegal content upon becoming aware of it. This is a relatively concrete duty, verging on an absolute takedown obligation (see discussion in Part 3 of this series).

A service provider will therefore need to understand whether (and if so at what point) the takedown obligation kicks in when it is implementing Ofcom’s operational recommendations and guidance. That turns on whether the service provider has ‘become aware’ of the presence of illegal content.

Behind that innocuous little phrase, however, lie significant issues of interpretation. For instance, if an automated system detects what it thinks is illegal content, does that trigger the Section 10(3)(b) takedown duty? Or is it triggered only when a human becomes aware? If human knowledge is necessary, how does that square with Section 192, which requires a provider to treat content as illegal if it has reasonable grounds to infer illegality – and which specifically contemplates fully automated systems making illegality judgements?

Ofcom’s consultation does not spell out in terms what interpretations have been assumed for the purposes of the draft Codes of Practice, Guidance and other documents that Ofcom is required to produce. It is thus difficult to be sure how some aspects of the proposed recommended measures are intended to mesh with S.10(3)(b) of the Act.

The questions of interpretation raised by S.10(3)(b) are set out below.

S.10(3)(b) duty: “A duty to operate a service using proportionate systems and processes designed to … where the provider is alerted by a person to the presence of any illegal content, or becomes aware of it in any other way, swiftly take down such content.”

Interpretation question 1: Does “becomes aware” mean that a human being has to be aware?

Significance: Some of Ofcom’s recommendations involve automated detection methods. Could the swift takedown duty kick in during automated detection, or does it apply only if the content is passed on to a human moderator?

Interpretation question 2: Does “aware” mean the same as “reasonable grounds to infer” in S.192(5) and (6) (illegal content judgements)?

Significance: If the provider has reasonable grounds to infer that content is illegal, it must treat it as such (S.192(5)). Does that mean it must swiftly take it down under S.10(3)(b), or does “aware” set a different threshold?

Interpretation question 3: If “aware” means the same as “reasonable grounds to infer” in S.192(5) and (6), is the answer to the ‘human being’ question affected by the fact that S.192 expressly contemplates that a judgement may be made by automated systems alone?

It is also noteworthy that the obligation under Section 66 to refer previously undetected and unreported CSEA content to the National Crime Agency is triggered by the provider becoming ‘aware’ of the content – again, not further defined. In the context of S.66, the Information Commissioner in its submission to the Ofcom Illegal Harms consultation observed: 

“Our reading of measure 4G is that it could allow for the content moderation technology to be configured in such a way that recognises that false positives will be reported to the NCA. Whilst we acknowledge that it may not be possible to completely eliminate false positives being reported, we are concerned that a margin for error could be routinely “factored into” a service’s systems and processes as a matter of course. This is unlikely to be compatible with a service taking all reasonable steps to ensure that the personal data it processes is not inaccurate.

We therefore consider that services should be explicitly required to take into account the importance of minimising false positives being reported to the NCA.”

Human awareness only?

Consider a hypothetical Code of Practice measure that recommends automated detection and blocking of a particular kind of illegal user content. Can detection by an automated system constitute the service provider becoming aware of it, or (as an English court in McGrath v Dawkins, a case concerning the eCommerce Directive hosting shield, appears to have held) only if a human being is aware?

If the latter, then Ofcom's hypothetical recommendation will not interact with S.10(3)(b). If the former, then the possibility that the S.10(3)(b) removal obligation would be triggered during automated detection has to be factored in. The Ofcom consultation is silent on the point.
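
To make the two readings concrete, here is a minimal sketch (in Python) of a detect-and-block pipeline of the kind the hypothetical measure contemplates. Every name and the detection logic are invented for illustration; the only point is where, on each reading, the provider could be said to ‘become aware’ of the content.

    # Minimal, purely illustrative sketch. The detector, the review step and the
    # takedown action are all hypothetical; neither the OSA nor the consultation
    # prescribes any particular architecture.

    from dataclasses import dataclass

    @dataclass
    class Post:
        post_id: str
        text: str

    def automated_detector(post: Post) -> bool:
        # Hypothetical classifier: flags the post as suspected illegal content.
        return "example prohibited phrase" in post.text.lower()

    def human_confirms(post: Post) -> bool:
        # Placeholder for a human moderator's decision.
        return True

    def take_down(post: Post) -> None:
        print(f"taking down {post.post_id}")

    def moderate(post: Post, human_review: bool) -> None:
        if automated_detector(post):
            # Reading 1: if automated detection itself amounts to the provider
            # "becoming aware", the S.10(3)(b) swift takedown duty is arguably
            # engaged at this point, before any human sees the post.
            if not human_review:
                take_down(post)
            # Reading 2 (the McGrath v Dawkins-style reading): awareness arises
            # only when a human moderator actually considers the flagged post.
            elif human_confirms(post):
                take_down(post)

    moderate(Post("123", "an example prohibited phrase"), human_review=False)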

Awareness threshold

Relatedly, what is the threshold for awareness of illegal content? S.10(3)(b) has similarities to the eCommerce Directive hosting liability shield. Eady J said of that provision: “In order to be able to characterise something as ‘unlawful’ a person would need to know something of the strength or weakness of available defences” (Bunt v Tilley). Has that standard been carried through to S.10(3)(b)? Or does the standard defined in S.192 OSA apply?

S.192 stipulates the approach to be taken where a system or process operated or used by a provider of a service for the purpose of compliance with duties under the Act involves a judgement by a provider about whether content is illegal content: 

"In making such judgements, the approach to be followed is whether a provider has reasonable grounds to infer that content is content of the kind in question (and a provider must treat content as content of the kind in question if reasonable grounds for that inference exist)."

In marked contrast to Eady J's interpretation of the eCommerce Directive hosting shield, S.192 goes on to say that the possibility of a defence is to be ignored unless the provider positively has reasonable grounds to infer that a defence may be successfully relied upon.

The OSA does not address the interaction between S.10(3)(b) and S.192 in terms, contenting itself with a cryptic cross-reference to S.192 in the definition of illegal content at S.59(16): 

"See also section 192 (providers' judgements about the status of content)".

The Ofcom consultation implicitly takes the position that awareness (at any rate by a human moderator — see Automated Illegal Content Judgements below) is synonymous with the S.192 standard: 

"When services make an illegal content judgement in relation to particular content and have reasonable grounds to infer that the content is illegal, the content must however be taken down" (Illegal Judgements Guidance Discussion, para 26.14)

Mixed automated-human illegal content judgements

Returning to our hypothetical Code of Practice measure that recommends automated detection and blocking of a particular kind of illegal user content, such a system would appear to involve making a judgement about illegality for the purpose of S.192 regardless of whether a removal obligation under S.10(3)(b) is triggered.

If an automated detection system flags up posts for subsequent human review, the final word on illegality rests with human moderators. Does that mean that their judgement alone constitutes the illegality judgement for the purpose of S.192? Or is the initial automated triage also part of the illegality judgement? S.192 contemplates that ‘a judgement’ may be made by means of ‘automated systems or processes together with human moderators’. That may suggest that the judgement is a combined one, comprising the whole system or process.

If so, does that imply that the initial automated detection, being part of the illegal content judgement process, could not apply a higher threshold than the ‘reasonable grounds to infer’ test stipulated by S.192?
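
The concern can be illustrated schematically. In the sketch below (Python, with invented names and numbers; S.192 does not, of course, express its test as a numerical confidence score), the automated triage stage escalates only content scoring above a ‘likely’ threshold. Content scoring between the notional ‘reasonable grounds to infer’ level and that higher triage threshold is never put to a human moderator at all.

    # Schematic only: the scores and thresholds are invented, and S.192 does not
    # frame "reasonable grounds to infer" numerically.

    TRIAGE_THRESHOLD = 0.9     # hypothetical: automated stage escalates only high-confidence hits
    REASONABLE_GROUNDS = 0.6   # hypothetical stand-in for the S.192 standard

    def automated_score(text: str) -> float:
        # Hypothetical classifier returning a confidence that the content is illegal.
        return 0.75 if "suspect phrase" in text.lower() else 0.1

    def combined_judgement(text: str) -> str:
        score = automated_score(text)
        if score >= TRIAGE_THRESHOLD:
            # Escalated: a human moderator then applies the S.192 approach.
            return "escalate to human moderator"
        # Not escalated. If the triage threshold sits above the 'reasonable
        # grounds to infer' level, content in the gap (here 0.6 <= score < 0.9)
        # is never assessed by anyone - the compliance question raised above.
        return "no action"

    print(combined_judgement("a post containing the suspect phrase"))  # falls into the gap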

That question assumes (as does S.192 itself) that it is possible to embed within any given technology an inference threshold articulated in those terms; which brings us to our next topic.

Automated illegal content judgements

One of the most perplexing aspects of the OSA has always been how an automated system, operating in real time on limited available information, can make accurate judgements about illegality or apply the methodology laid down in S.192, such as determining whether it has reasonable grounds to make inferences about the existence of facts or the state of mind of users.

Undaunted, S.192 contemplates that illegality judgements may be fully automated: 

“... whether a judgement is made by human moderators, by means of automated systems or processes or by means of automated systems or processes together with human moderators."

The OSA requires Ofcom to provide Guidance to service providers about making illegality judgements. It has produced a draft document, running to 390 pages, setting out how the S.192 criteria should be applied to every priority offence and a few non-priority offences.

Ofcom's draft Guidance appears to assume that illegality judgements will be made by human moderators (and implicitly to equate awareness under S.10(3)(b) with reasonable grounds to infer under S.192): 

"The process of making an illegal content judgement, as set out in the Illegal Content Judgement Guidance, presupposes that the content in question has been brought to the attention of a moderator making such a judgement, and as a result [the S.10(3)(b) awareness] requirement is fulfilled." (Illegal Judgements Guidance Discussion, para 26.14 fn 5)

Human involvement may be a reasonable assumption where decisions are reactive.  However, Ofcom has included in its draft Codes of Practice proactive prevention recommendations that are either automated or at least encompass the possibility of fully automated blocking or removal.

Annex 15 to the consultation discusses the design of various kinds of automated detection, but does not address the possibility that any of them involves making an illegal content judgement covered by S.192.

In apparent contrast with the human moderation assumed in the footnote quoted above, the Illegal Content Judgements Guidance also describes itself as 'technology-agnostic'.

“26.38 Our draft guidance therefore proposes a ‘technology-agnostic approach’ to reasonably available information and to illegal content judgements in general. We have set out which information we believe is reasonably available to a service, regardless of technology used to collect it, on an offence-by-offence basis. It is our understanding that, while automated tools could be used to collect more of this information or to do so more quickly, there is no additional class of information which automated tools could have access to that human moderators could not. We therefore take the view that information may be collected using any approach the service prefers, so long as when it is factored into an illegal content judgement, this is done in a way which allows a reasonable inference to be made.”

and: 

"A1.42 We have recommended three automated content technologies in our Codes of Practice; hashing technology recognising child sexual abuse material; URL detection technology recognising URLs which have previously been identified as hosting child sexual abuse material (CSAM); and search to detect content containing keywords strongly associated with the sale of stolen credentials (i.e. articles for use in fraud). These technologies do not offer an additional class of information that human moderators could not. We therefore take a 'technology-agnostic approach' to illegal content judgements."

The usual concern about reasonably available information, however, is not that automated content moderation technologies will have additional information available to them compared with human moderators, but that they will tend to have less. Moreover, they will be required to make decisions based on that information on the fly, in real time. Consequently such decisions are liable to be less accurate than those of human moderators, even if automated technology could be regarded as otherwise equivalent to a human being in its ability to make judgements.

The thinking may be that since the elements of a given offence, and the evidence required to establish reasonable grounds to infer, are in principle the same regardless of whether illegality judgements are made by automated systems or human beings, there is no need to differentiate between the two in the Guidance.

However, it seems artificial to suggest (if that is what is being said) that automated illegality judgements do not give rise at least to practical, and quite likely deeper, issues that differ from those raised by human judgements. The “technology-agnostic” label is not, in truth, a good description. The draft guidance may be agnostic, but if so the agnosticism is as to whether the judgement is made by a human being or by technology. That is a quite different matter. 

Ofcom’s automated moderation recommendations

This brings us to Ofcom's specific automated moderation recommendations. Do any of them involve making illegal content judgements to which S.192 would apply? For simplicity this discussion focuses on U2U service recommendations, omitting search engines.

To recap, Ofcom recommends three kinds of U2U automated detection and blocking or removal of illegal content (although for different categories of service in each case): 

• Perceptual hash matching against a database of known CSAM (draft U2U Code of Practice, A4.23)

• URL matching against a list of known CSAM URLs (draft U2U Code of Practice, A4.37)

• Fuzzy keyword matching to detect articles for use in fraud (draft U2U Code of Practice, A4.45)

Each of these recommendations envisages that at least some moderation decisions will be taken without human involvement.

For CSAM perceptual hash matching the draft Code of Practice provides that the provider should ensure that human moderators are used to review "an appropriate proportion" of content detected as CSAM. The remainder, implicitly, would be swiftly taken down or blocked automatically in accordance with draft Code of Practice para A4.24, without human review. The draft Code of Practice sets out how a service provider should go about deciding what proportion of detected content it is appropriate to review.
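
In outline, such a system might behave along the following lines. This is a sketch only: the hash function, the hash database and the ten per cent review figure are all invented for illustration.

    import random

    KNOWN_HASHES = {"hash_abc", "hash_def"}   # stand-in for a curated CSAM hash database
    REVIEW_PROPORTION = 0.1                   # hypothetical "appropriate proportion" for human review

    def perceptual_hash(image_bytes: bytes) -> str:
        # Stub: a real system would compute a perceptual hash designed to be
        # robust to small alterations of the image.
        return "hash_" + image_bytes[:3].decode(errors="ignore")

    def handle_image_upload(image_bytes: bytes) -> str:
        if perceptual_hash(image_bytes) not in KNOWN_HASHES:
            return "allowed"
        if random.random() < REVIEW_PROPORTION:
            return "queued for human review"
        # The remainder is taken down with no human involvement - the point at
        # which, arguably, a fully automated illegality judgement is being made.
        return "removed automatically"

    print(handle_image_upload(b"abc-example-image-bytes"))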

For CSAM URL matching the draft Code of Practice contains no provision for human review.
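
On its face, then, the URL matching measure reduces to a purely mechanistic lookup with no human step at any point, along these lines (the list contents are placeholders; in practice the list would be supplied and maintained by a third party):

    # Illustrative only: a set-membership check against a third-party list of
    # URLs previously identified as hosting CSAM.
    KNOWN_CSAM_URLS = {"https://example.invalid/known-1", "https://example.invalid/known-2"}

    def handle_posted_url(url: str) -> str:
        if url in KNOWN_CSAM_URLS:
            return "blocked"      # no human review stage in the draft measure
        return "allowed"

Even in that stripped-down form, the system exists for no purpose other than to separate lawful from unlawful content, which is the nub of the argument developed below.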

For fraud detection using fuzzy keyword matching the draft U2U Code of Practice requires the provider to consider the detected content in accordance with its internal content policies. The consultation explains that: 

"… all large services and those that have assessed themselves as having a medium or high risk for any type of offence should set internal content policies which specify how content moderation systems and processes moderate content and resource them accordingly." [14.230] fn 254.

Such policies could include automatic takedown of detected items. Whilst Ofcom say that “we are not recommending that services take down all content detected by the technology” ([14.249]), such action is within the range of the recommended measure. 

"Implementations that substantially impact on freedom of expression, including the automatic take down of detected content, could be in accordance with the measure in our Code of Practice.” [14.283]

The reliance on internal moderation policies appears to be intended to provide services with discretion as to what steps to take with automatically detected content: 

"… whether or not such content were, incorrectly, subject to takedown would depend on the approach, to content moderation adopted by the service, rather than the content's detection by the keyword detection technology in and of itself." [14.284]

Whilst the draft Code of Practice provides for human review of a reasonable sample of detected content, that appears to be a periodic, after-the-event review rather than part of the decision-making process.
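
Drawing the threads of the fuzzy keyword measure together, a provider's implementation might look schematically like this. The keyword list, the matching logic and the auto-takedown setting are all hypothetical; the choice of what to do with detected content appears, on the consultation's approach, to be left to the provider's internal content policies.

    import difflib

    FRAUD_KEYWORDS = ["fullz", "cvv dump", "bank logs"]   # invented examples
    AUTO_TAKEDOWN = True          # a policy choice the draft measure appears to leave open
    REVIEW_SAMPLE_LOG = []        # retained for periodic, after-the-event sample review

    def fuzzy_match(text: str) -> bool:
        # Crude stand-in for fuzzy keyword detection: sliding-window similarity.
        lowered = text.lower()
        return any(
            difflib.SequenceMatcher(None, kw, lowered[i:i + len(kw)]).ratio() > 0.8
            for kw in FRAUD_KEYWORDS
            for i in range(max(1, len(lowered) - len(kw) + 1))
        )

    def apply_internal_policy(post_text: str) -> str:
        if not fuzzy_match(post_text):
            return "allowed"
        REVIEW_SAMPLE_LOG.append(post_text)
        if AUTO_TAKEDOWN:
            # Within the range of the recommended measure: removal with no human
            # involvement before the decision takes effect.
            return "removed automatically"
        return "queued for human review"

    print(apply_internal_policy("selling fresh fullz and bank logs"))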

Do any of these three recommended systems and processes involve a S.192 judgement "by the provider" as to whether the detected user content is illegal?

Even for URL matching, where the detection and removal or blocking process is entirely mechanistic, the answer is at least arguably yes. It would be quite odd if the fact that a provider is relying on a pre-verified third party list of URLs meant that the provider was not making an illegality judgement, given that the very purpose of the overall system or process is to distinguish between legal and illegal content.

The same argument applies to perceptual hashing, but more strongly since there is an element of judgement involved in the technical detection process as well as in compiling the list or database.

The fuzzy keyword fraud detection recommendation is more obviously about making judgements. The draft Code of Practice recommends that fuzzy keyword technology should be used to assess whether content is 'likely' to amount to an offence (although elsewhere in the Consultation Ofcom uses the phrase 'reason to suspect'). If so, an item of content would then be considered in accordance with the provider's internal policies.

Where in the process an illegality judgement is being made could vary depending on the provider's policy. If detected content is submitted for human review, then it may be plausible to say that the illegality judgement is being made by the human moderator, who should make the decision in accordance with the 'reasonable grounds to infer' approach set out in S.192 and any relevant data protection considerations.

Alternatively, and as already discussed perhaps more in keeping with the language of S.192, the sequential automated and human elements of the process could be seen as all forming part of one illegality judgement. If so, then we could ask how Ofcom’s suggested ‘likely’ standard for the initial automated detection element compares with S.192’s ‘reasonable grounds to infer’. If it sets a higher threshold, is the system or process compliant with S.192?

If detected content is not submitted for human review, the answer to where the illegality judgement is being made could depend on what processes ensue. If takedown of detected content is automatic, that would suggest that the initial triage constituted the illegality judgement. If other technical processes are applied before a final decision, then it may be the final process, or perhaps the overall combination, that constitutes the illegality judgement. Either way it is difficult to see why an illegality judgement is not being made and why the S.192 provisions would not apply.

It must be at least arguable that, where automatic removal of automatically detected user content is within the range of actions contemplated by a Code of Practice recommendation, either an illegality judgement governed by S.192 is being made at some point in the process or the process as a whole constitutes such a judgement.

Nevertheless, neither the draft Illegal Judgements Guidance nor the draft Codes of Practice address the potential interaction of S.192 (and perhaps S.10(3), depending on its interpretation) with automated illegality judgements.
