Monday 9 September 2024

Illegal content judgements - fitting the Illegal Harms consultation to the Online Safety Act

This is Part 5 of a series of reflections on Ofcom’s Illegal Harms Consultation under the Online Safety Act (OSA). Ofcom’s consultation (which closed in February 2024) ran to a mammoth 1728 pages, plus an additional 77 pages in its recent further consultation on torture and animal cruelty. The results of its consultation are expected in December.

For readers not fully conversant with the OSA, the reason why Ofcom has to consult at all is that the OSA sets out most of the illegal content service provider duties in stratospherically high-level terms, anticipating that Ofcom will bring the obligations down to earth by means of concretely articulated Codes of Practice and Guidance.  If the Act were an algorithm, this would be a non-deterministic process: there is no single answer to the question of how the high-level duties should be translated into detailed measures. The number and range of possibilities are as good as infinite.

The main contributor to this state of affairs is the way in which the Act frames the service providers’ duties as requirements to put in place “proportionate” systems and processes designed to achieve stipulated aims.  That leaves tremendous latitude for debate and judgement. In simple terms, Ofcom’s task is to settle on a set of systems and processes that it considers to be proportionate, then embody them in concrete Codes of Practice, recommended measures and guidance.  Those proposed documents, among other things, are what Ofcom has been consulting on.

Of course Ofcom does have to work within the statutory constraints of the Act. It cannot recommend measures that stray outside the boundaries of the Act.  The measures that it does recommend should interact sensibly with the duties defined in the Act. For abstractly expressed duties, that presents little problem. However, a tightly drawn statutory duty could have the potential to collide with specific measures recommended by Ofcom.

Awareness of illegality

One such duty is Section 10(3)(b). This requires a U2U service provider to have proportionate systems and processes in place designed swiftly to take down illegal content upon becoming aware of it. This is a relatively concrete duty, verging on an absolute takedown obligation (see discussion in Part 3 of this series).

A service provider will therefore need to understand whether - and if so at what point – the takedown obligation kicks in when it is implementing Ofcom’s operational recommendations and guidance. That turns on whether the service provider has ‘become aware’ of the presence of illegal content.

Behind that innocuous little phrase, however, lie significant issues of interpretation. For instance, if an automated system detects what it thinks is illegal content, does that trigger the Section 10(3)(b) takedown duty? Or is it triggered only when a human becomes aware? If human knowledge is necessary, how does that square with Section 192, which requires a provider to treat content as illegal if it has reasonable grounds to infer illegality – and which specifically contemplates fully automated systems making illegality judgements?

Ofcom’s consultation does not spell out in terms what interpretations have been assumed for the purposes of the draft Codes of Practice, Guidance and other documents that Ofcom is required to produce. It is thus difficult to be sure how some aspects of the proposed recommended measures are intended to mesh with S.10(3)(b) of the Act.

This table lists out the questions of interpretation of S.10(3)(b). 

| S.10(3)(b) duty | Interpretation question | Significance |
| --- | --- | --- |
| “A duty to operate a service using proportionate systems and processes designed to … where the provider is alerted by a person to the presence of any illegal content, or becomes aware of it in any other way, swiftly take down such content.” | Does “becomes aware” mean that a human being has to be aware? | Some of Ofcom’s recommendations involve automated detection methods. Could the swift takedown duty kick in during automated detection, or does it apply only if the content is passed on to a human moderator? |
| | Does “aware” mean the same as “reasonable grounds to infer” in S.192(5) and (6) (illegal content judgements)? | If the provider has reasonable grounds to infer that content is illegal, it must treat it as such (S.192(5)). Does that mean it must swiftly take it down under S.10(3)(b), or does “aware” set a different threshold? |
| | If “aware” means the same as “reasonable grounds to infer” in S.192(5) and (6), is the answer to the ‘human being’ question affected by the fact that S.192 expressly contemplates that a judgement may be made by automated systems alone? | |

It is also noteworthy that the obligation under Section 66 to refer previously undetected and unreported CSEA content to the National Crime Agency is triggered by the provider becoming ‘aware’ of the content – again, not further defined. In the context of S.66, the Information Commissioner in its submission to the Ofcom Illegal Harms consultation observed: 

“Our reading of measure 4G is that it could allow for the content moderation technology to be configured in such a way that recognises that false positives will be reported to the NCA. Whilst we acknowledge that it may not be possible to completely eliminate false positives being reported, we are concerned that a margin for error could be routinely “factored into” a service’s systems and processes as a matter of course. This is unlikely to be compatible with a service taking all reasonable steps to ensure that the personal data it processes is not inaccurate.

We therefore consider that services should be explicitly required to take into account the importance of minimising false positives being reported to the NCA.”

Human awareness only? Consider a hypothetical Code of Practice measure that recommends automated detection and blocking of a particular kind of illegal user content. Can detection by an automated system constitute the service provider becoming aware of it, or (as an English court in McGrath v Dawkins, a case concerning the eCommerce Directive hosting shield, appears to have held) only if a human being is aware?

If the latter, then Ofcom's hypothetical recommendation will not interact with S.10(3)(b). If the former, then the possibility that the S.10(3)(b) removal obligation would be triggered during automated detection has to be factored in. The Ofcom consultation is silent on the point.

Awareness threshold

Relatedly, what is the threshold for awareness of illegal content? S.10(3)(b) has similarities to the eCommerce Directive hosting liability shield. Eady J said of that provision: “In order to be able to characterise something as ‘unlawful’ a person would need to know something of the strength or weakness of available defences” (Bunt v Tilley). Has that standard been carried through to S.10(3)(b)? Or does the standard defined in S.192 OSA apply?

S.192 stipulates the approach to be taken where a system or process operated or used by a provider of a service for the purpose of compliance with duties under the Act involves a judgement by a provider about whether content is illegal content: 

"In making such judgements, the approach to be followed is whether a provider has reasonable grounds to infer that content is content of the kind in question (and a provider must treat content as content of the kind in question if reasonable grounds for that inference exist)."

In marked contrast to Eady J's interpretation of the eCommerce Directive hosting shield, S.192 goes on to say that the possibility of a defence is to be ignored unless the provider positively has reasonable grounds to infer that a defence may be successfully relied upon.

The OSA does not address the interaction between S.10(3)(b) and S.192 in terms, contenting itself with a cryptic cross-reference to S.192 in the definition of illegal content at S.59(16): 

"See also section 192 (providers' judgements about the status of content)".

The Ofcom consultation implicitly takes the position that awareness (at any rate by a human moderator — see Automated Illegal Content Judgements below) is synonymous with the S.192 standard: 

"When services make an illegal content judgement in relation to particular content and have reasonable grounds to infer that the content is illegal, the content must however be taken down" (Illegal Judgements Guidance Discussion, para 26.14)

Mixed automated-human illegal content judgements.

Returning to our hypothetical Code of Practice measure that recommends automated detection and blocking of a particular kind of illegal user content, such a system would appear to involve making a judgement about illegality for the purpose of S.192 regardless of whether a removal obligation under S.10(3)(b) is triggered.

If an automated detection system flags up posts for subsequent human review, the final word on illegality rests with human moderators. Does that mean that their judgement alone constitutes the illegality judgement for the purpose of S.192? Or is the initial automated triage also part of the illegality judgement? S.192 contemplates that ‘a judgement’ may be made by means of ‘automated systems or processes together with human moderators’. That may suggest that a combined judgement comprises the whole system or process.

If so, does that imply that the initial automated detection, being part of the illegal content judgement process, could not apply a higher threshold than the ‘reasonable grounds to infer’ test stipulated by S.192?

That question assumes (as does S.192 itself) that it is possible to embed within any given technology an inference threshold articulated in those terms; which brings us to our next topic.

Automated illegal content judgements

One of the most perplexing aspects of the OSA has always been how an automated system, operating in real time on limited available information, can make accurate judgements about illegality or apply the methodology laid down in S.192: such as determining whether it has reasonable grounds to make inferences about the existence of facts or the state of mind of users.

Undaunted, S.192 contemplates that illegality judgements may be fully automated: 

“... whether a judgement is made by human moderators, by means of automated systems or processes or by means of automated systems or processes together with human moderators."

The OSA requires Ofcom to provide Guidance to service providers about making illegality judgements. It has produced a draft document, running to 390 pages, setting out how the S.192 criteria should be applied to every priority offence and a few non-priority offences.

Ofcom's draft Guidance appears to assume that illegality judgements will be made by human moderators (and implicitly to equate awareness under S.10(3)(b) with reasonable grounds to infer under S.192): 

"The process of making an illegal content judgement, as set out in the Illegal Content Judgement Guidance, presupposes that the content in question has been brought to the attention of a moderator making such a judgement, and as a result [the S.10(3)(b) awareness] requirement is fulfilled." (Illegal Judgements Guidance Discussion, para 26.14 fn 5)

Human involvement may be a reasonable assumption where decisions are reactive.  However, Ofcom has included in its draft Codes of Practice proactive prevention recommendations that are either automated or at least encompass the possibility of fully automated blocking or removal.

Annex 15 to the consultation discusses the design of various kinds of automated detection, but does not address the possibility that any of them involves making an illegal content judgement covered by S.192.

In apparent contrast with the human moderation assumed in the footnote quoted above, the Illegal Content Judgements Guidance also describes itself as 'technology-agnostic'.

“26.38 Our draft guidance therefore proposes a ‘technology-agnostic approach’ to reasonably available information and to illegal content judgements in general. We have set out which information we believe is reasonably available to a service, regardless of technology used to collect it, on an offence-by-offence basis. It is our understanding that, while automated tools could be used to collect more of this information or to do so more quickly, there is no additional class of information which automated tools could have access to that human moderators could not. We therefore take the view that information may be collected using any approach the service prefers, so long as when it is factored into an illegal content judgement, this is done in a way which allows a reasonable inference to be made.”

and: 

"A1.42 We have recommended three automated content technologies in our Codes of Practice; hashing technology recognising child sexual abuse material; URL detection technology recognising URLs which have previously been identified as hosting child sexual abuse material (CSAM); and search to detect content containing keywords strongly associated with the sale of stolen credentials (i.e. articles for use in fraud). These technologies do not offer an additional class of information that human moderators could not. We therefore take a 'technology-agnostic approach' to illegal content judgements."

The usual concern about reasonably available information, however, is not that automated content moderation technologies will have additional information available to them compared with human moderators, but that they will tend to have less. Moreover, they will be required to make decisions based on that information on the fly, in real time. Consequently such decisions are liable to be less accurate than those of human moderators, even if automated technology could be regarded as otherwise equivalent to a human being in its ability to make judgements.

The thinking may be that since the elements of a given offence, and the evidence required to establish reasonable grounds to infer, are in principle the same regardless of whether illegality judgements are made by automated systems or human beings, there is no need to differentiate between the two in the Guidance.

However, it seems artificial to suggest (if that is what is being said) that automated illegality judgements do not give rise at least to practical, and quite likely deeper, issues that differ from those raised by human judgements. The “technology-agnostic” label is not, in truth, a good description. The draft guidance may be agnostic, but if so the agnosticism is as to whether the judgment is made by a human being or by technology. That is a quite different matter. 

Ofcom’s automated moderation recommendations

This brings us to Ofcom's specific automated moderation recommendations. Do any of them involve making illegal content judgements to which S.192 would apply? For simplicity this discussion focuses on U2U service recommendations, omitting search engines.

To recap, Ofcom recommends three kinds of U2U automated detection and blocking or removal of illegal content (although for different categories of service in each case): 

• Perceptual hash matching against a database of known CSAM material (draft U2U Code of Practice, A4.23)

• URL matching against a list of known CSAM URLs (draft U2U Code of Practice, A4.37)

• Fuzzy keyword matching to detect articles for use in fraud (draft U2U Code of Practice, A4.45)

Each of these recommendations envisages that at least some moderation decisions will be taken without human involvement.

For CSAM perceptual hash matching the draft Code of Practice provides that the provider should ensure that human moderators are used to review "an appropriate proportion" of content detected as CSAM. The remainder, implicitly, would be swiftly taken down or blocked automatically in accordance with draft Code of Practice para A4.24, without human review. The draft CoP sets out how a service provider should go about deciding what proportion of detected content it is appropriate to review.
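By way of illustration only, the following Python sketch shows the shape of such a process: detected hash matches are split between sampled human review and automatic takedown. The function names and the 20% review proportion are invented rather than drawn from the draft Code of Practice, and real perceptual hash matching involves a similarity threshold (discussed further below) rather than the pre-flagged matches assumed here.

```python
import random

# Hypothetical "appropriate proportion" of detected matches routed to human review.
REVIEW_PROPORTION = 0.2

def handle_detected_match(content_id: str, review_queue: list[str], takedown_log: list[str]) -> None:
    """Illustrative only: once content has been flagged as a hash match,
    either sample it for human review or take it down automatically."""
    if random.random() < REVIEW_PROPORTION:
        # Sampled for human review before any action is taken.
        review_queue.append(content_id)
    else:
        # The remainder is actioned without human involvement - the route that
        # raises the questions about "awareness" and S.192 discussed above.
        takedown_log.append(content_id)
```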

For CSAM URL matching the draft Code of Practice contains no provision for human review.

For fraud detection using fuzzy keyword matching the draft U2U Code of Practice requires the provider to consider the detected content in accordance with its internal content policies. The consultation explains that: 

"… all large services and those that have assessed themselves as having a medium or high risk for any type of offence should set internal content policies which specify how content moderation systems and processes moderate content and resource them accordingly." [14.230] fn 254.

Such policies could include automatic takedown of detected items. Whilst Ofcom say that “we are not recommending that services take down all content detected by the technology” ([14.249]), such action is within the range of the recommended measure. 

"Implementations that substantially impact on freedom of expression, including the automatic take down of detected content, could be in accordance with the measure in our Code of Practice.” [14.283]

The reliance on internal moderation policies appears to be intended to provide services with discretion as to what steps to take with automatically detected content: 

"… whether or not such content were, incorrectly, subject to takedown would depend on the approach, to content moderation adopted by the service, rather than the content's detection by the keyword detection technology in and of itself." [14.284]

Whilst the draft Code of Practice provides for human review of a reasonable sample of detected content, that appears to be a periodic, after-the-event review rather than part of the decision-making process.

Do any of these three recommended systems and processes involve a S.192 judgement "by the provider" as to whether the detected user content is illegal?

Even for URL matching, where the detection and removal or blocking process is entirely mechanistic, the answer is at least arguably yes. It would be quite odd if the fact that a provider is relying on a pre-verified third party list of URLs meant that the provider was not making an illegality judgement, given that the very purpose of the overall system or process is to distinguish between legal and illegal content.

The same argument applies to perceptual hashing, but more strongly since there is an element of judgement involved in the technical detection process as well as in compiling the list or database.

The fuzzy keyword fraud detection recommendation is more obviously about making judgements. The draft Code of Practice recommends that fuzzy keyword technology should be used to assess whether content is 'likely' to amount to an offence (although elsewhere in the Consultation Ofcom uses the phrase 'reason to suspect'). If so, an item of content would then be considered in accordance with the provider's internal policies.

Where in the process an illegality judgement is being made could vary depending on the provider's policy. If detected content is submitted for human review, then it may be plausible to say that the illegality judgement is being made by the human moderator, who should make the decision in accordance with the 'reasonable grounds to infer' approach set out in S.192 and any relevant data protection considerations.

Alternatively, and as already discussed perhaps more in keeping with the language of S.192, the sequential automated and human elements of the process could be seen as all forming part of one illegality judgement. If so, then we could ask how Ofcom’s suggested ‘likely’ standard for the initial automated detection element compares with S.192’s ‘reasonable grounds to infer’. If it sets a higher threshold, is the system or process compliant with S.192?

If detected content is not submitted for human review, the answer to where the illegality judgement is being made could depend on what processes ensue. If takedown of detected content is automatic, that would suggest that the initial triage constituted the illegality judgement. If other technical processes are applied before a final decision, then it may be the final process, or perhaps the overall combination, that constitutes the illegality judgement. Either way it is difficult to see why an illegality judgement is not being made and why the S.192 provisions would not apply.
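The point can be made concrete with a minimal illustrative sketch. Everything in it is hypothetical: the crude scoring function, the 0.8 threshold standing in for ‘likely’, and the policy switch determining whether detected items go to human review or straight to takedown. It is intended only to show where, structurally, an illegality judgement might be located, not to represent any real moderation system.

```python
# All names, scores and thresholds below are invented for illustration;
# they are not taken from Ofcom's draft Code of Practice.
DETECTION_THRESHOLD = 0.8     # hypothetical stand-in for "likely to amount to an offence"
HUMAN_REVIEW_ENABLED = False  # an internal-policy choice the measure leaves to the provider

def fuzzy_keyword_score(text: str, keywords: list[str]) -> float:
    """Crude illustrative scorer: the fraction of listed keywords present in the text."""
    if not keywords:
        return 0.0
    text_lower = text.lower()
    return sum(1 for kw in keywords if kw.lower() in text_lower) / len(keywords)

def moderate(text: str, keywords: list[str], review_queue: list[str], removed: list[str]) -> None:
    score = fuzzy_keyword_score(text, keywords)
    if score < DETECTION_THRESHOLD:
        return  # not detected: the content never enters the judgement process
    if HUMAN_REVIEW_ENABLED:
        review_queue.append(text)  # the illegality judgement arguably rests with the human moderator
    else:
        removed.append(text)       # automatic takedown: the initial triage looks like the judgement itself
```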

It must be at least arguable that, where automatic removal of automatically detected user content is within the range of actions contemplated by a Code of Practice recommendation, an illegality judgement governed by S.192 is being made at some point in the process, or that the process as a whole constitutes such a judgement.

Nevertheless, neither the draft Illegal Judgements Guidance nor the draft Codes of Practice address the potential interaction of S.192 (and perhaps S.10(3), depending on its interpretation) with automated illegality judgements.

Wednesday 24 July 2024

The Online Safety Act: proactive illegality duties, safeguards and proportionality

Part 4 of a short series of reflections on Ofcom’s Illegal Harms consultation under the Online Safety Act 2023 (OSA). 

A significant proportion of the consultation’s discussion of Ofcom's proposed Code of Practice recommendations — especially those involving proactive monitoring and detection of illegal content — is taken up with enumerating and evaluating safeguards to accompany each recommended measure.

That is to be expected, for two reasons. First, the OSA itself provides in Schedule 4 that measures recommended in a Code of Practice must be designed in the light of the importance of protecting the privacy of users and the right of users to freedom of expression within the law, and (where appropriate) incorporate safeguards for the protection of those matters.

Second, the potential interference with users' fundamental rights (notably freedom of expression and privacy) brings into play the European Convention on Human Rights (ECHR) and the Human Rights Act (which, following the UK's recent general election, we can assume will be with us for the foreseeable future).

The first step in the ECHR analysis is to consider whether the interference is “prescribed by law”. This is a threshold condition: if the interference fails that test, it is the end of the matter. When considering whether an interference contained in a statute is prescribed by law, it is not enough that the law has been passed by Parliament and is publicly accessible. It also has to have the “quality of law”: it must be sufficiently clear and precise that someone potentially affected by it can foresee in advance, with reasonable certainty, how the law will apply to them.

Requirements (strictly speaking, in the case of the OSA, Ofcom recommendations) for automated proactive detection, filtering and removal of user content present a particularly high risk of arbitrary interference with, and over-removal of, legal content. They can also be seen as a species of prior restraint. The European Court of Human Rights observed in Yildirim that "the dangers inherent in prior restraints are such that they call for the most careful scrutiny on the part of the Court".

Compatibility with the ECHR operates at two levels: the legislative measure and individual decisions taken under it. A court will regard itself as well placed, since it has the facts to hand, to determine whether an individual decision is or is not a justified interference with a Convention right. It is far less willing to declare a legislative measure per se incompatible, unless it is clear that when applied in practice it will result in a breach of Convention rights in most or all cases. If the measure is capable of being operated in a way that does not breach the Convention, then it will not be per se incompatible.

However, there is an important rider: the UK courts have said that in order to protect against arbitrary interference there must be safeguards which have the effect of enabling the proportionality of the interference to be adequately examined.

In the case of legislation such as the OSA, where the Act frames the duties at a very high level and a regulator is authorised to flesh them out, the necessary safeguards have to be provided in Ofcom's Codes of Practice and its statutory guidance. If such safeguards are not provided, or if they are not sufficient, then the regime will fall at the first Convention hurdle of not being prescribed by law. The ECHR compatibility of the regime on this score is thus heavily dependent on Ofcom's work product.

Much judicial ink has been expended on explaining the precise underlying rationale for the “capable of being adequately examined” test. It is safest to regard it as an aspect of the prescribed by law (a.k.a. “legality”) principle: the reason why legislation must be reasonably clear and precise is in order to prevent arbitrariness and the abuse of imprecise rules or unfettered discretionary powers. If the impact of the scheme is foreseeable, then its proportionality is capable of being assessed. If its impact across the board is not discernible, then its impact will be arbitrary.

Lady Hale said in the Supreme Court case of Gallagher:

“The foundation of the principle of legality is the rule of law itself - that people are to be governed by laws not men. They must not be subjected to the arbitrary - that is, the unprincipled, whimsical or inconsistent - decisions of those in power. 

This means, first, that the law must be adequately accessible and ascertainable, so that people can know what it is; and second, that it must be sufficiently precise to enable a person - with legal advice if necessary - to regulate his conduct accordingly. The law will not be sufficiently predictable if it is too broad, too imprecise or confers an unfettered discretion on those in power. 

This is a separate question from whether the law in question constitutes a disproportionate interference with a Convention right - the law in question must contain safeguards which enable the proportionality of the interference to be adequately examined. 

This does not mean that the law in question has to contain a mechanism for the review of decisions in every individual case: it means only that it has to be possible to examine both the law itself and the decisions made under it, to see whether they pass the test of being necessary in a democratic society.”

In the final analysis it may be said that safeguards have to provide sufficient protection against arbitrariness.

The courts have stressed that challenging an entire regime ex ante on proportionality grounds presents a high hurdle and will rarely succeed, compared with a challenge by an individual who claims that their rights have been violated in a particular instance. Nevertheless, the safeguards proposed by Ofcom have to pass the prescribed by law test. If they do pass, then the actual proportionality of a given interference can be considered should a case arise.

The impact of the legality requirement and the nature of the required safeguards have to be considered in the light of the triangular structure of the Online Safety Act regime. We are not here dealing with a discretionary power vested in a state official to direct a user to take down their post. The OSA regime places legal obligations on intermediary service providers. The steps that they take to comply with those obligations have the potential to affect users' rights, particularly freedom of expression. 

Foreseeability requires that a user should be able to predict, with reasonable certainty, whether their contemplated online post is liable to be affected by actions taken by a service provider in discharging its obligations under the Act.

The safeguards stipulated by Ofcom should therefore provide the requisite degree of predictability for users in respect of blocking and removal actions to be taken by service providers when carrying out Ofcom's recommended measures.

As regards the consultation’s general approach to ECHR compliance, two points stand out. The first is that there is virtually no discussion of the “prescribed by law” requirement. Its existence is recited in many places, but the substantive discussion of ECHR compatibility proceeds directly to discussion of legitimate aim, necessity and proportionality of the recommended measures. Para 1.14 of the consultation may provide a clue as to why that is:

“In passing the Act, Parliament has set out in legislation the interferences prescribed by law and which it has judged to be necessary in our democratic society.”

Similarly in para 12.64:

“…our starting point is that Parliament has determined that services should take proportionate steps to protect UK users from illegal content. Of course there is some risk of error in them doing this, but that risk is inherent in the scheme of the Act.”

There is possibly a hint here of regarding the fact that Parliament has passed legislation as being sufficient of itself to satisfy the “prescribed by law” requirement. That may be the starting point, but it is not the end point.

The second point is that insofar as Ofcom has focused on the need for clarity and certainty, it has done so from the perspective of providing clarity to service providers. The Act requires this. Schedule 4 provides that the measures described in a Code of Practice must be:

“sufficiently clear, and at a sufficiently detailed level, that providers understand what those measures entail in practice;”

That, however, does not detract from the ECHR requirement that the potential for interference with users’ privacy and freedom of expression must also be reasonably clear and precise.

The two requirements do not necessarily go hand in hand. A provision may be clear as to the amount of discretion that it gives to a service provider, yet unforeseeable in its effect on the freedom of expression of users.

Several aspects of Ofcom's proposed safeguards in relation to automated detection and related takedowns give pause for thought on the question of capability to assess the proportionality of the interference. The recommendations (which would apply only to some services) are: 

  • Perceptual hash matching against a database of known CSAM material (draft U2U Code of Practice, A4.23)
  • URL matching against a list of known CSAM URLs (draft U2U Code of Practice, A4.37)
  • Fuzzy keyword matching to detect articles for use in fraud (draft U2U Code of Practice, A4.45)

The concerns are most apparent with the fraud keyword proposal, albeit they are not entirely absent with CSAM hash and URL matching. 

URL matching presents the fewest challenges. Ofcom's proposed safeguards relate entirely to the process for establishing and securing the list of URLs. They provide that the service provider should source the list from: 

“a person with expertise in the identification of CSAM, and who has arrangements in place to [inter alia] secure (so far as possible) that URLs at which CSAM is present, and domains which are entirely or predominantly dedicated to CSAM, are correctly identified before they are added to the list; to review CSAM URLs on the list, and remove any which are no longer CSAM URLs” [draft Code of Practice, A4.40]

By way of further safeguards, both the person with expertise and the service provider should secure the list from unauthorised access, interference or exploitation (whether by persons who work for the provider or are providing a service to the provider, or any other person).

The reasonable assumption is that the technology is capable of accurately matching detected URLs with the list, such that no further safeguards are required on that score. For concreteness, list matching of this kind reduces to a simple membership check once URLs are normalised into a canonical form; a minimal illustrative sketch follows (the normalisation rules shown are invented, not taken from the consultation).
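```python
from urllib.parse import urlsplit

def normalise(url: str) -> str:
    """Hypothetical canonicalisation: lower-case the host and drop the scheme,
    query, fragment and any trailing slash before comparison."""
    parts = urlsplit(url.strip())
    return parts.netloc.lower() + parts.path.rstrip("/")

def is_listed(url: str, csam_url_list: set[str]) -> bool:
    # Assumes the list itself has been normalised in the same way.
    return normalise(url) in csam_url_list
```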

If there were any concern about adequacy of these safeguards, it would probably be whether "a person with expertise in the identification of CSAM" is sufficiently precisely articulated.

For CSAM hash matching the draft Code of Practice contains equivalent safeguards to URL matching for establishment and security of the hash database. However, further safeguards are required since the recommendation of perceptual hashing introduces an element of judgement into the matching process, with the concomitant risk of false positives and consequent blocking or removal of legal user content.

Here the adequacy of the proposed safeguards may be open to more serious debate. The draft Code of Practice states that the perceptual hashing technology should be configured so that its performance strikes "an appropriate balance between precision and recall".

Precision and recall refer to the incidence of false positives and missed hits: precision is the proportion of detected items that are true positives, while recall is the proportion of illegal items that the system actually detects. There is typically a trade-off: fewer missed hits means more false positives (a short worked illustration appears after the list below). As to what is an appropriate balance between them, the draft Code of Practice stipulates that the provider should ensure that the following matters are taken into account: 

- The risk of harm relating to image-based CSAM, as identified in the risk assessment of the service, and including in particular information reasonably available to the provider about the prevalence of CSAM content on its service.

- The proportion of detected content that is a false positive; and

- The effectiveness of the systems and/or processes used to identify false positives.
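As a purely illustrative worked example of the trade-off (the figures are invented and have no connection to any real system): suppose a test set of 1,000 items contains 100 items of CSAM. A recall-leaning configuration might catch 95 of them while wrongly flagging 45 legal items; a precision-leaning configuration might wrongly flag only 2 legal items while missing 30 of the illegal ones.

```python
def precision_recall(true_pos: int, false_pos: int, false_neg: int) -> tuple[float, float]:
    precision = true_pos / (true_pos + false_pos)  # share of flagged items that really are illegal
    recall = true_pos / (true_pos + false_neg)     # share of illegal items that are caught
    return precision, recall

# Invented figures for the same hypothetical system under two configurations:
print(precision_recall(true_pos=95, false_pos=45, false_neg=5))   # recall-leaning: ~0.68 precision, 0.95 recall
print(precision_recall(true_pos=70, false_pos=2, false_neg=30))   # precision-leaning: ~0.97 precision, 0.70 recall
```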

Annex 15 to the Consultation suggests various further factors that could point towards striking the balance in favour of either precision or recall.

The draft Code of Practice stipulates that human moderators should review “an appropriate proportion” of material detected as CSAM, and sets out principles that the service provider should take into account in deciding what proportion of detected content it is appropriate to review - for instance that the resource dedicated to review of detected content should be proportionate to the degree of accuracy achieved by the perceptual hash matching technology. It also provides various periodic review and record-keeping recommendations.

Annex 15 sets out Ofcom’s reasons (related to differences between perceptual hash technologies) for not setting a threshold which should be used to determine whether an image is a match.
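To see what such a threshold would amount to in practice: perceptual hash comparison typically reduces to a distance score compared against a cut-off, where a lower cut-off produces fewer false positives but misses more near-duplicate images. The sketch below is illustrative only, and the cut-off value is invented.

```python
MATCH_THRESHOLD = 10  # invented cut-off: lower favours precision, higher favours recall

def hamming_distance(hash_a: int, hash_b: int) -> int:
    """Number of differing bits between two (e.g. 64-bit) perceptual hashes."""
    return bin(hash_a ^ hash_b).count("1")

def is_match(candidate_hash: int, known_csam_hashes: list[int]) -> bool:
    return any(hamming_distance(candidate_hash, h) <= MATCH_THRESHOLD for h in known_csam_hashes)
```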

The substantive balancing and proportionality decisions are thus parked firmly on the desk of the service provider. However, neither the draft Code of Practice nor the Act itself contains any indication of what is to be regarded as a proportionate or disproportionate level of interference with legal user content.

The result is that two different service providers could readily apply the stipulated safeguards in equivalent factual situations, follow the prescribed process and reach significantly differing conclusions about what is an appropriate balance between precision and recall, or about what resource should be devoted to human review. Consequently it can be argued that the effect on user content cannot be predicted. That smacks of arbitrariness. 

The safeguards for fuzzy keyword detection of articles for use in fraud are more extensive, as would be expected for a technology that is inherently more likely to throw up false positives. The consultation document points out that the recommendation:

"...differs from our proposed measures regarding CSAM hashing and the detection of CSEA links which focus on the detection of positive matches with content (or URLs that provide access to content) that has already been determined to be illegal." [Annex 15, A15.121]

Unlike with CSAM URL and hash matching, the draft Code of Practice envisages that the service provider may compile its own list of fraud keywords. It contains safeguards around establishment, testing, review and security of the list. It contains equivalent provisions to perceptual hash matching for configuration of the technology so as to strike “an appropriate balance between precision and recall”, stipulating equivalent matters to be taken into account. Ofcom envisages that the safeguards will mean that it will be ‘highly likely’ that a keyword hit will correspond to an offence:

“In light of the above, we would expect any content detected as a result of applying this [keyword technology] measure to be highly likely to amount to an offence concerning articles for use in frauds.” [Volume 4, para 14.249]

It goes on:

“We recognise however that the keyword detection measure we are considering will enable services to identify content about which no prior illegal content judgment or determination has been made and that it may result in false positives. It may identify legitimate content (such as news articles or academic articles) which discuss the supply of articles for use in fraud. It is for this reason that we are not recommending that services take down all content detected by the technology, and are instead recommending that it be considered by services in accordance with their internal content moderation policies.” [ibid]

As with perceptual hash matching the draft Code of Practice provides for after the event periodic human review of some detected content. Whereas for perceptual hash matching this has to be ‘an appropriate proportion’, for fraud detection it has to be ‘a reasonable sample’. Again, it sets out principles to be taken into account in deciding what is a reasonable sample. These bear some similarities to, but are not identical to, those for perceptual hash matching. For instance there is no stipulation that review resource should be proportionate to the degree of accuracy achieved by the technology.

Evaluating the adequacy of the fraud keyword safeguards is complicated by the latitude that the recommendations give service providers as to what kind of action to take following initial keyword detection, and by possible statutory interpretation questions as to whether (and if so in what way) the illegality judgement provisions of S.192 and the swift takedown obligations of S.10(3)(b) apply. 

Ofcom's approach is summarised thus:

"... we do not consider it appropriate to recommend that services swiftly take down all content detected as a positive match by their keyword detection technology, instead we recommend (as discussed below) that the decision on whether or not the content should be taken down should be taken in accordance with their content moderation systems and processes." [Annex 15, A15.122]

This is consistent with Ofcom’s broader policy approach to content moderation:

“Given the diverse range of services in scope of the new regulations, a one-size-fits-all approach to content moderation would not be appropriate. Instead of making very specific and prescriptive proposals about content moderation, we are therefore consulting on a relatively high-level set of recommendations which would allow services considerable flexibility about how to set up their content moderation teams.” [Volume 4, p.18]

 Ofcom continues, in relation to its fraud keyword recommendations:

“Consistent with Chapter 12, we are not persuaded that it would be appropriate to specify in detail how services should configure their content moderation systems and processes to take account of content detected by the keyword detection technology (for example, that there be human moderation of all such content), or the outcomes that those systems and processes should achieve (for example, through detailed KPIs).” [Annex 15, A15.123]

It then says:

“We are proposing in that Chapter that all U2U service providers must have in place content moderation systems or processes designed to take down illegal content swiftly.” [Annex 15, A15.124]

The area in which the keyword recommendations depart most significantly from hash and URL matching is thus in the steps to be taken in respect of positive keyword matches: treating them in accordance with the service provider’s internal content moderation systems and processes.  Ofcom’s approach is not to be prescriptive but to give service providers broad latitude in what steps to take in respect of positive keyword matches.

There is, however, an underlying dilemma. There are significant costs and risks associated with being prescriptive: the interference with a platform’s own rights (e.g. under ECHR Protocol 1, Article 1), the unlikelihood that a single size of straitjacket can fit all in-scope service providers, the risk of prejudicing existing services, the chilling or dampening effect on development of new services, and the greater likelihood that, faced with a prescriptive requirement, service providers will take an over-cautious approach to blocking and removals. 

Yet the less prescriptive the measures, the broader the range of permissible approaches, the less predictable the effect on users and the greater the likelihood of arbitrary interference with user rights. This dilemma is not of Ofcom’s making. It is hardwired into the Act, but it falls to Ofcom to resolve it. It is an unenviable task. It may be impossible.

Specifically in relation to the fraud keyword detection recommendation, Ofcom says:

"... Implementations that substantially impact on freedom of expression, including the automatic take down of detected content, could be in accordance with the measure in our Code of Practice.” [Chapter 14, para 14.283]

and:

"whether or not such content were, incorrectly, subject to takedown would depend on the approach, to content moderation adopted by the service, rather than the content's detection by the keyword detection technology in and of itself." [Chapter 14, paras 14.284, 14.302]

Ofcom acknowledges that:

“There could therefore be variation in the impact on users’ freedom of expression arising from services’ different implementations of the technology and different approaches to moderation and take down of any detected content.” [para 14.283]

Ofcom does not, however, discuss the implications for the “capable of being adequately examined” requirement if those variations are insufficiently foreseeable.

The discussion in Annex 15 contemplates that a service provider might have “no systems and processes in place to identify false positives before content is taken down”. That, it is said, would be a factor leaning towards configuring the system towards greater precision at the expense of recall.

Recommended safeguards for content moderation generally include setting of performance targets, as they relate to accuracy of decision-making; training and materials; and appeals. For performance targets, it is for the service provider to balance the desirability of taking illegal content down swiftly against the desirability of making accurate moderation decisions. As above, different service providers could apply that guidance yet reach significantly different conclusions.

In the context of proportionality Ofcom seeks to diminish the impact on users’ freedom of expression by exempting news publisher content from the fraud keyword matching recommendation (reflecting the Act's exclusion of such content from regulated U2U content). However, that prompts the question of how service providers are to distinguish between news publisher content and the rest, in the context of an automated system: something which raises its own safeguards issues.

Ofcom’s fraud keywords recommendation cross-refers to its Recommendation 4B for large or multi-risk services: that the provider should set and record (but need not necessarily publish) internal content policies setting out rules, standards and guidelines around what content is allowed and what is not, and how policies should be operationalised and enforced. The policies should be drafted such that illegal content (where identifiable as such) is not permitted.    

Recommendation 4A (which is stated not to apply to CSAM perceptual hash and URL matching, but does not exclude fraud keyword detection) also appears potentially relevant to the fraud keyword matching recommendation: the service provider should have systems or processes designed to swiftly take down illegal content of which it is aware (mirroring the statutory obligation in S.10(3)).

Recommendation 4A goes on to provide that, for that purpose, when the provider has reason to suspect that content may be illegal content, the provider should either make an illegal content judgement in relation to the content and, if it determines that the content is illegal, swiftly take it down; or do the same where its terms of service prohibit the type of illegal content in question and the content is in breach. 

Ofcom comments in relation to Recommendation 4A that:

"The design of this option is not prescriptive as to whether services use wholly or mainly human or automated content moderation processes." [Volume 12, para 12.50]

Thus there appears to be a potential four-way interaction between internal content moderation policies, the statutory takedown obligation, the Recommendation 4A takedown recommendation, and the provider's public terms of service. 

How these might mesh with each other is not immediately clear to this reader. In part this could depend on questions of interpretation of the Act, such as whether awareness for purposes of the statutory takedown obligation requires human awareness or can be satisfied by an automated system, and if so whether awareness equates to reasonable grounds to infer under S.192.    

Overall, the scope for arbitrary interference with users’ right to freedom of expression appears to be greater for fraud keyword detection than with CSAM hash and URL matching.

The question of safeguards for proactive, automated detection systems is due to raise its head again. Ofcom has said that it is planning an additional consultation later this year on how automated tools, including AI, can be used to proactively detect illegal content and content most harmful to children – including previously undetected child sexual abuse material.

30 July 2024. Correction to description of 'high hurdle'. 


Monday 22 July 2024

The Online Safety Act illegality duties - a regime about content?

Part 3 of a short series of reflections on Ofcom’s Illegal Harms consultation under the Online Safety Act 2023 (OSA). 

This post analyses the illegality duties created by the OSA. That is not as simple as one might hope. The intrepid reader has to hack their way through a thicket of separately defined, subtly differing, duties. Meanwhile they must try to cope with a proliferation of competing public narratives about what the Act is - or ought to be - doing, such as the suggestion that the regime is really about systems and processes, not content. 

The specific safety duties imposed by the OSA are the logical starting point for an analysis of what the Act requires of service providers. I have categorised the duties in a way which, it is hoped, will illuminate how Ofcom has approached the consultation.  

At the highest level the Act’s illegality duties are either substantive or risk assessment. The substantive duties may require the service provider to take measures affecting the design or operation of the service. The outcomes of the obligatory risk assessment feed into some of the substantive duties.

In my proposed categorisation the substantive service provider duties fall into three categories: content-based, non-content-based and harm-based. A content-based duty is framed solely by reference to illegal content, with no mention of harm. A non-content-based duty makes no mention of either content or harm, but may refer to illegal activity. A harm-based duty is framed by reference to harm arising from or caused by illegal content or activity.  

The tables below divide the various substantive duties into these categories.

The risk assessment duties are a more extensive and complex mixture of duties framed by reference to illegal content, illegal activities and harm.

The differences in framing are significant, particularly for the substantive duties, since they dictate the kinds of measures that could satisfy the different kinds of duty. For instance:

An operational substantive content-based duty is focused on identifying and addressing items of illegal content — such as by reactively removing and taking them down. Recommended measures have to reflect that. (But see discussion below of the difference between a design duty and an operational duty in the context of a proactive, preventive duty.)  

A substantive duty to mitigate risk of harm is less focused. A recommended measure would not necessarily involve evaluating the illegality of content. It could consist of a measure that either does not affect content at all, or does so content-agnostically. A reporting button would be an example of such a measure, as would a cap on the permissible number of reposts for all content across the board.
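A cap of that kind is content-agnostic in the strict sense that the decision never inspects the content itself, as this trivial illustrative sketch (with an invented limit) shows:

```python
MAX_REPOSTS = 50  # invented cap applied uniformly to every item, whatever it contains

def allow_repost(current_repost_count: int) -> bool:
    """Content-agnostic check: only the repost count is consulted, never the content."""
    return current_repost_count < MAX_REPOSTS
```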

But a recommended measure within this category might yet have content-related aspects. It could be suggested, for instance, that in order to mitigate the risk of harm arising from a particular kind of illegal content the provider should identify content of that kind and take recommended steps in relation to it.

Similarly a risk assessment duty could involve the service provider in evaluating individual items of content, even if the duty ranges more widely: 

“Your risk assessment should not be limited to an assessment of individual pieces of content, but rather consider how your service is used overall. In particular, the requirement to assess the risk of the service being used to commit or facilitate priority offences may mean considering a range of content and behaviour that may not amount to illegal content by itself.” [Service Risk Assessment Guidance, Boxout preceding para A5.42.]

It can be seen in the table below that the harm-based risk assessment duties are further divided into content-related and non-content related. The former all make some reference to illegal content in the context of a harm-based duty.

Within the framework of the Act, not all illegality is necessarily harmful (in the Act's defined sense) and not all kinds of harm are in scope. There is a conceptual separation between illegality per se, illegality that causes (or risks causing) harm as defined, and harm that may arise otherwise than from illegality. Those category distinctions have to be borne in mind when analysing the illegality duties imposed by the Act. It bears repeating that when the Act refers to harm in connection with service provider duties it has a specific, limited meaning: physical or psychological harm.

So equipped, we can categorise and tabulate the illegality duties under the Act. For simplicity the table shows only the duties applicable to all U2U service providers, starting with the substantive duties: 

Substantive duties

| Category | Description | OSA section reference |
| --- | --- | --- |
| Content-based | Proportionate measures relating to design or operation of service to prevent individuals encountering priority illegal content by means of the service. | 10(2)(a) |
| | Operate service using proportionate systems and processes designed to minimise the time for which priority illegal content is present. | 10(3)(a) |
| | Operate service using proportionate systems and processes designed to swiftly take down illegal content where the provider is alerted or otherwise becomes aware of its presence. | 10(3)(b) |
| Non-content-based | Proportionate measures relating to design or operation of service to effectively mitigate and manage the risk of the service being used for the commission or facilitation of a priority offence identified in the most recent illegal content risk assessment. | 10(2)(b) |
| Harm-based | Proportionate measures relating to design or operation of service to effectively mitigate and manage the risks of harm to individuals identified in the most recent illegal content risk assessment (see 9(2)(g)). | 10(2)(c) |

 These are the illegality risk assessment duties:

Risk assessment duties

| Category | Description | OSA section reference |
| --- | --- | --- |
| Content-based | Assess level of risk of users encountering illegal content of each kind | 9(5)(b) |
| | Assess level of risk of functionalities facilitating presence or dissemination of illegal content | 9(5)(e) |
| Non-content-based | Assess level of risk of service being used for commission or facilitation of a priority offence | 9(5)(c) |
| | Assess level of risk of functionalities facilitating use of service for commission or facilitation of a priority offence | 9(5)(e) |
| | Assess how the design and operation of the service may reduce or increase the risks identified | 9(5)(h) |
| Harm-based (content-related) | Assess level of risk of harm to individuals presented by illegal content of different kinds | 9(5)(d) |
| | Assess nature and severity of harm that might be suffered by individuals from identified risk of harm to individuals presented by illegal content of different kinds | 9(5)(d) and (g) |
| | Assess nature and severity of harm that might be suffered by individuals from identified risk of individual users encountering illegal content | 9(5)(b) and (g) |
| | Assess nature and severity of harm that might be suffered by individuals from identified risk of functionalities facilitating presence or dissemination of illegal content | 9(5)(e) and (g) |
| Harm-based (non-content-related) | Assess level of risk of harm to individuals presented by use of service for commission or facilitation of a priority offence | 9(5)(d) |
| | Assess nature and severity of harm that might be suffered by individuals from identified level of risk of harm to individuals presented by use of service for commission or facilitation of a priority offence | 9(5)(d) and (g) |
| | Assess nature and severity of harm that might be suffered by individuals from identified risk of service being used for commission or facilitation of a priority offence | 9(5)(c) and (g) |
| | Assess different ways in which the service is used and the impact of such use on the level of risk of harm that might be suffered by individuals | 9(5)(f) |
| | Assess nature and severity of harm that might be suffered by individuals from identified risk of different ways in which the service is used and the impact of such use on the level of risk of such harm | 9(5)(f) and (g) |

The various harm-based illegality risk assessment duties feed into the substantive harm mitigation and management duty of S.10(2)(c). That duty stands independently of the three content-based and one non-content-based illegality duties, as can be seen from the tables and is illustrated in this visualisation.


Content or systems and processes?

In March last year the Ofcom CEO drew a contrast between systems and processes and a content regime, suggesting that the OSA is about the former and not really the latter (see Introduction). 

The idea that the Act is about systems and processes, not content, prompts a close look at the differences in framing of the substantive illegality duties. As can be seen from the table above, the proactive prevention duty in S.10(2)(a) requires the service provider to:

“to take or use proportionate measures relating to the design or operation of the service to… prevent individuals from encountering priority illegal content by means of the service” (emphasis added)

Thus a measure recommended by Ofcom to comply with this duty could be limited to the design of the service, reflecting the ‘safe by design’ ethos mentioned in Section 1(3) of the Act. As such, the S.10(2)(a) duty, although content-based, has no necessary linkage to assessing the illegality of individual items of user content posted to the service. That possibility, however, is not excluded. A recommended proactive measure could involve proactively detecting and blocking individual items, as contemplated by the section’s reference to operational measures.

In contrast, the content removal duty in S.10(3)(b) is specifically framed in terms of use of operational systems and processes: 

“(3) A duty to operate a service using proportionate systems and processes designed to … (b) where the provider is alerted by a person to the presence of any illegal content, or becomes aware of it in any other way, swiftly take down such content” (emphasis added)

In view of this drafting it is not surprising that, notwithstanding the reference to proportionate systems and processes, Ofcom in several places characterises the Act as imposing a simple duty to remove illegal content: 

"When services make an illegal content judgement in relation to particular content and have reasonable grounds to infer that the content is illegal, the content must however be taken down." [para 26.14, Illegal Content Judgements Guidance discussion]

“Within the illegal content duties there are a number of specific duties. As part of the illegal content safety duty at section 10(3)(b) of the Act, there is a duty for a user-to-user service to “swiftly take down” any illegal content when it is alerted to the presence of it (the ‘takedown duty’).” [para A1.14, draft Illegal Content Judgements Guidance]

“As in the case of priority illegal content, services are required to take down relevant nonpriority illegal content swiftly when they are made aware of it.” [para 2.41, Volume 1]

“Where the service is alerted by a person to the presence of any illegal content, or becomes aware of it in any other way, services must swiftly take down such content (section 10(3)(b)).” [para 12.6, Volume 4]

The framing of the S.10(3)(b) duty is tightly focused on the removal actions to be taken by a service provider in relation to individual items of content when it becomes aware of them. The service provider must have proportionate systems and processes designed to achieve the result (take down), and must use them in the operation of the service. 

This duty inescapably requires individual judgements to be made. It does not, however, from a supervision and enforcement point of view, require a service provider to get every individual judgement right.

From the perspective of the service provider the focus of Ofcom’s supervision and enforcement will indeed be on its systems and processes:

“It is important to make clear that, as the regulator, Ofcom will not take a view on individual pieces of online content. Rather, our regulatory approach is to ensure that services have the systems and processes in place to meet their duties.” [Overview, p.17; Volume 4, p.19]

“In line with the wider approach to delivering online safety, supervision will focus on the effectiveness of services’ systems and processes in protecting their users, not on individual pieces of content.” [para 30.5, Volume 6]

That does not mean however, that (as Ofcom appears to suggest in the following extract) the regulatory regime is not about regulating individual user content: 

“It is important to note that the Online Safety regime is about service providers’ safety systems and processes, not about regulating individual content found on such services. The presence of illegal content or content that is potentially harmful to children does not necessarily mean that a service provider is failing to fulfil its duties in the Act. We would not therefore be likely to take action solely based on a piece of harmful content appearing on a regulated service.” [Enforcement Guidance, para A3.6]

The purpose of at least the operational content-based aspects of the OSA regime is to harness the control that intermediary service providers can exercise over their users in order (indirectly) to regulate individual items of user content. The fact that a service provider may not be penalised for an individual misjudgement does not alter the fact that the service provider has to have a system in place to make judgements and that the user in question stands to be affected when their individual post is removed as a result of a misjudgement. The operational content-based duties exist and are about regulating individual content.

Parenthetically, it is also perhaps doubtful whether Ofcom can, in reality, perform its role entirely in the abstract and avoid making its own judgements on individual items of content. For instance, if prevalence of illegal material on a service is relevant to Ofcom’s enforcement priority assessment, how could that be determined without evaluating whether individual items of user content on the service are illegal?

Some may believe that Ofcom's approach is too heavily weighted towards individual content judgements by service providers. However, it is difficult to see how such content-based measures can be avoided when content-based duties — removal and takedown of individual items of illegal content in the operation of the service — are hardwired into the Act, requiring the exercise of judgement to determine whether the content in question is or is not illegal.

That said, in the area of the S.10(2) proactive preventive measures Ofcom has, as already alluded to, much broader scope to assess proportionality and decide what kinds of measures to recommend (or not) and what safeguards should accompany them. That will be the subject of Part 4 of this series.