
[open-regulatory-compliance] Non-CRA: Open Data in the Digital Omnibus proposal
  • From: Felix Reda <felixreda@xxxxxxxxxx>
  • Date: Fri, 28 Nov 2025 16:14:19 +0000

Hi everyone,

On the understanding that the scope of the ORC WG is in principle broader than the CRA, and that it may deal with other policy issues that affect the open source community, I am sharing the following message, which relates to the open licensing of government data. If you feel this is out of scope for the ORC WG, please let me know.

I have identified a potential problem for open licensing in the European Commission’s recently introduced Digital Omnibus proposal, and I would love to hear your views on it. There are two Digital Omnibus proposals, one on AI and one on other digital legislation; I am referring to the non-AI one, the proposal on simplification of the digital legislation. The issue concerns the Open Data Directive, which is supposed to make government data available to and re-usable by the public. I helped negotiate that directive in a previous life, so the topic is dear to me.

The Digital Omnibus aims to reduce the number of data-related legislative acts by repealing the Open Data Directive and instead incorporating its contents in the Data Act, which is a regulation. That is not a bad idea, because regulations are directly applicable law across the EU and do not need to be transposed into national law by the Member States, a process which can often be a cause for confusion and national differences. So I like the basic idea.

However, the European Commission proposes to make two consequential changes to Articles 6 and 8 of the Open Data Directive in the process, which I am including below in track changes (they’re Articles 32q and 32r in the new proposal on simplification of the digital legislation, see pp. 41-42). For simplicity’s sake, I will use data and documents interchangeably in the following analysis; the legislation applies to both.

My reading of these changes is the following. Up until now, for those public sector bodies not mentioned in Art. 32q (2), the re-use of public sector documents had to be free of charge and subject to non-discriminatory license conditions, ideally standard open data licenses such as CC-0, CC-by, or national open data licenses like Datenlizenz Deutschland. While public sector bodies could charge a marginal price for costs that arose in the context of providing the data (for example the work needed to anonymise a document or dataset that contained personal data), they couldn’t charge for the access to the data as such. This was great news for open data, because the use of nondiscriminatory standard licenses that meet the open definition would allow the general public to combine different data sources without risking the kinds of license conflicts that we know all too well from the open source world.

As I read them, the proposed changes below alter this regime as follows:

If the entity requesting re-use of the documents is a very large enterprise, the public sector body can charge it a higher fee. That alone is defensible, given the greater economic power of such enterprises. However, I am concerned that Art. 32q (6), as drafted, does not just allow public sector bodies to charge very large enterprises higher fees than other data users; it also allows them to charge very large enterprises in situations where the same data was previously made available free of charge. I come to this conclusion because Art. 32q (6) states that such charges may cover a range of different costs, together with a reasonable return on investment, *in addition* to any of the charges mentioned in paragraph 1. In other words, there can be charges that are not marginal costs related to making the data available in the first place. That means that the basic principle of paragraph 1, that the re-use of government documents must be free of charge, does not apply to very large enterprises at all.


Article 32q (formerly Article 6)

Principles governing charging for open government data


1.   The re-use of documents within the scope of this Section shall be free of charge. However, the recovery by the public sector body holding the data of the marginal costs incurred for the reproduction, provision and dissemination of such data or documents as well as for anonymisation of personal data and measures taken to protect commercially confidential information may be allowed.

2.   By way of exception, paragraph 1 shall not apply to the following:

  (a) public sector bodies that are required to generate revenue to cover a substantial part of their costs relating to the performance of their public tasks;
  (b) libraries, including university libraries, museums and archives;
  (c) public undertakings.

[…]

5.   Where charges are made by the public sector bodies referred to in paragraph 2, point (b), the total income from supplying and allowing the re-use of data or documents over the appropriate accounting period shall not exceed the cost of collection, production, reproduction, dissemination, data storage, preservation and rights clearance and, where applicable, the anonymisation of personal data and measures taken to protect commercially confidential information, together with a reasonable return on investment. Charges shall be calculated in accordance with the accounting principles applicable to the public sector bodies involved.

6. Public sector bodies may set out higher charges for the re-use of data and documents by very large enterprises than the charges provided for in paragraphs 1, 4 and 5. Any such charges shall be proportionate and based on objective criteria, taking into account the economic power, or the ability of the entity to acquire data, including in particular a designation as a gatekeeper under Regulation (EU) 2022/1925. In addition to the elements listed in paragraph 1 of this Article, such charges may cover the cost of collection, production, reproduction dissemination and data storage and where applicable the cost of anonymisation or measures to protect the confidentiality of the data or documents, together with a reasonable return on investment.

7.   The re-use of the following shall be free of charge for the user:

  (a) subject to Article 32v paragraph (3), (4) and (5), the high-value datasets, as listed in accordance with paragraph 1 of that Article;
  (b) research data referred to in point (c) of Article 1(1)32i.

You may think that’s fair - after all, very large enterprises tend to be very profitable, right? The problem becomes apparent in Art. 32r, which deals with open licenses. Previously, public sector bodies were categorically forbidden from using licenses for public sector data that included discriminatory conditions, which would violate conditions 2.1.6 (non-discrimination) or 2.1.8 (application for any purpose) of the open definition. The use of standard licenses, such as CC-0, was explicitly encouraged.

Allowing public sector bodies to charge very large enterprises for public sector data in all cases conflicts with this open licensing approach. Suppose the same data were provided free of charge under an open license to the general public but were subject to a fee for very large enterprises, even where the public sector body incurred no costs in making the data available. Nothing would then stop a very large enterprise from simply copying the open data from a third-party source that is legally entitled to reproduce it. So in order to be able to charge very large enterprises for the data itself (not for the costs of providing the data, such as bandwidth, anonymisation etc.), the Commission has to abandon the encouragement of standard open licenses and explicitly allow for non-open license conditions, as becomes evident from the proposed changes to Article 32r:

Article 32r (formerly Article 8)
Standard licences
(1) The re-use of data or documents shall not be subject to conditions, unless such conditions are objective, proportionate, non-discriminatory and justified on grounds of a public interest objective.
(2) When re-use is subject to conditions, those conditions shall not unnecessarily restrict possibilities for re-use and shall not be used to restrict competition.
(3) In Member States where licences are used, public sector bodies shall ensure that the standard licences for the re-use of public sector data or documents, which can be adapted to meet particular licence applications, are available in digital format and able to be processed electronically. Member States shall encourage the use of such standard licences.
(4) Public sector bodies may establish special conditions for the re-use of data and documents by very large enterprises. Such conditions shall be proportionate and should be based on objective criteria. They shall be established taking into consideration the economic power, or the ability of the entity to acquire data, including in particular a designation as a gatekeeper under Regulation (EU) 2022/1925.

What does this mean in practice? All but the most open-data-friendly public sector bodies will abandon the use of open licenses such as CC-0 in favor of either offering no standard licenses at all (every entity/person that requests re-use of the public sector data must negotiate its own license), or in favor of new, custom-made non-open standard licenses that distinguish between re-use by very large enterprises and re-use by everybody else.

The Commission may very well believe that this will have no adverse impact on anyone but very large enterprises, but this couldn’t be further from the truth. As we know all too well from the open source context, license incompatibilities are a huge problem. Some open projects, such as Wikipedia, use share-alike provisions. It would be impossible to combine such non-openly licensed public sector documents with openly licensed projects under a share-alike clause: doing so would violate either the special conditions in the government license that restrict re-use by very large enterprises, or the share-alike requirement of the open project. If this new option to establish special conditions for the re-use of public sector data by very large enterprises were used widely by public sector bodies (and I think there would be a lot of economic pressure on them to use this option in the hopes of generating new revenue streams), that could very well be the end of open government data in the EU.

Am I missing something here? Please let me know what you think!

Best,
Felix


