The Paper is a Lie: Practical Data Forensics for Scientific Data (eh23)

The Paper is a Lie: Practical Data Forensics for Scientific Data (eh23)

Author: CCC media team April 5, 2026 Duration: 1:00:08
Recent high-profile cases of scientific fraud have demonstrated that scientists are only human too and sometimes individual scientists falsify their data. Since science is the basis of decision- and policy-making, we all should be able to scrutinise scientific studies. Here, we provide practical guidance and concrete tools for forensically examining scientific data and identifying potential cases of data manipulation. These approaches will enable and empower audience members to independently perform forensic plausibility checks on scientific data, as well as provide a starting point for their further, independent study. Limitations and ethical considerations when performing such analyses will also be discussed. Science is the basis of decision- and policy-making, and it is generally a good idea to "trust the science." Recent high-profile cases of scientific fraud, however, have demonstrated that scientists are only human too, and sometimes, for various reasons, individual scientists falsify their research. Accordingly, we all should be able to scrutinise and independently assess scientific studies, spotting potential cases of manipulation. Perhaps surprisingly, this is often significantly easier to do for the layperson than one may suspect. In this talk, we'll focus on a core aspect of checking scientific studies: the raw data, and their forensic examination. To get started, we begin with a brief introduction to the structure of a typical scientific study and the process of modern scientific publishing. Afterwards, we will take a look at a number of recent high-profile cases of scientific fraud together in a hands-on manner. We will thereby establish practical guidance and concrete statistical tools for identifying potential cases of data manipulation which may warrant further examination. These approaches will enable and empower audience members to independently perform forensic plausibility checks on scientific data, as well as provide a starting point for their further, independent study of additionally provided resources. Since no tool or technique is perfect, we will also talk about the limitations of the presented approaches, as well as ethical considerations when performing such analyses. This talk is directed at everyone with an interest in scientific research and everyone who enjoys critically assessing datasets for plausibility. Prior knowledge in statistics, data science and data visualisation are certainly advantageous, but not required, as all necessary theoretical foundations will be introduced during the talk. This work is licensed under CC BY-NC 4.0. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc/4.0/ about this event: https://pretalx.eh23.easterhegg.eu/eh23/talk/XLDNUG/

Tune into the Chaos Computer Club-recent events feed for a direct line to the forefront of digital culture and critical technology discourse. Curated by the CCC media team, this podcast channels the raw, insightful atmosphere of Europe's most influential hacker association, bringing you recordings from their major gatherings and community events. Each episode is a deep dive into talks and presentations from the last two years, covering topics from cryptography and privacy rights to hardware hacking, societal impacts of surveillance, and open-source philosophy. You'll hear from researchers, activists, and engineers who are actively shaping our digital future, offering perspectives rarely found in mainstream tech conversations. This isn't a produced show with hosts; it's an archival audio stream of genuine conference sessions, complete with audience questions and the spontaneous energy of the live event. For anyone interested in the technical details and ethical debates at the heart of modern technology, this feed serves as an essential, unfiltered resource. Subscribe to this podcast to keep your finger on the pulse of the Chaos Computer Club's ongoing dialogue, where complex ideas are broken down and the tools for a more empowered digital life are openly discussed.
Author: Episodes: 100

Chaos Computer Club - recent events feed
Podcast Episodes
Live-Beaterkennung in Musik (eh23) [not-audio_url] [/not-audio_url]

Duration: 1:06:55
Wie funktioniert eigentlich musikalische Beaterkennung? Ich habe einen Algorithmus entwickelt, um aus Live-Audio-Input ein "Metronom-artiges" Taktsignal (MIDI-Clock) zu erzeugen, das auch bei Tempowechseln noch mitkommt.…
Let's Make All Data Interoperable (eh23) [not-audio_url] [/not-audio_url]

Duration: 31:39
Cory Doctorow argues that fighting the Enshittification of the world involves making data and software more interoperable, in order to reduce switching costs from centralized Big-Tech platforms. But even making data form…
RoboCup: Wettkampf, Forschung und die Zukunft der Robotik (eh23) [not-audio_url] [/not-audio_url]

Duration: 34:37
Seit über 25 Jahren ist der RoboCup eine der wichtigsten Plattformen für Robotikforschung und Nachwuchsförderung. Er kombiniert den spielerischen Wettkampf mit echter Wissenschaft und bringt Schüler:innen, Studierende un…
How I reverse engineered my tablets' camera driver (deutsch) (eh23) [not-audio_url] [/not-audio_url]

Duration: 21:37
In diesem Vortrag beleuchte ich den Prozess des Reverse Engineerings des Kamera-Treibers meines Tablets. Dies beinhaltet einen Deep Dive in die Welt von ACPI, Register-Dumps und die Kernel-Entwicklung. -- In this talk I…
Nützliche Mikrofontechnik für bessere Präsentationen (eh23) [not-audio_url] [/not-audio_url]

Duration: 32:04
Präsentationen vor größerem Publikum oder Präsentationen, die aufgenommen werden, brauchen quasi immer ein Mikrofon irgendeiner Art. In dieser Präsentation zeigen wir euch übliche Probleme, die Vortragende mit Mikrofonen…
Testing-Elevator (eh23) [not-audio_url] [/not-audio_url]

Duration: 55:59
Abseits des bekannten Unittests gibt es noch mehr - eine Technikdemonstration auf verschiedenen Testebenen. Testen ist wichtig und verhindert schwerwiegende Fehler. Der Fokus kann hierbei verschieden granular bzw. abstra…
eh23 Opening (eh23) [not-audio_url] [/not-audio_url]

Duration: 41:05
Willkommen im Kaninchenbau. Oder dem, was ihr dafür haltet. Wir eröffnen das Event mit einer dringenden Systemwarnung: Das Maskottchen ist ein Bug, keine Eigenschaft. In diesem Opening klären wir die Regeln der Simulatio…
Reproducible Linux Systems with Nix & NixOS (clt26) [not-audio_url] [/not-audio_url]

Duration: 43:04
Did you ever try to do something on Linux, ask for advise and heard: “But it works on my machine”? Usually what follows is a long and tedious debugging session, just to find that a dependency has been forgotten somewhere…
Let's Talk About Sex – Programme züchten mit Evolution (clt26) [not-audio_url] [/not-audio_url]

Duration: 56:58
Die Evolution ist ein erstaunlich effizientes Optimierungsverfahren. Immerhin hat es alle Lebewesen und auch uns aus dem Nichts erschaffen. Mit simulierter Evolution kann man das Prinzip erfolgreich auf viele andere Aufg…
Galera-Cluster mit Docker: Architektur und Betrieb (clt26) [not-audio_url] [/not-audio_url]

Duration: 38:11
In diesem Vortrag zeige ich den vollständigen Aufbau eines MariaDB-Galera-Clusters in Docker-Containern auf Basis von Docker-Compose. Schritt für Schritt wird erklärt, wie die Compose-Files strukturiert und konfiguriert…