Publication:

Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering

Date

Date

Date
2025
Master's Thesis

Citations

Citation copied

Jingmiao Li. (2025). Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-281059

Abstract

Abstract

Abstract

This thesis addresses the problem of false positives in watermark detection for large language models (LLMs), with a particular focus on open-domain news-style text. As LLMs increasingly generate content that mimics human writing, the risk of mistakenly classifying human-written text as watermarked poses serious ethical and practical concerns. To mitigate this issue, this study investigates two strategies that extend the KGW framework: named entity-based weighting and entropy-based filtering. The proposed methods aim to reduce the inf

Metrics

Citations

Additional indexing

Creators (Authors)

  • Jingmiao Li

Institution

Institution

Institution

Faculty

Faculty

Faculty
Faculty of Arts

Item Type

Item Type

Item Type
Master's Thesis

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Publication date

Publication date

Publication date
2025-06

Date available

Date available

Date available
2025-12-02

OA Status

OA Status

OA Status
Green

Metrics

Citations

Citations

Citation copied

Jingmiao Li. (2025). Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-281059

Green Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image