Publication: Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering
Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering
Date
Date
Date
Citations
Jingmiao Li. (2025). Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-281059
Abstract
Abstract
Abstract
This thesis addresses the problem of false positives in watermark detection for large language models (LLMs), with a particular focus on open-domain news-style text. As LLMs increasingly generate content that mimics human writing, the risk of mistakenly classifying human-written text as watermarked poses serious ethical and practical concerns. To mitigate this issue, this study investigates two strategies that extend the KGW framework: named entity-based weighting and entropy-based filtering. The proposed methods aim to reduce the inf
Additional indexing
Creators (Authors)
Faculty
Faculty
Faculty
Item Type
Item Type
Item Type
Referees
In collections
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Language
Language
Language
Publication date
Publication date
Publication date
Date available
Date available
Date available
OA Status
OA Status
OA Status
Citations
Jingmiao Li. (2025). Reducing False Positives in Watermark Detection through Low-Entropy Token Filtering. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-281059