Near Duplicates in Survey Data Series
general
rstats
survey research
duplication
fraud
Do you know how to detect exact or near duplicate rows in your data? Read on to learn more!
You have reached the landing page for our “Near Duplicates in Survey Data” series. Click below to read the posts in this series:
Trust Issues
Trust Issues: Examining Near Duplicates in Survey Data
Do you know how to detect exact or near duplicate rows in your data? Read on to learn more!
Stumbling in the Dark
Stumbling in the Dark: Building/Iterating an R Function to Match Stata’s percentmatch
If you are looking for more information about the modified R function we used to detect near duplicates, then you have come to the right place. (Code shared; detailed write-up forthcoming)
Trust Issues, Part 2
Trust Issues, Part 2: Investigating Near Duplicates in Different Data
This follow-up to our first entry on near duplication in survey data analyzes near duplicates in three more international survey data sets. (Forthcoming)
Reuse
Citation
BibTeX citation:
@online{day2023,
author = {Day, Jake and Brauer, Jon and Kotlaja, Maja},
title = {Near {Duplicates} in {Survey} {Data} {Series}},
date = {2023-10-10},
url = {https://www.reluctantcriminologists.com/blog-posts/[8]/dup-index.html},
langid = {en}
}
For attribution, please cite this work as:
Day, Jake, Jon Brauer, and Maja Kotlaja. 2023. “Near Duplicates in
Survey Data Series.” October 10, 2023. https://www.reluctantcriminologists.com/blog-posts/[8]/dup-index.html.