Communities

Writing
Writing
Codidact Meta
Codidact Meta
The Great Outdoors
The Great Outdoors
Photography & Video
Photography & Video
Scientific Speculation
Scientific Speculation
Cooking
Cooking
Electrical Engineering
Electrical Engineering
Judaism
Judaism
Languages & Linguistics
Languages & Linguistics
Software Development
Software Development
Mathematics
Mathematics
Christianity
Christianity
Code Golf
Code Golf
Music
Music
Physics
Physics
Linux Systems
Linux Systems
Power Users
Power Users
Tabletop RPGs
Tabletop RPGs
Community Proposals
Community Proposals
tag:snake search within a tag
answers:0 unanswered questions
user:xxxx search by author id
score:0.5 posts with 0.5+ score
"snake oil" exact phrase
votes:4 posts with 4+ votes
created:<1w created < 1 week ago
post_type:xxxx type of post
Search help
Notifications
Mark all as read See all your notifications »
Q&A

Comments on The effect of measurement accuracy and rounding on hypothesis testing

Parent

The effect of measurement accuracy and rounding on hypothesis testing

+1
−0

I am checking the temperature I have at home with accuracy of one tenth of a grade (Celsius). The easily publicly available information is temperature in grades, with no decimals.

I am doing very basic hypothesis testing: My null hypothesis is that temperatures I measure do not have a systematic bias upwards or downwards from the public data, while my alternative hypothesis is that the temperatures I measure are systematically higher or systematically lower. I am doing a simple T-test and checking if the average of the differences is far from zero.

Does it make a difference whether I round my own measurements to integers, or, (essentially equivalently,) round the differences to integers? In particular, is there a bias in a particular direction, towards throwing away the null hypothesis or the opposite, if my data has more accuracy then what I am comparing it to?

Notes

This project breaks assumptions of hypothesis testing; at least independence of measurements and possibly the normal distribution of the differences.

The point of the question is not these, but rather the possible effect of rounding or different precision in the ground truth data and the measurements on the hypothesis test.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.
Why should this post be closed?

0 comment threads

Post
+0
−0

Your measurements are more precise, which is actually a good thing. Rounding them to match the less-precise public data might feel like you’re making things consistent, but what you’re really doing is tossing out useful info. And when you’re running a hypothesis test, that extra decimal can matter—especially if the differences you’re trying to detect are subtle.

Here’s the deal: If your data is pretty tight (like, low variability), then rounding could mess things up more. But if there’s a lot of natural fluctuation (like temps bouncing all over the place), then rounding might not change much. Still, why risk it?

Best move? Keep your 0.1°C data as-is for the actual stats and analysis. You can always round later when you’re showing results to someone who doesn’t need the fine-grained details.

If you’re really worried about fairness, you can even treat the public data like it represents a range—like if it says 22°C, maybe it actually means somewhere between 21.5 and 22.5°C. That way, you can still work with your full-precision data and just adjust how you interpret the comparison.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.

1 comment thread

Does the rounding have a systematic effect on the hypothesis testing, as asked? (1 comment)
Does the rounding have a systematic effect on the hypothesis testing, as asked?
tommi‭ wrote 7 days ago

Thanks for the response. Does the rounding have a systematic effect on the hypothesis testing, as asked?