Its strength is generating models of reality that have predictive power, and fine-tuning those models as new information is obtained.
Its weaknesses are a lack of absolute certainty and the inability to model that which has no detectable impact on reality.
Also never touching any why-questions
Removed by mod
Truth in any context will always rely on facts
Why?
We got some 101’s in here beanbag chairin it up.
Speak for yourself, I’m having this conversation from a papasan chair I found on the side of the road
Yeah I’m the one on the beanbag sorry for the confusion guys
Because without facts, what you have is not “truth.” It’s either speculation or bullshit.
But how do you define “facts?” And how do you define “truth?” And how do you define “is?”
We’ll see who cancels who?
Thanks, Jordan.
I think the point is this is paradoxical. Everything must be proven by facts and we cannot trust any general, abstract statement of its own accord, then how can we prove “everything must be proven by facts and we cannot trust any general, abstract statement of its own accord”? What if that’s a wrong assumption?
Maybe the truth is we don’t always need to rely on observable facts, but we don’t know that because we’re making the aforementioned assumption without having any proof that it’s correct.
axioms have entered the chat
The deeper you go in the why territory, the more abstract and tangental your axioms get.
So yeah. All facts and truths ultimately rest on foundations that are either kinda unobservable or unproven. Doesn’t make them less practical or true (by practical definitions) though.
To get a fact out of an observation requires interpretation and a desire-to-interpret. It’s observation translated into dreamstuff.
“Why”, when distinguished from “how”, is asking about the intent of a thinking agent. Neuroscience, psychology, and sociology exist for when thinking agents are involved. When they’re not, that type of “why” makes no sense.
I think that’s because there is no answer to “why” - At least not one that would satisfy the human mind.
The best we are ever going to be getting is “it just is”.
P<0.05 means one in 20 studies are relevant just by chance. If you have 20 researchers studying the same thing then the 19 researchers who get non significant results don’t get published and get thrown in the trash and the one that gets a “result” sees the light of day.
Thats why publishing negative results is important but it’s rarely done because nobody gets credit for a failed experiment. Also why it’s important to wait for replication. One swallow does not make a summer no matter how much breathless science reporting happens whenever someone announces a positive result from a novel study.
TL;DR - math is hard
Also, check out this one weird trick to get positive results almost every time: just use 20 different end points!
P<0.05
How might one translate this to everyday language?
P<0.05 means the chance of this result being a statistical fluke is less than 0.05, or 1 in 20. It’s the most common standard for being considered relevant, but you’ll also see p<0.01 or smaller numbers if the data shows that the likelihood of the results being from chance are smaller than 1 in 20, like 1 in 100. The smaller the p value the better but it means you need larger data sets which costs more money out of your experiment budget to recruit subjects, buy equipment, and pay salaries. Gotta make those grant budgets stretch so researchers will go with 1 in 20 to save money since it’s the common standard.
To expand on the other fella’s explanation:
In psychology especially, and some other fields, the ‘null hypothesis’ is used. That means that the researcher ‘assumes’ that there is no effect or difference in what he is measuring. If you know that the average person smiles 20 times a day, and you want to check if someone (person A) making jokes around a person (person B) all day makes person B smile more than average, you assume that there will be no change. In other words, the expected outcome is that person B will still smile 20 times a day.
The experiment is performed and data collected. In this example, how many times person B smiled during the day. Do that for a lot of people, and you have your data set. Let’s say that they discovered the average amount of smiles per day was 25 during the experimental procedure. Using some fancy statistics (not really fancy, but it sure can seem like it) you calculate the probability that you would get an average of 25 smiles a day if the assumption that making jokes around a person would not change the 20-per-day average. The more people that you experimented on, and the larger the deviance from the assumed average, the lower the probability. If the probability is less than 5%, you say that p<0.05, and for a research experiment like the one described above, that’s probably good enough for your field to pat you on the back and tell you that the ‘null hypothesis’ of there being no effect from your independent variable (the making jokes thing) is wrong, and you can confidently say that making jokes will cause people to smile more, on average.
If you are being more rigorous, or testing multiple independent variables at once, as you might for examining different therapies or drugs, you starting making your X smaller in the p<X statement. Good studies will predetermine what X they will use, so as to avoid making the mistake of settling on what was ‘good enough’ as a number that fits your data.
Good example and well explained. We should team up on a book on science for lay people!
Your point about specifying the null hypothesis and the p value is very important. Another way studies can fail is if you pick 20 different variables, like you mentioned, and then look to see if any of them give you p<0.05. So in your example, we measure smiling and 19 other factors besides being told jokes. Let’s say the weather, the day of the week, what color clothes the person is wearing, what they had for breakfast, etc. Again, due to statistics, one of those 20 is going to appear relevant by chance. You’re essentially doing 20 experiments in one so again you’ll get one spurious result that you can report as “success”.
Experimental design is tough and it’s hard to grok until you’ve had to design and run your own experiment including the math. That makes it easy for people to pass off bad science as legitimate, whether accidentally or on purpose. And it’s why peer review is important, where your study gets sent to another researcher in your field for critique before publication.
There’s other things besides bad math that can trip you up like correlation vs causation, and how the data is gathered. In the above example, you might try to save money by asking subjects to self report on their smiling. But people are bad at doing that due to fallible memory and bias (did that really count as a full smile?). Ideally you want to follow them around and count yourself, with a clear definition of what counts as a smile. Or make them wear a camera that does facial recognition. But both of those cost more money than just handing someone a piece of paper and a pencil and hoping for the best. That’s why you should always be extra suspicious of studies that use self reporting. As my social psych prof said, surveys are the worst form of data collection. It’s what makes polling hard because what people say and what they do are often entirely different things.
I think most science books are understandable by laypersons, except those that are memorization heavy, like biochemistry, or organic chemistry, or some parts of things like microbiology and pathophysiology. Statistics books and research design were pretty understandable, except for the actual math, heh. There really needs to be a push for people to read them casually, and encouraged to just stick to the concept parts and ignore the math and memorization of minor stuff. The free textbooks out there (I think openstax is pretty good, personally) are getting to the point where I think people might read them just for the ‘ooh’ part of science. Heck, it’s why psychology is such an enticing subject in the first place; it’s basically the degree of human interest facts.
I just thought that understanding the way the null hypothesis is used is important to really grasp what information the p is really conveying.
:D And for the parts about self reporting bias, and definitions and such, I was really, really having to hold myself back from talking about what makes your variables independent or dependent, operational definitions, ANOVA and MANOVA and t-tables and Cohen’s D value and the emphasis on not p but now the error bars and all the other lovely goodies. The stuff really brings me back, eh? ;)
I feel like this applies more to flaws in how studies are published and the incentives surrounding that more than the scientific method.
Researchers here. The scientific method is unbelievably tedious. Way more tedious than you would think. So much so that people are willing to pay researchers to do it for them. A simple yes or no question takes weeks or months to answer if you’re lucky.
But the upside is that we can remove our own biases from the answer as much as possible. If you see an obvious difference between any 2 groups, then there’s little to no point in doing the scientific method. But if the difference is less clear, like borderline visible, then biases start to creep in. Someone who thinks there’s no difference will see the data and think there’s no difference. And someone who thinks there’s a difference will look at the data and think there’s a difference. The scientific method excels in these cases, because it gives us a relatively objective way to determine if there is a difference or not between 2 groups
Strength:
- Allows us to predict the future and understand reality
Weakness:
- Only works on falsifiable hypotheses
- Relies on peer review and replication which are pretty dead
- Requires a basic understanding of math in order to understand why it works
I bet your textbook will tell you 😉
It is, in cases where it works, probably the best available method we have for finding the truth.
But there are a lot of questions it cannot answer, it can still give the wrong result just by chance, and the results are only as good as the assumptions you made. The last point is particularly important, and can allow bias to creep in even when all the experiments are done correctly.
Finally, real scientists often do not (and sometimes cannot) follow the scientific method perfectly, due to all sorts of reasons.
With the proviso that it depends how you define the scientific method…
One strength is it gives us a reasonably reliable way to investigate and share information, moving slowly forward with problems even though the people working on them might never meet, or even be alive at the same time.
A major downside is that (at least most popular versions of the scientific method) are designed to look at population level tendencies. And depending on the design and scale of these studies it can erase genuine differences. Let say we take a 50 people with skin rashes and give them some antifungal cream. For the vast majority of people this doesn’t help, and so our study shows that it’s an ineffective treatment for rashes. If we’d found a group of 50 people with rashes caused fungal infection, it would have been a highly effective treatment. So, if that’s the extent of our knowledge of rash treatments we would dismiss claims that antifungals “really helped me” as quack anecdotes.
Obviously, this is the process of investigation and refinement that is part of the science. But in the interim period, when working with things that we know we do not fully understand, we have to be careful to not over privilege “scientific evidence”. In a relatively new field, if one approach has “good evidence” and others don’t, this doesn’t mean they are necessarily less effective. They might just be less amenable to experimental designs that allows for their effectiveness to be shown, or they are effective for a specific subgroup that hasn’t been clearly identified yet. (obvs, this is not meant to be taken to say any woowoo bullshit ‘could’ work, but that there’s a whole messy middle between those two extremes.)
strength is it’s replicable. Not just somebody claiming something without justifying it can happen.
This is totally false in practice.
How is this incorrect? In which field? And how do you confirm
youthe validity of your methodology?Replication rarely happens and in many cases is outright impossible due to lack of shared code.
Things should be replicable, but that hasn’t been the case for a while.
So then the failure of the scientific method is that people aren’t following it. That’s not so much a problem with the method.
If a method can’t practically be followed it’s a sign of a bad method, or at least one that needs modification.
It’s not that it can’t practically be followed, it is just that everyone running after H-index or whatever the hot thing is now has resulted in a drop in quality.
It can easily be followed. Just not within capitalism.
Edit: But you’re correct. And that’s what we’re seeing. A modified version.
the correct term you need is ‘unachievable’, not ‘false’. […] anyway, it depends on the field and type of study.
That’s just wordplay to make the problem seem like it’s not as big of a problem.
Common standards for language formally used in a specific field/profession/discipline aren’t “wordplay” lol
This isn’t a professional forum. Playing the “it’s a technical term” game is absolutely wordplay.
Here’s a great article published yesterday on how science seems to be fueling the meaning crisis:
https://bigthink.com/13-8/why-science-must-contend-with-human-experience/
Weakness: despite its simplicity, it’s still way too complicated for some of the troglodytes to understand. So now we have to contend with the idiots believing in a flat earth and that climate change isn’t real.
Weakness: complete overkill for most of the problems in your life.
Weakness: better done in advance, before you even realized you needed that knowledge to solve some other problem.
Weakness: Immediate cash rewards unlikely.
Been thinking about how quantum physics are connected to chaos theory and the properties of closed dynamic systems.
Will spare you that. Part of it is the human mind doesn’t have the processing of all configurations, all the possible states of an entire systems, simultaneously.
Humans do have abstract thought, critical thinking. We can observe, record data, notice patterns, trends. By chaos theory, humans discovered they could write math equations to describe the behavior of complex systems. With quantum physics, humans trying to figure out how localized realities in a system related to the behavior of system as a whole.
We use scientific method because we can’t comprehend the infinite. Math equations are shorthand, a trick we use to make up for our shortcomings. Science and math is awesome.
Some good answers here already. Here’s a useful relevant article with more info: https://plato.stanford.edu/entries/scientific-method/
Weakness: not usable for quantum physics/mechanics, etc where the outcome changes when the experiment is observed.
That’s not true. The observation just becomes part of the experimental design, you can still make predictions about the outcome.
That’s the debate now isn’t it.
It’s only a debate if you’ve never seen a paper about quantum mechanics. That’s how it works. If you really think that the scientific method “doesn’t apply” to quantum mechanics, you’re deeply misunderstanding either the scientific method or quantum mechanics.
I will admit that I have a layman’s understand of quantum mechanics so that’s entirely possible.