Princeton researchers say machine learning is causing a ‘reproduction crisis’ in scientific research. They found data leakage in 329 papers, including medicine and social sciences. The researchers blame hype around machine learning and lack of training provided to those who use it. If that trend continues, the ripple effects in academia could be pretty severe. A mistake in an information pipeline anywhere will frequently lead to followup errors down the road — and as it probably goes without saying, that could have some pretty devastating consequences. It also doesn’t help that machine learning can, in a way, feel pretty difficult to question, especially when both ease and efficiency are part of the sales pitch. Clearly, researchers need to be careful, though, and really ask themselves if they actually know what they’re doing. . . .