Using DNA to Solve Cold Cases Just Got a Lot Easier, Thanks to This Math

Caroline Delbert

Fri, Nov 4, 2022, 5:22 PM8 min read

genetic research, pipette and dna samples on dna autoradiogram illustrating research into life sciences and genetic modification — New Algorithm Could Turbocharge Solving Cold CasesAndrew Brookes - Getty Images

"Hearst Magazines and Yahoo may earn commission or revenue on some items through the links below."

A new method for solving forensic genetic puzzles is ten times faster than the current method investigators use to solve crimes.
Researchers from Stanford and genetic organizations have developed a decision-making algorithm that uses probability to narrow down which areas of a family tree to focus on—which are often extensive and go back hundreds of years—to more efficiently identify the DNA target.
The Golden State Killer investigation, which investigators finally solved in 2018, popularized forensic genetic genealogy.

Scientists say using math to sort through DNA could help investigators put stubborn cold cases to rest. The approach combines the relatively new field of forensic genetic genealogy—solving crime by charting out DNA-based family trees—with increasing computational power to speed up and simplify this complex form of investigation.

In a new paper recently published in the Journal of Forensic Sciences, researchers from Stanford University, California-based Identifinders, and the DNA Doe Project explain how they developed a new mathematical model to help investigators greatly narrow down their giant pools of genetic candidates:

“We formulate a program that—given the list of matches and their genetic distances to the unknown target—chooses the best decision at each point in time: which match to investigate, which set of potential most recent common ancestors to descend from, or whether to terminate the investigation.”

🧬 Science is on our side. We’ll help you make sense of it all with Pop Mech Pro.

By using a decision tree to optimize the candidate search, the researchers say their new process improves the existing process for forensic genetic genealogy by a factor of ten. They can also use this protocol to pull relevant matches even from large pools with a low likelihood of success. In fact, the new algorithm is so effective that researchers say it “can solve a case with a 7,500-person family tree around 94 percent of the time,” compared to only 4 percent of the time with the current method, according to a Stanford University press release.

Basically, it’s a great way to speed up and enrich the research investigators are already doing—like turning your regular bicycle into an e-bike.

Genetic Genealogy Takes on Crime

Genetic genealogy is the term for combining DNA testing with traditional genealogy to create family trees on a genetic basis—think at-home genetic testing like 23andMe combined with Ancestry.com (which now offers its own DNA testing). It’s also used to test unknown exhumed remains against modern descendants. Genetic genealogy becomes forensic when it’s applied to solving a crime.

The applications for this genetic information are easy enough to see. If an unknown deceased person is found or DNA from a criminal suspect can’t be identified via traditional means, police may take that genetic information and then cross-check it against other data—like what’s known about missing persons at the time. When direct genetic information isn’t available, they can ask close relatives and look for the percentage of shared DNA that indicates a family relationship. On average, a person shares roughly 25 percent of their DNA with a grandparent, 12.5 percent with a first cousin, and 3.13 percent with a second cousin.

us dna testing — *An at-home DNA test kit from 23andMe.*ERIC BARADAT - Getty Images

When millions of people began buying and submitting at-home genetic testing kits, that information was largely made available to law enforcement, despite ongoing questions of legality. That means police now have access to a much larger DNA pool, which they can use to find matches for unidentified victims or suspects of violent crime.

In 2018, investigators used forensic genetic genealogy to split open a major case for the very first time: capturing the Golden State Killer. In that case, one man—himself a former police officer—committed at least 13 murders, 51 rapes, and dozens of burglaries and other crimes in California throughout the 70s and 80s. Because of the variety of crimes and wide geographical area, investigators only consolidated all three major streaks into one file they named the “Golden State Killer” in 2013, decades after the crimes ended in 1986. Police combined DNA databases and made many different family trees, ranging as far back as the 1800s, then narrowed down the suspects to just one.

Like Finding a Needle in a Haystack

So far, Stanford University reports, forensic genetic genealogy has been used to solve over 400 crimes. But the process is tedious, and it’s mostly been undertaken by individuals who felt committed to seeing the process through. And you might be thinking, correctly, that the process is ripe for the application of some raw computing power. Isn’t genetic information just a big list or database, ready to search?

That’s not exactly wrong, but it’s not the whole story. Genetics are messy and enormous. Family relationships get a lot less noticeable and identifiable very quickly as you move away from the immediate family group.

a family tree — *In 2007, Smithsonian genealogist Deb Hull-Walski was able to identify a body from an 1800s cast iron coffin found in Washington, D.C., in part thanks to DNA testing.*James M. Thresher/The Washington Post - Getty Images

The researchers used data from 17 actual cases to test their model. In each case, the target’s DNA—that of the suspect or the victim—produced anywhere from 200 to 5,000 matches. “It is not obvious how many matches, and which of these matches, to investigate, nor is it obvious how to optimally look for an intersection among their families,” the authors write in the study. And so, while we have more computing power than ever before, investigators still need help structuring their searches. This is where the decision-making math comes in.

A decision tree is kind of like a game of Guess Who?. In this iconic children’s game, a full docket of people share certain traits like hair and eye color, glasses, or facial hair. Players ask each other eliminating questions—is your person blond? Do they have brown eyes?—then flip down the candidates they’ve eliminated. But instead of following visible genetic traits, the algorithm looks at the underlying genomes of the matches and their possible relationship with the target. At each juncture, the researchers’ model makes a decision on which lead to pursue.

A More Efficient, Mathematical Approach

The researchers took a different approach, which they refer to as their “proposed strategy,” over that of the current method, which they call the “benchmark strategy.”

“The benchmark method looks for common ancestors between different matches. What you really want to find is the most recent common ancestor between a match and the unknown target, and that’s a slightly different problem,” Lawrence Wein, one of the study authors and a professor of operations, information, and technology at Stanford University, says in the release. According to the researchers, their proposed method is far more efficient because it significantly reduces the overall workload and number of dead-end leads.

As for the math used to help parse through all the genetic data, the researchers created a two-part algorithm that is a kind of stochastic dynamic program, which they define as “the standard approach to solving multi-period optimization problems under uncertainty.”

At every step, the program uses probability while prioritizing the most cost-effective matches. In part, it does this by using the Autocluster tool from GEDmatch, which groups “DNA matches of people who have a common ancestor and likely belong to the same branch of the family tree” according to the company. The algorithm also uses “probabilistic information about the relationship between the target and the match,” according to the study. (The algorithm allows for quite a bit of leeway, too, and even matches with little probability of success are explored.) Meanwhile, the current benchmark method uses neither of those, and requires manual legwork from investigators to determine which DNA-match leads to pursue.

The first step looks at each “generation-ancestral couple pair”—identified matches that had offspring together—and assesses the probability of finding the target by working downward from that pair. If a pair’s cost-effectiveness value passes the threshold the researchers set, then it’s worth investing time into looking at the descendants of that pair in hopes of identifying the target.

In the second step, if a matched pair doesn’t meet the threshold value—if the algorithm deems it improbable that the target is their direct descendant—the algorithm will then work upward in the family tree from that point, then downward again if it finds any promising candidates, until the most recent common ancestor(s) of the unknown target is found.

The Future of Solving Crimes?

So what the researchers from Stanford and elsewhere have done is use mathematical inference to get a huge head start on the game by calculating how likely each candidate is based on the information at hand. They describe it as a kind of “roadmap” for investigators to follow. That means the questions investigators ask after that can be smarter, more specific, and more impactful on their investigations.

Still, the researchers point out that their method can’t fully replace the work done by genealogists, who may use more case-specific information, like location, in their search. And investigators still have to put in the time to solve the case and attain justice.

But results of the study certainly speak for themselves. With a model that’s purportedly ten times better than what we have now, that list of 400 solved cases could soon grow by quite a bit—and very quickly.

Additional reporting by Jessica Coulon.

You Might Also Like

Forget Nvidia: Members of Congress Are Scooping Up Shares of Its Core Rival Instead
There's a much more popular AI stock on Capitol Hill than the leading AI chip maker.
Motley Fool•13h ago
Are You Saving Enough To Be In The Top 3% Of Retirees? Here's How Much You Need
Retirement savings play a crucial role in securing financial stability in your golden years. For those aiming high, being in the top 3% of retirees in terms of savings enhances comfort and offers greater financial freedom. Using data from the Federal Reserve's Survey of Consumer Finances, a 2024 survey by the Employee Benefit Research Institute, reveals that individuals with over $1 million in retirement accounts rank in the top 3% of retirees. Only 3.2% of retirees have surpassed the $1 million
Benzinga•1d ago
Nvidia Owns a 3.4% Stake in This Innovative Artificial Intelligence (AI) Stock Cathie Wood Loves
The $50 million purchase could turn into Nvidia's biggest investment.
Motley Fool•9h ago
2 Stock-Split Stocks to Buy Hand Over Fist Right Now
These proven wealth builders could be exactly what you're searching for right now.
Motley Fool•1d ago
3 Stocks to Invest $30,000 in Right Now
$10,000 to each of these companies could deliver major returns over the next decade.
Motley Fool•5h ago
Housing supply surges by up to 50% in these metro areas — and many sellers are being forced to slash their asking prices
The property report includes 85 major metropolitan areas in the U.S. with populations of at least 750,000.
MarketWatch•2d ago
Generative AI Software Sales Could Soar 6,260%: My Pick for the Best AI Stock to Buy Now (Hint: Not Nvidia)
This little-known software stock could soar as businesses spend more on generative artificial intelligence.
Motley Fool•14h ago
A Once-in-a-Generation Investment Opportunity: 1 Artificial Intelligence (AI) Stock to Buy Now and Hold Forever
Lots of companies are trying to capitalize on the AI revolution, but one company sticks out from the pack.
Motley Fool•4h ago
U.S. panic over national debt might mark a culture shift—are Americans becoming more ‘European’ about money?
Jamie Dimon and Jerome Powell are taking the European viewpoint on soaring debt levels in the U.S.
Fortune•1d ago
Worried About a Stock Market Sell-Off? Buy This Top Vanguard ETF
There are benefits to investing in stocks that have reasonable multiples and aren't as dependent on future earnings growth to justify their valuations.
Motley Fool•10h ago

News

Life

Entertainment

Finance

Sports

New on Yahoo

Yahoo Finance

Using DNA to Solve Cold Cases Just Got a Lot Easier, Thanks to This Math

🧬 Science is on our side. We’ll help you make sense of it all with Pop Mech Pro.

Genetic Genealogy Takes on Crime

Like Finding a Needle in a Haystack

A More Efficient, Mathematical Approach

The Future of Solving Crimes?

Recommended Stories

Forget Nvidia: Members of Congress Are Scooping Up Shares of Its Core Rival Instead

Are You Saving Enough To Be In The Top 3% Of Retirees? Here's How Much You Need

Nvidia Owns a 3.4% Stake in This Innovative Artificial Intelligence (AI) Stock Cathie Wood Loves

2 Stock-Split Stocks to Buy Hand Over Fist Right Now

3 Stocks to Invest $30,000 in Right Now

Housing supply surges by up to 50% in these metro areas — and many sellers are being forced to slash their asking prices

Generative AI Software Sales Could Soar 6,260%: My Pick for the Best AI Stock to Buy Now (Hint: Not Nvidia)

A Once-in-a-Generation Investment Opportunity: 1 Artificial Intelligence (AI) Stock to Buy Now and Hold Forever

U.S. panic over national debt might mark a culture shift—are Americans becoming more ‘European’ about money?

Worried About a Stock Market Sell-Off? Buy This Top Vanguard ETF