Last week, the ongoing debate about COVID-19’s origins acquired a new plot twist. An evolutionary biologist from France discovered a treasure trove of genetic sequences from skins taken from wet markets in Wuhan, China shortly after the outbreak. She and a team of international colleagues then downloaded the data to try to understand who or what might have brought the virus to the market. What they found, as The Atlantic first reported on Thursday, bolsters the case for the pandemic having purely natural roots: The genetic data suggest that live mammals illegally for sale at the Huanan Seafood Wholesale Market–among them, raccoon dogs, a foxlike species known to be susceptible to the virus–may have been carrying the coronavirus at the end of 2019.
But what might otherwise have been a straightforward story on new evidence has rapidly morphed into a mystery centered on the origins debate’s data gaps. Within a day or so of nabbing the sequences off a database called GISAID, the researchers told me, they reached out to the Chinese scientists who had uploaded the data to share some preliminary results. The next day, public access to the sequences was locked–according to GISAID, at the request of the Chinese researchers, who had previously analyzed the data and drawn distinctly different conclusions about what they contained.
Yesterday evening, the international team behind the new Huanan-market analysis released a report on its findings–but did not post the underlying data. The write-up confirms that genetic material from raccoon dogs and several other mammals was found in some of the same spots at the wet market, as were bits of SARS-CoV-2’s genome around the time the outbreak began. Some of that animal genetic material, which was collected just days or weeks after the market was shut down, appears to be RNA–a particularly fast-degrading molecule. This strongly indicates that samples of the animal genetic material were taken from the market within days or weeks. It is possible for the virus’s travel to our country through the mammal channel. Ravindra Gupta (a University of Cambridge virologist), said that there is more evidence to suggest this animal spillover occurred. She was not part of the research. “A year and a half ago, my confidence in the animal origin was 80 percent, something like that. Now it’s 95 percent or above.”
For now, the report is just that: a report, not yet formally reviewed by other scientists or even submitted for publication to the journal–and that will remain the case as long as this team continues to leave space for the researchers who originally collected the market samples, many of them based at the Chinese Center for Disease Control and Prevention, to prepare a paper of their own. And still missing are the raw sequence files that sparked the reanalysis in the first place–before vanishing from the public eye.
Every researcher I asked emphasized just how important the release of that evidence is to the origins investigation: Without data, there’s no base-level proof–nothing for the broader scientific community to independently scrutinize to confirm or refute the international team’s results. Gigi Gronvall (a senior scholar at Johns Hopkins Center for Health Security who was not involved in this new analysis) says that without raw data “some people might say that it isn’t true”. Other clues to the origins of the pandemic are also raised by data that flickers on and off the public internet. Still more evidence might be out there, yet undisclosed.
Transparency is always an essential facet of research, but all the more so when the stakes are so high. SARS-CoV-2 has already killed nearly 7 million people, at least, and saddled countless people with chronic illness; it will kill and debilitate many more in the decades to come. Every investigation into how it began to spread among humans must be “conducted as openly as possible,” says Sarah Cobey, an infectious-disease modeler at the University of Chicago, who wasn’t involved in the new analysis.
The team behind the reanalysis still has copies of the genetic sequences its members downloaded earlier this month. But they’ve decided that they won’t be the ones to share them, several of them told me. For one, they don’t have sequences from the complete set of samples that the Chinese team collected in early 2020–just the fraction that they spotted and grabbed off GISAID. Researchers claim that even if they had all the data, it is not their right to publish them. That’s up to the China CDC team that originally collected and generated the data.
Part of the international team’s reasoning is rooted in academic decorum. There isn’t a set-in-stone guidebook among scientists, but adhering to unofficial rules on etiquette smooths successful collaborations across disciplines and international borders–especially during a global crisis such as this one. It is not a good idea to release data from another team, which was the result of hard work by that team. It risks misattribution of credit, and opens the door to the Chinese researchers’ findings getting scooped before they publish a high-profile paper in a prestigious journal. Niema Moshiri is a computational biologist from UC San Diego, and was one of the report’s authors. “They produced the data, so it’s their data to share with the world.”
If the international team released what data it has, it could potentially stoke the fracas in other ways. The World Health Organization has publicly indicated that the data should come from the researchers who collected them first: On Friday, at a press briefing, Tedros Adhanom Ghebreyesus, the WHO’s director-general, admonished the Chinese researchers for keeping their data under wraps for so long, and called on them to release the sequences again. “These data could have and should have been shared three years ago,” he said. The fact it was not is “disturbing” considering how it could have helped investigations earlier on. Gregory Koblentz of George Mason University, a biodefense specialist, said that the data wasn’t shared. He wasn’t part in this new analysis.
Publishing the current report has already gotten the researchers into trouble with GISAID, the database where they found the genetic sequences. During the pandemic, the database has been a crucial hub for researchers sharing viral genome data; founded to provide open access to avian influenza genomes, it is also where researchers from the China CDC published the first whole-genome sequences of SARS-CoV-2, back in January 2020. A few days later, the researchers told me that they were contacted several times by a GISAID administrator. He chastised them for not cooperating with the China CDC team, and advised them to avoid publishing a paper that uses the China CDC data. The email stated that they were at risk of breaking the terms of the site and could lose their access to the database. It would also be an offense to distribute the data to anyone other than GISAID users, including the wider research community.
This morning, hours after the researchers released their report online, many of them found that they could no longer log in to GISAID–they received an error message when they input their username and password. “They may indeed be accusing us of having violated their terms,” Moshiri told me, though he can’t be sure. Moshiri said that they may have accused us of violating their terms, but he can’t be certain. His colleagues and Moshiri maintain they acted in good faith, and that they haven’t broken any database terms. However, GISAID claims they tried to contact them multiple times to offer collaboration with China CDC. This has so far “thusfar declined,” according to the report of international team.
Why, exactly, the sequences were first made public only so recently, and why they have yet to reappear, remain unclear. In a recent statement, the WHO said that access to the data was withdrawn “apparently to allow further data updates by China CDC” to its original analysis on the market samples, which went under review for publication at the journal Nature last week. There’s no clarity, however, on what will happen if the paper is not published at all. Three of the Chinese researchers, William Liu and Guizhen Wu, didn’t respond to my request for information about the intentions behind the data.
“We want the data to come out more than anybody,” says Saskia Popescu, an infectious-disease epidemiologist at George Mason University and one of the authors on the new analysis. Until then, the international team will be fielding accusations, already flooding in, that it falsified its analyses and overstated its conclusions.
Researchers around the world have been raising questions about these particular genetic sequences for at least a year. In February 2022, the Chinese researchers and their close collaborators released their analysis of the same market samples probed in the new report, as well as other bits of genetic data that haven’t yet been made public. But their interpretations deviate pretty drastically from the international team’s. According to the Chinese team, any virus that was found on the market could have been brought in from infected people. “No animal host of SARS-CoV-2 can be deduced,” the researchers asserted at the time. Although the market had perhaps been an “amplifier” of the outbreak, their analysis read, “more work involving international coordination” would be needed to determine the “real origins of SARS-CoV-2.” When reached by Jon Cohen of Science magazine last week, Gao described the sequences that fleetingly appeared on GISAID as “[n]othing new. It had been known there was illegal animal dealing and this is why the market was immediately shut down.”
There is, then, a clear divergence between the two reports. Gao’s assessment indicates that finding animal genetic material in the market swabs merely confirms that live mammals were being illegally traded at the venue prior to January 2020. The researchers behind the new report insist that the narrative can now go a step further–they suggest not just that the animals were there, but that the animals, several of which are already known to be vulnerable to SARS-CoV-2, were there, in parts of the market where the virus was also found. This close proximity and the inability of the virus to survive without an animal host suggests that there may be another infection.
The Chinese researchers used this same logic of location–multiple types of genetic material pulled out of the same swab–to conclude that humans were carrying around the virus at Huanan. Reanalysis confirmed that the market was likely to have had infected persons at one point or another before closing. But they were unlikely to be the virus’s only chauffeurs: Across several samples, the amount of raccoon-dog genetic material dwarfs that of humans. At one stall in particular–located in the sector of the market where the most virus-positive swabs were found–the researchers discovered at least one sample that contained SARS-CoV-2 RNA, and was also overflowing with raccoon-dog genetic material, while containing very little DNA or RNA material matching the human genome. That same stall was photographically documented housing raccoon dogs in 2014. The case is not a slam dunk: No one has yet, for instance, identified a viral sample taken from a live animal that was swabbed at the market in 2019 before the venue was closed. Gronvall from JHU told me that the situation is clearer than ever. “All of the science is pointed” in the direction of Huanan being the pandemic’s epicenter, she said.
To further untangle the significance of the sequences will require–you guessed it–the now-vanished genetic data. Because they don’t have the sequences, some researchers remain unsure about the importance of this new analysis. Others are also wondering whether more data could yet emerge, given how long this particular set went unshared. “This is an indication to me in recent days that there is more data that exists,” Maria Van Kerkhove, the WHO’s COVID-19 technical lead, told me. Which means that she and her colleagues haven’t yet gotten the fullest picture of the pandemic’s early days that they could–and that they won’t be able to deliver much of a verdict until more information emerges. The new analysis does bolster the case for market animals acting as a conduit for the virus between bats (SARS-CoV-2’s likeliest original host, based on several studies on this coronavirus and others) and people; it doesn’t, however, “tell us that the other hypotheses didn’t happen. We can’t remove any of them,” Van Kerkhove told me.
More surveillance for the virus needs to be done in wild-animal populations, she said. The market swabs data could be useful in helping to identify mammals who might have been exposed by bats, or other intermediaries from a specific area of China. At the same time, to further investigate the idea that SARS-CoV-2 first emerged out of a laboratory mishap, officials need to conduct intensive audits and investigations of virology laboratories in Wuhan and elsewhere. Last month, the U.S. Department of Energy ruled that such an accident was the likelier catalyst of the coronavirus outbreak than a natural spillover from wild animals to humans. The ruling echoed earlier judgments from the FBI and a Senate minority report. But it contrasted with the views of four other agencies, plus the National Intelligence Council, and it was made with “low confidence” and based on “new” evidence that has yet to be declassified.
The longer the investigation into the virus’s origins drags on, and the more distant the autumn of 2019 grows in our rearview, “the harder it becomes,” Van Kerkhove told me. Many in the research community were surprised that new information from market samples collected in early 2020 emerged at all, three years later. Settling the squabbles over SARS-CoV-2 will be especially tough because the Huanan market was so swiftly shut down after the outbreak began, and the traded animals at the venue rapidly culled, says Angela Rasmussen, a virologist at the University of Saskatchewan and one of the researchers behind the new analysis. Raccoon dogs, one of the most prominent potential hosts to have emerged from the new analysis, are not even known to have been sampled live at the market. Koblentz of George Mason University told me that “that evidence is gone now” if ever it existed. For months, Chinese officials were even adamant that no mammals were being illegally sold at the region’s wet markets at all.
So researchers continue to work with what they have: swabs from surfaces that can, at the very least, point to a susceptible animal being in the right place, at the right time, with the virus potentially inside it. Rasmussen said that this is all the data we have at the moment. It may never be enough to fully settle this debate. The world does not know how much evidence is available at the moment, nor what might, or should be, revealed.
The post A Major Clue to COVID’s Origins Is Just Out of Reach appeared first on The Atlantic.