MIT-IBM AI Lab Analyzed 200,000 Bitcoin Transactions. Only 2% Were Labeled 'Illicit'
Blockchain analytics firm Elliptic collaborated with researchers to analyze $6 billion worth of bitcoin transactions.

Blockchain analytics firm Elliptic collaborated with researchers from the Massachusetts Institute of Technology (MIT) and IBM to publish a public dataset of bitcoin transactions associated with illicit activity.
The group’s study detailed how researchers at the MIT-IBM Watson AI Lab used machine learning software to analyze 203,769 bitcoin transactions worth roughly $6 billion in total. The research explored whether artificial intelligence could assist current anti-money laundering (AML) procedures.
Only 2 percent of the roughly 200,000 bitcoin transactions in the dataset were deemed illicit as part of Elliptic's initial work. While 21 percent were identified as lawful, the vast majority of the transactions, roughly 77 percent, remained unclassified. (To date, there have been an estimated 440 million bitcoin transactions since the network's launch in 2009.)
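For a sense of scale, the rounded percentages above can be turned into approximate transaction counts. The sketch below is only back-of-the-envelope arithmetic: it assumes the 203,769 total reported in the study and the rounded shares quoted here, so the results are estimates rather than the dataset's exact label counts. The function and variable names are illustrative.

```python
# Approximate class sizes implied by the article's rounded percentages.
TOTAL_TRANSACTIONS = 203_769  # total reported in the study

# Rounded shares quoted in the article (assumption: exact counts are not given).
shares = {"illicit": 0.02, "lawful": 0.21, "unclassified": 0.77}


def approximate_counts(total, class_shares):
    """Return the estimated number of transactions in each class."""
    return {label: round(total * share) for label, share in class_shares.items()}


counts = approximate_counts(TOTAL_TRANSACTIONS, shares)
for label, n in counts.items():
    print(f"{label:>12}: ~{n:,}")

# Share of illicit transactions among only those that received a label.
labeled = counts["illicit"] + counts["lawful"]
illicit_share_of_labeled = counts["illicit"] / labeled
print(f"illicit share of labeled transactions: ~{illicit_share_of_labeled:.1%}")
```

On these rounded inputs, illicit transactions come to a little under 9 percent of the transactions that were labeled at all, which is a reminder that the 2 percent figure is measured against a dataset in which most transactions carry no label.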
To be clear, the 2 percent comes from an Elliptic dataset that was not previously public, and the figure was merely affirmed by the MIT researchers' analysis. The data point is in line with a study from competing analytics firm Chainalysis, which estimated just 1 percent of bitcoin transactions in 2019 were known to be associated with illicit activity.
Since Elliptic is frequently hired by law enforcement agencies around the world to identify illegal activities using cryptocurrency, this research aimed to identify patterns that can help distinguish illicit usage from lawful bitcoin usage, especially among unbanked individuals or other unknown entities.
“A big problem with compliance, in general, is false positives. A big part of this research is minimizing the number of false positives,” Elliptic co-founder Tom Robinson told CoinDesk. “The key finding is that machine learning techniques are very effective at finding transactions that are illicit.”
Sometimes, Robinson added, software was able to find patterns that would be difficult to describe yet still matched with known entities, based on pre-existing data from darknet markets, ransomware attacks and other criminal investigations.
Following the academic study, Elliptic made the same dataset public to encourage open-source contributions.
“On the AML side, we are sharing our early experiments with domain experts to solicit feedback,” IBM researcher Mark Weber told CoinDesk, adding:
“We are also hoping the release of the Elliptic Data Set inspires others to join the effort to help make our financial systems safer by developing new techniques and models for AML.”
reported in April that surging demand for U.S. $100 bills was likely driven by a rise in global criminal activity. A 2017 report by the American Institute for Economic Research (https://www.aier.org/article/sound-money-project/how-much-cash-used-criminals-and-tax-cheats) estimated that "more than a third of all US currency in circulation is used by criminals and tax cheats."
Update (22:00 UTC, Aug. 6): The title of this article has been modified and language has been added to clarify that the 2 percent figure was calculated in Elliptic's initial work, and not in the subsequent analysis involving MIT-IBM Watson AI Lab.