Counterfeits on dark markets: a measurement between Jan-2014 and Sep-2015

Soldner, Felix; Kleinberg, Bennett; Johnson, Shane D.

doi:10.1186/s40163-023-00195-2

Research
Open access
Published: 17 October 2023

Crime Science volume 12, Article number: 18 (2023)

Abstract

Counterfeits harm consumers, governments, and intellectual property holders. They accounted for 3.3% of worldwide trades in 2016, having an estimated value of $509 billion in the same year. Estimations in the literature are mostly based on border seizures, but in this paper, we examined openly labeled counterfeits on darknet markets, which allowed us to gather and analyze information from a different perspective. Here, we analyzed data from 11 darknet markets for the period between Jan-2014 and Sep-2015. The findings suggest that darknet markets harbor similar counterfeit product types to those found in seizures but that the share of watches is higher while the share of electronics, clothes, shoes, and tobacco is lower on darknet markets. Also, darknet market counterfeits seem to have similar shipping origins as seized goods, with some exceptions, such as a relatively high share (5%) of dark market counterfeits originating from the US. Lastly, counterfeits on dark markets tend to have a relatively low price and sales volume. However, based on preliminary estimations, the equivalent products on the surface web appear to be advertised for a multiple of the prices found for darknet markets. We provide some suggestions on how information about darknet market counterfeits could be used by companies and authorities for preventative purposes, showing that insight gathering from the dark web is valuable and could be a cost-effective alternative (or complement) to border seizures. Thus, monitoring darknet markets can help us understand the counterfeit landscape better.

Introduction

Counterfeits are illicit goods that violate intellectual property (IP) rights such as copyrights, trademarks, design rights, or patents, and they can exist physically or digitally (OECD/EUIPO, 2019; WTO, 1994). The purpose of a deceptive counterfeit is to make a monetary profit by deceiving a customer that the product is of a higher value than it is (e.g., by selling it as being genuine) (OECD/EUIPO, 2019). Deceptive counterfeits can be sold directly to consumers or mixed within supply chains of genuine products to reduce costs and increase profits (Hollis & Wilson, 2014). Counterfeits can cause a variety of problems, such as physical (e.g., through foods or pharmaceuticals) and monetary harms to the consumer, the IP holder (e.g., through damages to the brand value, loss of sales), or the government (e.g., through the loss of tax income) (EMCDDA-Europol, 2017; OECD/EUIPO, 2019). In turn, the sales of counterfeits can support organized crime groups financially and facilitate other illegal activities, such as money laundering (EMCDDA-Europol, 2017; UNICRI & ICC BASCAP, 2013; UNODC, 2014). OECD/EUIPO (2019) estimated that counterfeits made up 3.3% of worldwide trades in 2016, worth USD 509 billion. Furthermore, the proportion of counterfeits seems to be elevated within developed regions, such as the European Union (EU).

However, estimating counterfeit goods' trade (value) is difficult and is mostly achieved through auditing goods seized at borders (OECD, 2018; OECD/EUIPO, 2019). Thus, current estimates often exclude domestically traded counterfeits or digital products, and since not all counterfeits will be seized at ports, estimates of what is traded may be incomplete. For example, the number of routinely checked containers at major ports in Genoa (Italy), Melbourne (Australia), Montreal (Canada), New York (USA), and Liverpool (UK) together only account for 2–5% of all traffic (Sergi, 2022). Since only a limited number of containers can be checked, the selection procedure can strongly impact possible finds.

A theoretical and empirical understanding of how counterfeiting occurs is currently not well developed, perhaps due to the complex involvement of various stakeholders, which results in difficulties for researchers to obtain reliable data (Sullivan et al., 2017). For example, many companies affected by counterfeiting operate across nations, affecting the ease with which authorities can monitor and combat counterfeits. Moreover, the definition of counterfeits varies across nations, further complicating how counterfeiting is measured. However, theories provide perspectives as to why counterfeiting occurs and how it might be addressed. The Rational Choice perspective considers the offender's choice to commit a crime (e.g., counterfeiting a product) and the factors influencing the offender's decisions, such as the perceived risks and rewards (Clarke & Cornish, 1985). The perspective informs us that by changing the perceived risks and rewards of the offender, the likelihood of offending can be altered and reduced, for example, by increasing the perceived risks of detection or by increasing the general efforts needed to commit a crime. Within the context of counterfeits, facilitating the traceability of genuine products within a supply chain (e.g., through watermarks) seems to be a possible approach to increasing the efforts to counterfeit (Gayialis et al., 2022). Another perspective, the Routine Activity Approach (RAA), discussed by Spink et al. (2013, 2014), states that crime is more likely when a suitable target (e.g., a product that can be counterfeited) and a motivated offender converge absent a capable guardian (Cohen & Felson, 1979). Capable guardians can include those involved in security at country borders or those involved in inspecting goods at other stages of the supply chain (Marucheck et al., 2011; Tang, 2006).
For example, when manufactured products are transported, transport personnel and employees could also act as guardians (Hollis & Wilson, 2014). However, effective guardianship requires a clear understanding of the problem and processes to monitor it, such as reporting procedures. Absent this understanding, guardianship will be less effective.

With this in mind, risk assessments are often conducted to aid decisions made by authorities at borders based on intelligence from federal and local authorities and customs officers' experience (Sergi, 2022). However, these are likely to be imperfect. Furthermore, border checks can be random or may only be informed by the country of origin, or how the delivery is labeled, as in the case of parcel shipments (Männistö et al., 2021). Possible bias in check-selection procedures is also suggested by large differences in the estimates of counterfeit product types produced by different agencies (IP Crime Group, 2015; OECD/EUIPO, 2019). For example, estimates strongly differ across agencies for footwear (20 percentage points) or for electronics (11%) and clothing (10%). For other products, estimations may be missing entirely, as in the case of tobacco, which was estimated to make up 28% of all counterfeits by the IP Crime Group (2015) but was not identified as a counterfeited product by OECD/EUIPO (2019). Since different agencies use different data sources (e.g., border or inland seizures), some measurement differences are to be expected, but they also illustrate how inconsistently seizures reflect the true prevalence of counterfeits. Thus, additional data sources to estimate counterfeit-affected products would be helpful to better understand the counterfeit landscape and aid efforts at prevention.

With the emergence of dark markets, which do not have formal guardians in the same way that open web platforms do, new ways of trading illicit goods, including counterfeits, have appeared (Christin, 2013; van Wegberg et al., 2018), which may serve as an additional data source to measure counterfeit prevalence. Dark markets are online shopping platforms on the dark web (a highly anonymized part of the internet that is not indexed by traditional search engines), which operate like their surface web counterparts, such as eBay or Amazon. Vendors on dark markets offer a range of illegal products and services, mainly consisting of drugs, but also including hacking services, weapons, guides on how to defraud people, and counterfeits (Baravalle & Lee, 2018; Roberts & Hernandez-Castro, 2017; Soska & Christin, 2015; van Wegberg et al., 2018). During the COVID-19 pandemic, dark markets also started to offer a mix of genuine and fake protective gear (masks, gloves, etc.), medicines, and COVID-19 vaccines (Bracci et al., 2021a, 2021b; Broadhurst & Ball, 2020). Even with successful disruptions and the closing of markets by law enforcement, dark markets increasingly trade in such products and services (Décary-Hétu & Giommoni, 2017; ElBahrawy et al., 2020; EMCDDA-Europol, 2017). On dark markets, vendors openly sell counterfeits and forgeries, which provides an interesting opportunity to gain insight into the counterfeit market from a new angle. Since some dark markets also register the number of goods sold and buyers leave reviews, we can use such information to generate estimates of sales volumes and the monetary value of counterfeits over time. By comparing counterfeit listings and their sales on dark markets to border seizures, we can also see if they differ and provide a more comprehensive picture, which would be of value to law enforcement, companies that are affected by counterfeits of their products, and policymakers.

Therefore, to better understand the counterfeit economy on the dark web, we examined the prevalence and sales of counterfeits sold on 11 dark markets for the period January 2014–September 2015. Specifically, we quantified the price, volume, type, and origins of advertised counterfeits and estimated their sales volume and the value the same counterfeits would attract on the surface web. We then compared the results to measures and estimations from border seizures conducted by law enforcement over the same period. By highlighting differences, we can identify product groups for which counterfeiting appears to be a problem and would be overlooked based on an analysis of seizures alone.

Fraud and counterfeits on dark markets

Studies that have investigated the types of products listed and sold on the dark web mostly cover illegal drugs, which often account for 60–80% of all listings on a dark market (Baravalle & Lee, 2018; EMCDDA-Europol, 2017). However, some studies have examined less frequently listed products, such as art, wildlife, and plane tickets (Hutchings, 2018; Paul, 2018; Roberts & Hernandez-Castro, 2017). Others focus on fraud-related products or services, such as credit card information, online accounts (e.g., e-bay), social engineering guides and tutorials, or financial malware (e.g., ransomware) (Garg et al., 2015; Marin et al., 2016; Schafer et al., 2019; van Wegberg et al., 2018). Although some of these studies have considered the sale of forged documents (e.g., passports, licenses, diplomas), none have investigated or quantified the sales of counterfeits in a systematic way, such as differentiating between clothing, shoes, electronics, or jewelry; product types which can also be found on surface web markets (e.g., eBay, Amazon). Europol (2017) draws attention to IP crime on the dark web and estimates that solely counterfeit goods make up around 1.5–2.5% of all listings on such markets. The report lists some of the types of counterfeits sold on darknet markets (e.g., clothes, accessories, electronics, jewelry, pirated goods), and discusses the presence of wholesalers, which seem to account for the minority of transactions but most of the sales volume. In contrast, they report that most of the transactions are through the sales of individual items but seem to account for the minority of sales volume. According to this report, counterfeits seem to be sold for 1/3 of the price of the equivalent genuine product, and digital goods for around 1/6 of their original price (Europol, 2017). 
The report concludes that while the sale of IP goods is limited, there is potential for growth on darknet markets, and IP goods on dark markets should be monitored and investigated in more detail. However, the report does not explain how the mentioned statistics were obtained nor which darknet markets were included in the analyses, making it difficult to assess the extent of counterfeits on the dark web. Furthermore, the lack of granularity prevents us from understanding which product types are offered, how frequently, how much they are sold, and where they originate. Lastly, the Europol (2017) report does not differentiate between counterfeits that could be sold on the surface web (e.g., shoes, clothes, electronics) and counterfeits that are limited to the dark web (e.g., fake banknotes or IDs), which is important if we want to inform authorities or companies on potentially affected product types that could be sold on the surface web.

Aims of this paper

With this paper, we aim to address the shortcomings of previous work by examining an extensive collection of dark market datasets to (I) understand the prevalence of counterfeit goods on the dark web and (II) determine the product types, occurrences, and origins of the identified counterfeits. Determining those details will help us (III) report counterfeit prices more accurately (by product types) and make sales volume estimations through product feedback, which can help us better understand the counterfeit economy on the dark web. Subsequently, we (IV) compare dark web counterfeit prices with prices of the same products on the surface web to understand possible profit margins for the various product types identified. We then (V) compare our results to observations made through border seizures, complaint statistics, and activities from authorities to contribute to the overall understanding of the counterfeit economy. Lastly, we (VI) discuss our results in relation to theoretical perspectives to provide future research avenues and possible implications for prevention or intervention approaches for authorities and companies facing counterfeits.

Data

The data used in this study originated from the “Darknet Market Archive”,^{Footnote 1} a collection of 89 markets and associated forums (Branwen et al., 2015) for which data were initially collected between 2014 and 2015 and continuously supplemented thereafter. To facilitate the selection of relevant markets, we cross-referenced the available market data with a list of markets documented by EMCDDA-Europol (2017). Through this comparison, we identified 38 markets (see Additional file 1: Appendix A), each of which operated for at least six months and was captured in the data archive. The reason for including markets that operated for at least six months was to ensure that the markets were able to attract enough vendors and customers, allowing for a broader range of product offers and trades.^{Footnote 2} The market archive contained data on 30 of the 38 identified markets, but five of them contained data spanning less than six months, and data on eight markets did not include a sufficient self-organizing structure (e.g., categories), which would have allowed for the identification of counterfeit goods. For example, some market data contained products (e.g., shoes, handbags) without categorization or a detailed description, making it impossible to determine if they were counterfeits or originals that had been stolen or otherwise illegally obtained. Furthermore, six markets were either highly specialized (e.g., solely carding or marijuana markets) or did not contain any counterfeits. Thus, we included the remaining 11 markets in our study (see Table 1).

Table 1 Markets and their data timeframe in this study

Data filtering

Each market listed a range of products that were not counterfeits (e.g., drugs, services, weapons). Consequently, it was necessary to exclude such listings prior to analysis. To do this, we created a corpus of counterfeit products in two steps.^{Footnote 3} First, we included products that were clearly categorized as counterfeits based on the categories used on the markets, such as “Counterfeit[s]”, “Replica[s]”, “Counterfeit Items”, and “Replica watches”. Second, listings that were not included on this basis were filtered using an advanced keyword search. This search covered 29 other categories of items that, through the manual inspection of the data, were identified as including counterfeits (see Additional file 1: Appendix B for the complete list of the categories). To facilitate the advanced keyword search, we merged the title and description of each listing in those 29 categories, lowercased, tokenized, and stemmed the text, and removed all punctuation. We then searched each merged listing text for 44 stemmed synonyms of the word “counterfeit” (e.g., "fake", "clone"; a complete list is provided in Additional file 1: Appendix C) as well as six negated synonyms of “authentic” (e.g., "genuine", "original"; a complete list is provided in Additional file 1: Appendix D), which were matched as bigrams. Lastly, a list of keywords was used to exclude listings (Additional file 1: Appendix E) that sold templates or tutorials on how to counterfeit. 124,379 listings were clearly marked as counterfeits, while 42,775 listings were identified through the keyword searches, resulting in a total of 158,228 counterfeit listings overall. Of these, 11,633 were completely unique listings for which at least the title, description, and vendor name differed. Text processing was conducted using the Python package “nltk” (Bird et al., 2009).
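The second filtering step can be illustrated with a minimal sketch. It is not the authors' implementation: the keyword tuples below are short hypothetical stand-ins for the lists in Appendices C to E, the negation words are assumed, and simple prefix matching is used in place of the nltk stemming described above so that the example needs only the standard library.

```python
import re

# Hypothetical, abbreviated keyword lists (the full lists are in the appendices).
COUNTERFEIT_PREFIXES = ("counterfeit", "fake", "clone", "replica")
AUTHENTIC_PREFIXES = ("genuin", "original", "authentic")
NEGATIONS = {"not", "non", "no"}            # assumed negation words
EXCLUDE_PREFIXES = ("template", "tutorial")  # listings on how to counterfeit


def tokenize(title, description):
    """Merge title and description, lowercase, and drop punctuation."""
    text = f"{title} {description}".lower()
    return re.findall(r"[a-z0-9]+", text)


def is_counterfeit_listing(title, description):
    """Return True if the merged text matches the counterfeit keyword rules."""
    tokens = tokenize(title, description)
    # Exclusion keywords take priority over everything else.
    if any(t.startswith(EXCLUDE_PREFIXES) for t in tokens):
        return False
    # Unigram match on synonyms of "counterfeit" (prefix match stands in for stemming).
    if any(t.startswith(COUNTERFEIT_PREFIXES) for t in tokens):
        return True
    # Bigram match on negated synonyms of "authentic", e.g. "not genuine".
    bigrams = zip(tokens, tokens[1:])
    return any(a in NEGATIONS and b.startswith(AUTHENTIC_PREFIXES) for a, b in bigrams)
```

Under these assumptions, a listing titled "Leather handbag" with the description "Not genuine leather, looks great" is kept through the bigram rule, while "Fake ID template" is dropped by the exclusion rule even though it contains a counterfeit synonym.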

Categorizing counterfeits

To determine the distribution of product types among those identified as counterfeits, we trained a machine-learning classifier on a subset of human-annotated data. The classifier was then used to predict the categories of the remaining unannotated products. To generate the annotated data, we randomly extracted 2200 unique listings from the counterfeit data set, which participants from the crowdsourcing platform Prolific subsequently annotated.^{Footnote 4} To ensure that we obtained accurate annotations, each listing was annotated by at least three participants. We recruited 220 participants, each annotating 30 listings based on the listing title. Participants provided written informed consent online by clicking all consent statement boxes affirming their consent before taking part in the study. The final category label for each listing was determined using the majority vote.^{Footnote 5} When annotating, participants were presented with an online interface and were required to select one of the following labels: “Watches”, “Handbags”, “Wallets”, “Sunglasses”, “Other accessories”, “Clothing”, “Footwear”, “Articles of leather”, “Fabrics (silk, rugs)”, “Phones”, “Electronics”, “Jewelry”, “Cosmetics”, “Pharmaceuticals”, “Metals”, “Tobacco”, “Forgeries (Money, Coupons, IDs, etc.)”, “Services”, “Other”.^{Footnote 6} We calculated Krippendorff's alpha to determine how much annotators agreed on the labels they generated^{Footnote 7} (Feng, 2015). The value of α = 0.75 demonstrated good agreement (Hayes & Krippendorff, 2007; Krippendorff, 1970).
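As a rough illustration of the majority-vote step, the sketch below derives one label per listing from its annotations. The function name and the tie rule (returning `None` when no label has a strict plurality) are assumptions made for the example; the paper's own handling of ties is described in its footnote and is not reproduced here.

```python
from collections import Counter


def majority_label(annotations):
    """Return the most frequent label, or None if the top labels are tied.

    The tie rule is an assumption for illustration only.
    """
    counts = Counter(annotations).most_common()
    top_label, top_count = counts[0]
    if len(counts) > 1 and counts[1][1] == top_count:
        return None  # no strict winner among the annotators
    return top_label


# Sanity check on the annotation budget stated in the text:
# 220 participants x 30 listings each = 2200 listings x 3 annotations.
ANNOTATIONS_COLLECTED = 220 * 30
ANNOTATIONS_NEEDED = 2200 * 3
```

With three annotators, `majority_label(["Watches", "Watches", "Jewelry"])` yields `"Watches"`, whereas three different labels yield `None` and would need a separate resolution rule.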

From the distribution of categorized products, it was apparent that the product types were not uniformly distributed, with watches representing the majority of all counterfeits annotated. Some of the categories contained very few listings, which would likely affect the classifier's performance. Therefore, before training the classifier, we manually added eight listings to the “Tobacco” category and six listings to the “Cosmetics” category. Table 2 shows the resulting distribution (after manually adding listings) of the labeled categories for the randomly selected subset of counterfeits.

Table 2 Annotated categories within counterfeits

Automated labeling

Inspired by previous research (van Wegberg et al., 2018), we used the annotated listings to train a multiclass classifier to predict the labels of the remaining unlabeled counterfeits. Obtaining labels for all the listings has the advantage of allowing us to conduct our analyses for the whole dataset, including the price or individual texts of the listings, which would be more difficult through estimations from a sub-sample. We generated text features from the merged product title and description to train the classifier. First, we lowercased the texts and removed all punctuation. We then tokenized the text, removed all English stop words, and stemmed the remaining words. Subsequently, we generated part of speech tags, unigrams, and bigrams, which were weighted with a tf-idf (term frequency-inverse document frequency) score. The Python package “nltk” (Bird et al., 2009) was used for all text cleaning and feature generation steps. To increase the classifier's performance, we used a mix of under- and over-sampling methods to balance the number of product listings between the categories. First, the category “Watches” was under-sampled, reducing the number of listings in the sample. This was followed by oversampling of the remaining categories to increase the number of these listings in the sample, resulting in an equal representation between all categories, each consisting of 450 listings. To reduce the number of listings within each category, we randomly selected listings (without replacement) from the data until we reached 450 listings. To increase the number of listings within a category, we used “SMOTE” (Synthetic Minority Oversampling Technique), which synthesizes new unseen data points (Chawla et al., 2002). Such new data is generated by first randomly selecting a listing of that category and finding the k (5) nearest neighbors of that listing within the feature space.
Then, one of the neighbors is selected at random, and a new data point is created at a random point between the two listings in their feature space. Both under- and over-sampling methods were implemented in Python using the package “imblearn” (Lemaître et al., 2017). Next, we utilized the “LinearSVC” classifier with an “l2” penalty (the default regularization parameter used to reduce complexity in the model and avoid overfitting) using a tenfold cross-validation procedure. The under-sampling, over-sampling, training, and testing steps were embedded within a pipeline so that the classifier was trained on the balanced listings (450 in each category) but tested on the unbalanced listings (as in Table 2), ensuring a fair assessment. The test performances were evaluated using the average accuracy, and the weighted average of precision, recall, and F1 scores across all folds, as shown in Table 3. The Python package “scikit-learn” (Pedregosa et al., 2011) was utilized for training, testing, and evaluating the classifier.
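The two balancing steps described above can be sketched in plain Python. This is an illustrative re-implementation under simplifying assumptions (dense feature vectors as lists of floats, Euclidean distance, hypothetical function names), not the “imblearn” code used in the study.

```python
import math
import random


def undersample(rows, target, rng):
    """Randomly keep `target` rows, sampled without replacement."""
    return rng.sample(rows, target)


def smote(rows, target, k=5, rng=None):
    """Grow `rows` to `target` points by interpolating between neighbors.

    Each synthetic point lies at a random position on the line segment
    between a randomly chosen row and one of its k nearest neighbors.
    Requires at least two rows.
    """
    rng = rng or random.Random(0)
    synthetic = []
    while len(rows) + len(synthetic) < target:
        base = rng.choice(rows)
        # k nearest neighbors of the base row (the base itself is excluded).
        neighbors = sorted(
            (r for r in rows if r is not base),
            key=lambda r: math.dist(base, r),
        )[:k]
        neighbor = rng.choice(neighbors)
        gap = rng.random()  # position between base (0) and neighbor (1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbor)])
    return rows + synthetic
```

In the setting described here, `undersample` would be applied to “Watches” and `smote` to every smaller category, each with a target of 450. Because synthetic points are interpolations, they always stay inside the region spanned by existing listings of the same category.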

Table 3 The performance scores (weighted average) across 10-folds

To better understand the classifier's performance for each category, we generated a normalized confusion matrix for all classes (Fig. 1). The matrix shows the cases of true (rows) and predicted (columns) categories of the listings. Thus, the values in the matrix show the proportion of items for which the true class was predicted. The diagonal cells (left-top to right-bottom) indicate the correct proportion for each category.
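A row-normalized confusion matrix of this kind can be computed as in the following sketch, which uses only the standard library; the function name and the toy labels in the usage note are hypothetical.

```python
from collections import Counter


def normalized_confusion(true_labels, predicted_labels, classes):
    """Return a matrix whose rows are true classes and columns predicted classes.

    Each row is divided by its total, so every cell is the proportion of
    listings of the true class that received the predicted class. Rows for
    classes without any listings are left as zeros.
    """
    counts = Counter(zip(true_labels, predicted_labels))
    matrix = []
    for t in classes:
        row_total = sum(counts[(t, p)] for p in classes)
        matrix.append(
            [counts[(t, p)] / row_total if row_total else 0.0 for p in classes]
        )
    return matrix
```

For example, if three of four true “Watches” listings are predicted as “Watches” and one as “Jewelry”, the “Watches” row reads 0.75 and 0.25, and the diagonal value 0.75 is the recall for that category.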

Classification performance was generally good, but we observed that six categories showed low (Cosmetics, Tobacco, Other accessories, Other) or very low (Pharmaceuticals, Services) categorization performance. Since low performances are only present with classes exhibiting few listings in the test set, most of the listings are well categorized, which is also reflected in the weighted performance scores (Table 3). An exception was for the category “Other”, which was also less well categorized despite containing more listings than the other low-performing categories. The category “Other” often contained custom orders, with product titles such as “custom [customer name]”, complicating the annotation process. Since the classifier received additional information from the product description, which was not available to the annotators, it is possible that mismatches between the annotations and product descriptions led to more misclassifications in the category “Other”. For example, some custom orders might have similar descriptions as other counterfeits. Besides custom orders, the category “Other” also included guides, instructions, counterfeit art (e.g., paintings), or cars.

Having established the accuracy of the classifier, we re-trained the LinearSVC classifier with the same parameters on the entire annotated data and used it to label all the unannotated listings. The advantage of re-training the classifier with the entire annotated data (instead of using the best classifier from the cross-validation procedure, which is trained only on a subset of the annotated data) is that all the annotated data can be leveraged for the training, which supports better predictions.

Holding and placeholder prices

Previous studies about dark markets sometimes encountered holding prices, which vendors use to mark out-of-stock listings, preventing their removal from the market (Soska & Christin, 2015; van Wegberg et al., 2018). Some holding prices are very high to prevent anyone from buying the product. The advantage of a holding price is that vendors can keep showing customers what was sold and what might be coming back in stock. However, when estimating price or sale volumes on markets, holding prices with very high values can distort the actual results. Therefore, we used a heuristic proposed and used by others (Soska & Christin, 2015; van Wegberg et al., 2018) to replace high holding prices (≥ 10,000 USD) with the original price (if available) or to remove them. In addition, we also looked at listings with very low prices (≤ 5 USD) and found that such prices were mostly not the actual selling price and seemed to function as placeholders too. For example, many listings with a price of 0 need further specifications by the customer (often instructed in the listing description), such as amounts, colors, or shipping, which affects the final price. However, during the data scraping process, it is mostly the placeholder price that is collected rather than the individual price variations. In some instances, vendors listed the variations of the products in separate listings and later merged them into a single listing with the option of making the wanted changes (color, amount, etc.) or vice versa. In such cases, we can determine the average price of such a merged listing to get a more accurate representation of the product price. For listings with a holding or placeholder price, we searched for the same product from the same vendor to find a replacement price.
Table 4 shows the distribution of found and replaced holding and placeholder prices.^{Footnote 8} Products with a high holding price for which we did not find replacements were excluded from further analyses of the value of the goods.
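The price-cleaning heuristic can be sketched as follows. The thresholds come from the text, but the record layout (`vendor`, `product`, `price` keys), matching on an exact `product` key, averaging over all plausible prices, and keeping unresolved low prices unchanged are assumptions made for this example.

```python
from statistics import mean

HOLDING_MIN_USD = 10_000    # prices at or above this are treated as holding prices
PLACEHOLDER_MAX_USD = 5     # prices at or below this are treated as placeholders


def is_suspect(price):
    """True for holding (very high) or placeholder (very low) prices."""
    return price >= HOLDING_MIN_USD or price <= PLACEHOLDER_MAX_USD


def clean_prices(listings):
    """Replace suspect prices with the vendor's plausible price for that product."""
    # Collect plausible prices per (vendor, product) pair.
    plausible = {}
    for item in listings:
        if not is_suspect(item["price"]):
            key = (item["vendor"], item["product"])
            plausible.setdefault(key, []).append(item["price"])

    cleaned = []
    for item in listings:
        if not is_suspect(item["price"]):
            cleaned.append(dict(item))
            continue
        candidates = plausible.get((item["vendor"], item["product"]))
        if candidates:
            # Use the average of the plausible prices as the replacement.
            cleaned.append({**item, "price": mean(candidates)})
        elif item["price"] <= PLACEHOLDER_MAX_USD:
            # Assumption: unresolved placeholders are kept unchanged.
            cleaned.append(dict(item))
        # Unresolved holding prices are dropped from the value analysis.
    return cleaned
```

Under these assumptions, a vendor who lists the same watch at 120, 80, and 99,999 USD ends up with the third listing priced at 100 USD, while a 50,000 USD listing with no plausible counterpart is excluded.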

Table 4 Number of found and replaced holding and placeholder prices and the average price of all replacements

Results

This section looks at the data for all products and counterfeits and their distribution across markets. We then focus on counterfeit product types and product origins and compare our measures with estimates from audits of goods seized by law enforcement at borders. Lastly, we evaluate the monetary value of offered and sold counterfeits and the generated sales volume of vendors.

Product offers and counterfeit prevalence

Figure 2 shows how many products (not just counterfeits) were offered across all markets over time. The volumes shown are monthly and contain all available products on the dark markets. For most markets, the data range between January 2014 and September 2015, but the data for the market Alphabay extends to January 2017. Evolution and Agora offered the most products, followed by Alphabay, Abraxas, BlackBank Market, and Cloud 9. The remaining markets seem to have offered only a minimal number of products and for shorter periods. Reasons for this variation differ. For example, some markets were closed down by law enforcement (Cloud 9, Alphabay), closed down voluntarily (The Marketplace, Agora), experienced an exit scam^{Footnote 9} (Evolution, BlackBank Market, Andromeda, Middle Earth Marketplace, Abraxas), or were hacked (EMCDDA-Europol, 2017).^{Footnote 10} However, scraping data from dark markets can also be unstable, leading to gaps in the data (Ball et al., 2019; Du et al., 2018; Ghosh et al., 2017; Buskirk et al., 2016). Thus, we can only capture a partial picture of overall events, probably leading to underestimating the availability of products on dark markets and their value.

Figure 2 also shows the monthly volume of products offered across all markets combined (gray line). Overall, product offerings seem to increase steadily, with a sharp peak at the beginning of 2015 with almost 100,000 listings. Offers then declined sharply, with only a few products on offer from mid to late 2015, followed by a slow increase for the remaining time, the latter solely attributed to Alphabay. To make estimates of counterfeits more comparable across markets, we subsequently focus on the timeframe for which most markets had at least some listings on their platforms: January 2014 to September 2015.

Focusing on counterfeits (Fig. 3), we see a similar overall trend (gray line). However, as expected, the overall number of offers is much lower, with counterfeits accounting for around 2.69% of all listings across markets. Interestingly, the observed proportion of counterfeits on dark markets is close to the estimated overall proportion of counterfeits worldwide (3.3%) discussed above (OECD/EUIPO, 2019). Furthermore, only nine of the eleven markets seem to offer counterfeits, with Agora and Evolution offering the most, followed by BlackBank Market, Alphabay, and Middle Earth Marketplace. The remaining markets seem to harbor only a minimal number of counterfeits. Most offers seem to occur between the beginning and the middle of 2015.

Counterfeit product types and occurrences

Focusing on counterfeit product types (Table 5), we observe that watches make up the majority of all counterfeits (59%) listed on the markets, followed by four categories, each of which accounts for between 4 and 6% and which collectively account for around 20% of all counterfeits. Most of the remaining categories contribute only a little, with most representing only 2% or less of all counterfeits. Thus, almost 80% of counterfeits listed were represented by only five (of the 16) categories of products.

Table 5 Percentage of counterfeit categories; not all categories are shared by the reports; see Additional file 1: Appendix F for separate and complete lists of counterfeit categories by OECD/EUIPO (2019) and IP Crime Group (2015)

By comparing our measures of the types of counterfeits to goods seized at borders, we can identify how products differ and discuss possible contributing factors to those differences. Based on a report by OECD/EUIPO (2019), which summarizes findings regarding seized counterfeits between 2014 and 2016, we see that not all categories represented on dark markets are also present in seized goods (Table 5). Also, the distribution of counterfeits found on dark markets and seized products varies greatly. In addition, sunglasses, handbags, and other accessories, which make up around 10% of counterfeits on dark markets, are not listed individually in the report but are grouped within headgear (1.5%), miscellaneous (0.4%), and articles of leather (13.4%). The remaining categories show a similar distribution (OECD/EUIPO, 2019).

Another report by the Intellectual Property Office (IPO) in the United Kingdom shows a different picture of IP and counterfeit-affected product categories (IP Crime Group, 2015). The report summarizes independently reported IP crimes through Crimestoppers^{Footnote 11} and investigations of counterfeits by Trading Standards (TS)^{Footnote 12} between 2014 and 2015. The top five reported and investigated IP crimes were Tobacco, optical media, clothing, alcohol, and footwear. Although watches, jewelry, cosmetics, and electronics were also within the top 17 affected categories, they seem to be less prominent than on dark markets and attracted fewer investigations by TS (Table 5). The differences observed for Tobacco, Footwear, Electronics, Clothing, and Watches are further examined in the Discussion section.

Counterfeit origins

Next, we examine the shipping origins of products as indicated on the product listings. Figure 4 shows the percentage of shipping origins for all products and counterfeits across all markets. All countries that accounted for 1% or less are aggregated into the category “Other”. While possible shipping destinations are included in the listing data, we did not analyze these as most destinations are listed as “Worldwide” or “Undeclared”, providing only limited information. The distribution of the shipping origins for all products seems to differ from that for counterfeits. However, “Undeclared” takes up a considerable portion in both cases. While most products seem to originate from the USA, most counterfeits are from China, including Hong Kong. “Other” contained mostly European countries (e.g., Italy, France, Poland, Portugal) but also a range of Asian countries (India, Thailand, Singapore, Cambodia) and others (e.g., Afghanistan, Chile). The category “EU” (Europe) is not an aggregation we generated but was indicated on some products. Thus, for those products, we cannot say which European countries they originate from specifically.
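
The aggregation rule described above (origins with a share of 1% or less are folded into “Other”) can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `origin_shares`, the input format (one origin label per listing), and the sample labels are assumptions made for the example.

```python
from collections import Counter

def origin_shares(origins, threshold=1.0):
    """Return percentage shares per origin label.

    Labels whose share is at or below `threshold` percent are summed
    into a single "Other" entry (hypothetical helper for illustration).
    """
    counts = Counter(origins)
    total = sum(counts.values())
    shares = {}
    other = 0.0
    for origin, n in counts.items():
        pct = 100.0 * n / total
        if pct <= threshold:
            other += pct          # small origins are pooled
        else:
            shares[origin] = pct  # larger origins are kept as-is
    if other > 0:
        shares["Other"] = other
    return shares

# Hypothetical listing origins, one label per listing.
sample = ["China"] * 60 + ["USA"] * 25 + ["Undeclared"] * 14 + ["Chile"]
shares = origin_shares(sample)
```

In this sketch the threshold is applied to every label, including “Undeclared”; whether non-country labels were exempted in the original analysis is not stated in the text.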

Table 6 shows the association between particular types of counterfeit goods and the country they were listed as originating from. Each row shows where listings for the product category originated. Countries that accounted for less than 10% of the listings were aggregated into the category “Other”. For example, 74.92% of counterfeits categorized as footwear originated from China and 25.08% from “Other”. The countries of origin are mutually exclusive, and so the row totals sum to 100%. As previously indicated, China is well represented, contributing to many categories. For Cosmetics, Electronics, Pharmaceuticals, and Services, additional countries previously included in the “Other” category are now visible and seem to specialize in supplying one particular type of counterfeit. However, for some counterfeits, the “Other” category accounts for a substantial fraction of counterfeits indicating that in these cases, the products originate from a large number of countries. In addition, only six categories (Footwear, Clothing, Cosmetics, Pharma., Tobacco, and Watches) seem to have a rate of undeclared origins of below 20%, possibly indicating that many sellers are concerned about giving up too much information by indicating a product origin. Since Belgium is a relatively small country, we examined the services that seemed to originate from there more closely. Of the 195 unique offered services for all counterfeits, 11.86% originated from Belgium (BE), representing 23 listings. All services originating from Belgium were listed by a single vendor and included digital goods, such as Facebook likes, guides about making money, using Tor, and hacking. Since digital goods are not reliant on physical transportation, their origin might only inform us about the possible residence of the individual offering such goods. 
We also manually examined the products sold by Austria (AT), Australia (AU), Thailand (TH), Afghanistan (AF) and Germany, as they each sold items of one product category that accounted for at least 10% of the total. Australia and Germany both contribute to the sales of pharmaceuticals. Two vendors from Australia contributed nine listings to pharmaceuticals, selling drugs (that were misclassified), but also fake drugs (presumably used for dilutions to increase profit margins), and measuring syringes. Three vendors from Germany contributed five pharmaceutical listings, which were misclassified chemical drugs, showing that product descriptions from drug listings and pharmaceutical counterfeits can be very similar. Two vendors from Thailand offered 48 electronics, mostly counterfeited smartphones and headphones. The 41 handbags that originated from Afghanistan were mostly imitations of Louis Vuitton handbags. The 16 cosmetics that originated from Austria were counterfeit perfumes from Chanel and Luca Bossi.

Table 6 Percentage of counterfeit shipping origins by country and product category; percentages are split by countries and aggregate to 100% for each row

In contrast to the differences observed for counterfeit products seized at borders and offered on dark markets, product origins seem to match better across data sources. For example, between 2014 and 2016, seized goods mainly originated from China (55%) and Hong Kong (26.2%) (EUIPO, 2019; OECD/EUIPO, 2019). However, seized goods also originated from the United Arab Emirates (3.8%), Turkey (3.1%), Singapore (2.8%), Thailand (1.4%), India (1%), and other countries (each with less than 1%) (OECD/EUIPO, 2019). In contrast, for the dark markets, counterfeits were either not explicitly offered from these countries (e.g., Singapore, Thailand, India), or they accounted for less than 1% of the listings. Interestingly, the USA seems to account for 5% of counterfeits on dark markets while only accounting for 0.4% in seized goods.

Counterfeit prices, sales volume, and surface web prices

Lastly, we summarized counterfeit prices for each category (Table 7), estimated vendor sales volumes (Table 8 and Fig. 5), and examined the price differences of products offered on darknet markets and the surface web (Table 9, Fig. 6).

Observed counterfeit prices

Table 7 shows the prices for all counterfeit listings (offers) as customers can see them on the markets. Prices are expressed in USD and are based on all counterfeit listings at the time the listing was posted.^{Footnote 13} The total price volume represents the accumulation of all prices from all unique counterfeits for each category (i.e., the total value if each listed item were sold exactly once). The total price volume of all unique counterfeits from Jan-2014 to Sep-2015 is around 1.8 million USD. Many maximum prices of each counterfeit category are high, which can often be attributed to wholesale offers. The highest observed mean price is for metals, including collectible gold and silver coins or bullion, while the lowest is for sunglasses. With watches making up most listings, they also hold the highest volume, around 1 million USD. Minimum prices of 0.00 are mostly placeholders rather than free products and are often used to prompt the user to select an amount, color, model, and so on (see above). Both Metals and Pharmaceuticals show high standard deviations, which can be attributed to a few very high-priced listings, for example, “28 g PSEUDO SPEED” for $2000 or “Lot of 10 High Quality Counterfeit Gold Bars” for $5799.
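
As a rough illustration of the per-category summary described above, the sketch below computes the price statistics for one category from a list of unique listing prices. The function name `summarize_prices` and the sample prices are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not state which variant was reported.

```python
from statistics import mean, median, stdev

def summarize_prices(prices):
    """Summarize unique listing prices (USD) for one product category.

    "total" is the total price volume: the value if every listed item
    were sold exactly once. Requires at least two prices for the SD.
    """
    return {
        "n_listings": len(prices),
        "min": min(prices),
        "max": max(prices),
        "median": median(prices),
        "mean": mean(prices),
        "sd": stdev(prices),   # assumption: sample standard deviation
        "total": sum(prices),
    }

# Hypothetical prices; 0.0 stands for a placeholder listing.
summary = summarize_prices([0.0, 40.0, 60.0, 100.0])
```

Placeholder prices of 0.00 are kept in this sketch, which is why they can appear as category minimums.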

Table 7 Summary counterfeit prices and volumes for each product category in USD

The last two rows show the prices' mean, total, and weighted mean. Specifically, in the “Mean/Total” row, each USD column (Min, Max, Median, etc.) is averaged by dividing the sum of all product category prices by the number of product types, while solely the column “# Listings” is totaled. The weighted mean is the result of taking the sum of the product of the category price and the number of listings of the same category, divided by the total number of listings. Thus, each mean is weighted by the number of listings available in each product category.
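
The difference between the two summary rows can be made concrete with a small sketch. The function name `summary_rows`, the dictionary keys, and the example figures are hypothetical and only illustrate the arithmetic described above.

```python
def summary_rows(categories):
    """Compute the "Mean/Total" and weighted-mean values for one price column.

    `categories` is a list of dicts with a category-level price statistic
    ("price") and the number of listings in that category ("n_listings").
    """
    n_categories = len(categories)
    total_listings = sum(c["n_listings"] for c in categories)
    # Unweighted mean: every category counts equally.
    unweighted = sum(c["price"] for c in categories) / n_categories
    # Weighted mean: each category counts in proportion to its listings.
    weighted = sum(c["price"] * c["n_listings"] for c in categories) / total_listings
    return unweighted, total_listings, weighted

# Hypothetical two-category example.
rows = [
    {"price": 200.0, "n_listings": 300},
    {"price": 50.0, "n_listings": 100},
]
unweighted, total_listings, weighted = summary_rows(rows)
```

With these made-up numbers the unweighted mean is 125.0 while the weighted mean is 162.5, showing how a category with many listings pulls the weighted figure towards its own price.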

Estimated counterfeit sales volumes

As in previous research (Soska & Christin, 2015; Wegberg et al., 2018), we utilized the total number of feedback comments provided for each listing to estimate how often an item was sold. Since the data were scraped recurrently, listings and their associated feedback were collected cumulatively, adding old and new feedback for every scrape completed. To avoid duplication, we only analysed unique items of feedback. The number of unique feedback items was then multiplied by the product's listing price on the darknet market (PP_DM) to obtain an estimated sales volume in USD (Table 8). Although some markets made feedback mandatory in the past, we do not know how the markets in this analysis regulated feedback and whether fake reviews were moderated, resulting in some uncertainty as to whether the review count is an under- or over-estimation of sales. However, based on the current estimates, most sales were for watches, followed by “Other” (6.50%) and Forgeries (5.96%).
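
The estimation procedure above can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' implementation: the function name `estimated_sales_volume`, the representation of feedback as (listing id, feedback id) pairs, and the example identifiers and prices are all hypothetical, and how feedback uniqueness was determined in the original analysis is not specified in the text.

```python
def estimated_sales_volume(feedback_rows, listing_prices):
    """Estimate sales volume (USD) per listing from feedback counts.

    feedback_rows: iterable of (listing_id, feedback_id) pairs pooled
        across all scrapes; repeated scrapes may contain the same pair.
    listing_prices: dict mapping listing_id to its darknet price (PP_DM).
    """
    unique_feedback = set(feedback_rows)  # drop feedback seen in several scrapes
    counts = {}
    for listing_id, _feedback_id in unique_feedback:
        counts[listing_id] = counts.get(listing_id, 0) + 1
    # One unique feedback item is treated as one sale at the listing price.
    return {lid: n * listing_prices[lid] for lid, n in counts.items()}

# Hypothetical data: the second scrape repeats the feedback of the first.
scrape_1 = [("w1", "f1"), ("w1", "f2")]
scrape_2 = [("w1", "f1"), ("w1", "f2"), ("w1", "f3"), ("s1", "f9")]
volume = estimated_sales_volume(scrape_1 + scrape_2, {"w1": 150.0, "s1": 20.0})
```

Treating each feedback item as exactly one sale is the simplification discussed in the Limitations; bulk purchases and missing or fabricated feedback are not captured.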

Table 8 Estimated sales volume (USD) for each category based on the number of feedbacks

Considering the monthly sales volume by category (Fig. 5), we observed a similar trend for available listings (Fig. 3), with most sales occurring between mid-2014 and mid-2015. We also observed two peaks in sales in mid-2014 and mid-2015. Again, watches are represented most, followed by forgeries and “Other”.

Comparing these figures to seizures at borders, a report by the OECD/EUIPO (2019) found that the largest value share for goods seized at borders was for watches (22.9%), followed by leather articles (11.6%), electrical equipment & machinery (10.8%), footwear (10.5%), clothing (8.2%), jewelry (5.9%), cosmetics (4.9%), toys (4.6%), optical/photographic & medical instruments (4.1%), mechanical appliances (1.5%), vehicles (1.4%), and other products (less than 1%). Although watches seem to account for the most value in dark markets and border seizures, the concentration of watches is much more pronounced for the dark markets, where they account for over 68% of the estimated sales volume. Thus, seized goods appear to show a more equally distributed range of values across products than is observed on the darknet markets. Categories listed for seized goods, such as machinery, toys, medical instruments, and appliances, did not appear to be explicitly sold on dark markets, probably contributing to the skewed product distribution observed there.^{Footnote 14}

Dark and surface market prices

In addition, we sampled ten darknet market products from each category and determined their price on the surface web (Table 9). For 25 products, we determined the historical price on the surface web by utilizing a product price comparison site (geizhals.eu)^{Footnote 15} which records the complete price development over the product’s lifespan.^{Footnote 16} We determined the current price using Google Shopping (shopping.google.com) for the remaining products.^{Footnote 17} When possible, we used the prices from original brand stores (e.g., Hermes, Louis Vuitton, Gucci, etc.) but selected prices from other shopping platforms if the products were no longer manufactured or were not otherwise listed. For 46 dark market products, we found the exact match on the surface web, while for the remaining listings, we selected the next best match from the same brand.^{Footnote 18} For the category Metals, we adjusted for the different weights indicated on the listings (e.g., 10 oz, 1 g, 1 kg) by extrapolating the cost for 1 oz for each listing, making a comparison possible. We excluded products from the categories Services, Forgeries, Pharmaceuticals, and “Other” since most of these products cannot be purchased on the surface web.^{Footnote 19} Product prices in Euro were converted into USD based on the conversion rate on the price date. Since we selected only ten random samples for each product category, the estimated price differences are only intended to illustrate the observed trend and should not be regarded as a complete analysis.
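
The weight adjustment for Metals can be illustrated with a short sketch that rescales a listing price to the cost of 1 oz. The function name `price_per_oz` and the example listings are hypothetical, and the use of the troy ounce (31.1034768 g), the customary unit for precious metals, is an assumption; the text does not state which ounce was used.

```python
# Assumption: "oz" is the troy ounce used for precious metals.
GRAMS_PER_UNIT = {"g": 1.0, "kg": 1000.0, "oz": 31.1034768}

def price_per_oz(price_usd, weight, unit):
    """Extrapolate a listing price (USD) to the price of 1 oz.

    `weight` and `unit` describe the quantity stated on the listing,
    e.g. (10, "oz"), (1, "g"), or (1, "kg").
    """
    grams = weight * GRAMS_PER_UNIT[unit]
    return price_usd * GRAMS_PER_UNIT["oz"] / grams

# Hypothetical listings with different stated weights.
per_oz_a = price_per_oz(500.0, 10, "oz")   # 10 oz lot
per_oz_b = price_per_oz(1000.0, 1, "kg")   # 1 kg lot
```

Once all listings are expressed per ounce, darknet and surface web prices for metals can be compared on the same scale.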

Table 9 Mean [SD] USD of 10 sample products for each category on surface markets

To better understand the relationship between darknet markets and surface web prices, we plot one against the other in Fig. 6. Across all product categories, products are more expensive on the surface web, but prices between and within categories vary considerably. Prices between darknet markets and the surface web are closest for cosmetics (for which the mean ratio was 2.22) and most different for watches, which were, on average, 147.23 times more expensive on the surface web than on the dark web.
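
A mean price ratio of the kind reported above could be computed as sketched below. The function name `mean_price_ratio` and the sample price pairs are hypothetical. The sketch assumes the reported figure is the mean of per-product ratios (surface price divided by darknet price) rather than a ratio of category means, which the text does not specify, and it skips darknet prices of zero on the assumption that they are placeholders.

```python
from statistics import mean

def mean_price_ratio(pairs):
    """Mean of surface-to-darknet price ratios for one category.

    `pairs` is a list of (darknet_price, surface_price) tuples in USD.
    Pairs with a darknet price of zero are skipped (assumed placeholders).
    """
    ratios = [surface / dark for dark, surface in pairs if dark > 0]
    return mean(ratios)

# Hypothetical sample of matched products.
ratio = mean_price_ratio([(10.0, 20.0), (50.0, 150.0)])
```

A ratio above 1 means the product is advertised at a higher price on the surface web than on the darknet market.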

Discussion

Insights about counterfeits typically originate from data on goods seized at borders by law enforcement agencies. As discussed, these data are not collected through random sampling or other approaches that would ensure that the findings are representative of the ground truth. Instead, they are subject to various biases associated with the intelligence that law enforcement agencies collect or have access to, or the policies followed at borders. This means that our understanding of what is counterfeited is likely to be biased. Further indications of possible biases can be found in the prevalence estimation differences for various agencies (IP Crime Group, 2015; OECD/EUIPO, 2019). Given that IP crime is known to be increasing (Federal Bureau of Investigation, 2014, 2015, 2016; OECD/EUIPO, 2019), it is important to understand the counterfeit economy better, and in this study, we examine what insights the analysis of data regarding the availability and price of counterfeits on dark markets might provide.

Product categories

The current study suggests that the share of counterfeits on dark markets (2.69%) seems to be slightly above previous expectations, which were around 1.5–2.5% (Europol, 2017). We also see differences in some product categories observed during seizures and counterfeits offered on dark markets. As already described, seized products are most likely biased through the activities and procedures adopted by authorities, affecting estimations of which product types are affected. Examining the counterfeit categories, we see that watches account for most of the value in both cases but are more prominent on dark markets overall. Watches might be more challenging to identify or detect as counterfeits than other products (e.g., shoes, clothes, Tobacco) in seizures, perhaps because their very high profit margins justify an increased effort to make fake watches more difficult to identify. Alternatively, watches might be less prone to bulk shipments and make their way through borders differently than other items (e.g., single parcel shipments through the air versus containers at ports). Hence, watches might be shipped more diversely, possibly going through different security measures and being more difficult to catch overall. However, single parcel shipments might only be worthwhile for high-value items, such as watches, but less profitable for items that need high-volume sales.

Other strong differences in prevalence estimates are also of interest, such as those for Tobacco, Footwear, Electronics, and Clothing. For some product groups, estimations from authorities are missing entirely (e.g., sunglasses, handbags, accessories, wallets, metals, Tobacco). In particular, Tobacco seems to make up only 0.24% of counterfeits on dark markets; it is missing in estimations by OECD/EUIPO (2019) but is highly represented in measures by IP Crime Group (2015). Vendors on the dark market might favor high-value products, possibly tailoring their offers more towards end-consumers than other businesses. Thus, Tobacco might be more difficult to sell in high volumes on darknet markets. Similarly, OECD/EUIPO (2019) measured relatively high ratios of Footwear, Clothing, and Electronics, which are far less prevalent on the dark market. Again, such differences might originate from biases in selecting shipments for inspections but also illustrate the current issue of inconsistent measurements capturing what is being counterfeited. It is important to note that the authorities also seized counterfeits that are missing on darknet markets, such as vehicles, furniture, or alcohol, which can distort the ratio of product groups (see Additional file 1: Appendix F for a full list of seized goods).

Product origins

Seized and dark market counterfeits mostly seem to originate from China and Hong Kong. However, some uncertainty surrounds the information about the origins of dark market counterfeits since providing this information is voluntary, and a large portion is undeclared (see Limitations). Nonetheless, the stark outlier in product origins of seized goods and product offers on dark markets is the US. Around 5% of dark market counterfeits were listed as originating from the USA, while only 0.4% of goods seized at borders come from the US. Again, such a discrepancy might be due to biased expectations by law enforcement, as searches are sometimes based on shipment origins (Männistö et al., 2021). Thus, border seizures might miss counterfeits originating from countries suggested by dark markets, such as the US. For example, Tobacco, pharmaceuticals, metals, electronics, and accessories (e.g., sunglasses) could be scanned for counterfeits when originating from the US. Similarly, cosmetics seem to originate from Austria more frequently, and pharmaceuticals from Australia. Alternatively, counterfeits from the US might be more heavily purchased domestically, leading to limited exportation, which would avoid border controls. Moreover, dark market listings represent the availability of a product rather than its actual supply. Although knowing from which country counterfeits are available is helpful, products must be purchased first and subsequently shipped to be found at a border. Thus, estimation of product origins from dark markets and measures of seized goods might also vary because they capture products at different supply chain stages.

Vendor sales volume and product values

Similarly, estimating the sales volume and monetary value of counterfeits on dark markets is accompanied by uncertainty, which is further addressed in the next section (Limitations). However, we can see that the estimated sales volume generated for counterfeits on dark markets seems very small compared to the possible value of the items on the surface web. Europol (2017) estimated that most physical counterfeits on dark markets are sold for one-third of the actual price. Based on the current study, the discrepancy between counterfeit prices and their actual values on the surface web is more varied and can be twenty times larger (e.g., for watches). Such differences suggest that the prices and possible sales volumes depend highly on the product category. However, the current price differences illustrate that purchasing darknet market counterfeits and selling them on the surface web could lead to considerable profits. Thus, it might be helpful to focus the attention of authorities on highly valuable counterfeits, such as watches, clothing, or jewelry, as they seem to generate the biggest profits. Notably, relative to the patterns observed for darknet markets, watches were underrepresented in the estimates based on seizures, and metals were not featured at all.

We can also see greater differences between dark and surface web prices for higher-value products, such as watches, clothes, and jewelry. Dark market vendors might prioritize higher-valued products, which can generate profits faster than products with lower profit margins (e.g., accessories, Tobacco). Such a strategy would support the idea that darknet market vendors might tailor their products more towards end-consumers, who purchase fewer items, rather than businesses, which could purchase items in high volumes with the purpose of re-selling them. In other words, lower profit margin products need higher turnovers for high profits, which is facilitated by business-to-business transactions.

Possible preventative measures

Since the darknet market counterfeits identified here were fully manufactured consumer products, for them to enter the supply chains of legitimate retailers, the latter would either have to sell them knowingly, or they would have to be introduced during distributional processes so that they are mixed with genuine products without the retailer’s knowledge. Counterfeits could be introduced during packaging, distribution to wholesalers, retailers, or any other transportation process. As Hollis and Wilson (2014) discuss, addressing the problem in cases where companies have been misled would involve improvements to guardianship in risky parts of the supply chain. Companies could be provided with information about which products are affected and from which country they originate to facilitate their efforts to identify risks in their supply chain. For example, informing personnel who are responsible for overseeing the distribution of an affected product (which is being counterfeited) could help them to implement or re-evaluate their internal working processes to reduce the risk of counterfeits entering their supply chain and to increase the risk of discovery for the counterfeiters (Hollis & Wilson, 2014). Such implementations could include raising employee awareness of the affected products, implementing reporting mechanisms, or introducing additional validation checks for particular product types for specified periods of time. To aid in this activity, darknet market data, searchable by brand, could be made accessible to companies. Since product information is quite detailed, an implementation with up-to-date darknet market data is feasible.

Another issue concerns the leaking of product designs. One approach to help address this would involve the identification of products that are found to be offered on darknet markets before their official release on the surface web. Knowing that plans were shared would help companies narrow down which processes would have to be reviewed and where measures should be put in place to ensure adequate guardianship. Such measures might involve limiting access to project plans to only those who need to know about them (to minimise insider threats) and ensuring that all data are secure (to minimise external threats). While some cyber security and brand protection organizations advertise dark web monitoring to detect data leakages, such as personal data, to what extent they track counterfeits is unclear (Corsearch, 2023; Lenaerts-Bergmans, 2023).

Other approaches to counterfeiting might involve one or more of the 25 techniques of situational crime prevention (Clarke, 1995; Freilich & Newman, 2018), which are also informed by Rational Choice theory and the RAA. One such technique is target hardening, which aims to make the target of an offence (e.g. counterfeiting a product) less viable for the offender. Knowing which counterfeits are offered on darknet markets could help companies to make those products more difficult to counterfeit. For example, companies could change the used materials or manufacturing process to increase the efforts of imitating the product. Traceability of genuine products within a supply chain would also fall within that category, as it increases the efforts needed to counterfeit them, which could be technologically facilitated (Gayialis et al., 2022). Alternatively, the offenders' rationalisation for committing a crime could be challenged by removing possible excuses for their actions. Removing excuses includes approaches such as setting up rules or posting instructions to reduce ambiguity in situations that can be exploited. Such strategies could be helpful to deter employees in situations in which they could act maliciously (e.g., stealing plans, reintroducing counterfeits, sharing manufacturing or packaging plans, etc.) by reminding them what actions are disallowed or how specific work tasks should be performed (Freilich & Newman, 2018).

Future studies

Given the results of this study, it would be interesting to examine if and how such information about counterfeits on dark markets can be utilized as intelligence for law enforcement activities, companies, or policymakers. Since darknet markets and single vendor shops are continuously growing, data concerning counterfeit listings are likely to increase too (Labrador & Pastrana, 2022; Laferrière & Décary-Hétu, 2022; Platzer et al., 2022). Thus, the monitoring of darknet market counterfeits might also be increasingly valuable. Besides validating findings from seized goods, dark markets could serve as indicators of early trends for the onset of activities on the surface web. For example, future work could establish a monitoring system that collects dark market counterfeits data. Such a dataset could be used to search for dark market products on the surface web (e.g., Amazon, eBay) to establish if the same or similar products are sold across platforms. Furthermore, a longitudinal study could explore temporal trends, and examine if products tend to appear first on the dark web and subsequently find their way to the surface web. A common problem with research concerning dark markets is the accurate estimation of sales value. While utilizing customer feedback to make informed estimations is helpful, future work could explore if it is possible to exploit transactional data associated with cryptocurrencies used by dark markets (ElBahrawy et al., 2020; Nadini et al., 2021) to complement sales assessments of specific products.

Limitations

The data analysed here misses some bigger markets, such as the first Silkroad, Hydra, Empire, Hansa, Wall Street, and Sheep. The reasons for their exclusion were that they were not included in the data archive or lacked sufficient product categorization needed for the current analyses. The data also does not cover possible user-to-user transactions, which bypass the markets altogether (Nadini et al., 2021). Thus, the findings reported here do not reflect the entire dark market economy, just the activity recorded for those markets sampled. Furthermore, the present analyses utilized historical data without newer scrapes (see ElBahrawy et al., 2020), limiting some of the possible contemporary policy and prevention implications. However, previous work has not provided us with an understanding of how extensively counterfeits are present on darknet markets, and re-using existing data in the current study serves as a proof of concept, showing that darknet market data can be valuable in understanding the counterfeit economy better.

Furthermore, a general problem related to research with dark markets is that the data collection procedure is constrained by the scraping process and individual platform closures. Both can lead to gaps in the available data and make exact measurements of dark market activity difficult. The scraping process can be disrupted due to the slow connection of the Tor network, security measures of the website that are implemented to hinder automated data collection (e.g., required log-ins due to set session time-outs or solving recurring captchas), or temporary website closures. Thus, the overall number of observed listings and associated estimations will be more uncertain, making general conclusions more difficult.

While comparing seized counterfeits to dark market counterfeits can help us understand how the two areas relate to each other, the comparison is only partly applicable. Dark market listings are offers, while seized products may already have been sold. Although seized products can also inform us about offers, they are only a subset of sold counterfeits from the overall market. Thus, comparisons of dark market listings with seized goods are informative, but they do not always encompass the same measures.

Similarly, uncertainties are present with shipping information and feedback associated with dark market listings. It is voluntary for vendors to make product origin declarations, and many choose not to do so. Nonetheless, many declared origins are in line with the origins of seized goods, providing us with some confidence in our measures. Also, information on postage times and possible tracking numbers is highly valued amongst customers, often referred to in feedback, making a genuine declaration of origin more attractive to vendors. Therefore, we cannot say how accurate product origin declarations are, but some incentives exist for vendors to make truthful indications.

As for product feedback, we cannot always know whether it is mandatory and whether the feedback is for a single or bulk purchase. Thus, the calculated sale volumes are approximations and will come with a general uncertainty because not all purchases will have produced feedback, one instance of feedback for a bulk purchase might be counted as a single purchase, or feedback could be artificially created to generate trust (Dellarocas, 2006). Similarly, our value estimation process should be treated with caution. Taking ten random samples for each product category will produce only rough estimates and was only intended to illustrate the estimated difference between prices on darknet markets and the surface web. Furthermore, a historic price could not be obtained for all product samples, and prices can vary considerably over time (e.g., original soccer shirts or Nike shoes), influencing estimations.

Conclusion

Based on the analyzed darknet market data, we can say that counterfeit goods are rare (2.69% of all products) on dark markets and are often included in miscellaneous categories. Thus, accurately measuring the prevalence of counterfeits across the dark web is difficult. However, we disentangled product categories using a classification model, allowing for a more in-depth analysis. We showed that some product types exhibit a strong prevalence discrepancy between dark markets and seized goods. Specifically, watches are more prominent on dark markets, while electronics, shoes, clothes, and Tobacco are more prevalent among seized goods. Furthermore, vendors seem to favor high-value products with big profit margins (e.g., watches) instead of products for which higher turnovers are necessary (e.g., Tobacco) to obtain the same revenues. Interestingly, we found some similarities in shipping origins between dark markets and seized goods, with some exceptions, such as relatively high origin shares from the US in dark market counterfeits.

While the study is based on historical data, we showed that examining dark market counterfeits in more detail can contribute to our understanding of the counterfeit market. With an increasing emergence of darknet markets and single vendor shops, offers of counterfeits are also likely to increase. Thus, examining current dark market data would be valuable in future analyses of IP crime, which would provide us with more up-to-date insights. Collecting data from dark markets to gather intelligence could be done manually and automatically and would probably be very cost-effective compared to (border) seizures. Once implemented, prolonged data collection could be easily maintained, providing us with regular details on counterfeits. Such information would be usable by authorities and businesses, informing them which products are currently affected.

Availability of data and materials

The datasets analyzed during the current study are available in the Darknet Market Archives (https://www.gwern.net/DNM-archives). Analysis code and supplementary data can be found at: https://osf.io/32au4/?view_only=e0566219216f4a848bb108f510fe2352.

Notes

Data: https://www.gwern.net/DNM-archives.
Manual inspections of the data revealed that markets with a shorter lifespan did not sell any counterfeits and often harbored very few vendors.
The code used to process and analyze the data can be found at: https://osf.io/32au4/.
www.prolific.co.
Due to the allocation procedure of participants from Prolific to our annotation task, some listings were only annotated twice while some were annotated more than three times. 192 annotation ties were manually resolved by the first author.
The categories were determined based on reported counterfeits for seized goods by law enforcement (OECD/EUIPO, 2019).
Krippendorff's alpha ranges between -1 (perfect disagreement) and 1 (perfect agreement) and can account for unequal numbers of annotators and annotations per item as well as missing annotations.
If several replacement prices were available, we took the average price as the replacement.
An exit scam describes a situation in which the platform (market) owners steal all cryptocurrencies from all customers. On many markets it is necessary to upload cryptocurrency to an account before making a purchase. Thus, market owners have full control over the customers' deposits. In some instances, the market owners also control the implemented escrow service, allowing for an even bigger exit scam.
For a detailed timeline of the market lifespans and their reasons for closing, see EMCDDA-Europol (2017).
Crimestoppers is a non-governmental organization, which allows citizens to anonymously report crimes and concerns (https://crimestoppers-uk.org/).
Trading Standards is the local law enforcement body within the UK, which investigates IP crimes and enforces consumer protection legislation (https://www.tradingstandards.uk/).
Prices of listings that were given only in cryptocurrency were converted into USD, using the average conversion value from “https://coinmarketcap.com/currencies/bitcoin/historical-data/” on the day the listing was dated (scraping date).
The category “Other” contained a wide variety of products, including leather products (e.g., belts) and cars.
Geizhals.eu collects prices from original, licensed vendors, and reselling platforms (e.g., eBay), capturing price developments after a product is no longer manufactured.
We determined the historic price by looking up the price of the product on the date it was listed on the dark market. For four products, we found historic prices deviating from the listing date by 2–4 months.
Google shopping shows products from original, licensed vendors, and reselling platforms (e.g., eBay).
In some cases, the product titles from the dark markets were not detailed enough (e.g. “LV wallet”) to find the exact products on the surface web. Three products in cosmetics and five in Tobacco could not be found on the surface web.
While randomly sampling products from “Other” we encountered products such as a car code grabber, a mail list for spam, or other digital services, which are not sold on the surface web. We also encountered a counterfeit of a Picasso painting “Seated Woman (Marie-Therese)”, worth more than USD 60 million.
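
The currency conversion described in the note on cryptocurrency prices can be illustrated with a minimal Python sketch. It assumes a lookup table of daily average BTC/USD rates keyed by scraping date; the table name `BTC_USD_BY_DAY`, the function `listing_price_usd`, and the two rates are hypothetical placeholders, not values or code from the study.

```python
from datetime import date

# Hypothetical daily average BTC/USD rates, keyed by scraping date.
# Real values would come from a historical price table; these are made up.
BTC_USD_BY_DAY = {
    date(2014, 6, 1): 600.0,
    date(2014, 6, 2): 650.0,
}


def listing_price_usd(price_btc, scraped_on, rates=BTC_USD_BY_DAY):
    """Convert a BTC listing price to USD with the rate of the scraping date."""
    if scraped_on not in rates:
        # Keep the gap visible rather than guessing a rate for a missing day.
        raise KeyError(f"no BTC/USD rate for {scraped_on.isoformat()}")
    return price_btc * rates[scraped_on]


usd = listing_price_usd(0.5, date(2014, 6, 2))
print(f"{usd:.2f}")
```

Raising an error for a missing date, rather than falling back to a neighboring day, is a design choice of this sketch; the study's actual code is available at the OSF link given in the notes.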

References

Ball, M., Broadhurst, R., Niven, A., & Trivedi, H. (2019). Data capture and analysis of darknet markets. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3344936
Baravalle, A., & Lee, S. W. (2018). Dark web markets: turning the lights on AlphaBay. In H. Hacid, W. Cellary, H. Wang, H.-Y. Paik, & R. Zhou (Eds.), Web Information Systems Engineering – WISE 2018 (pp. 502–514). Springer International Publishing.
Bird, S., Klein, E., & Loper, E. (2009). Natural language processing with python (p. 504). O’Reilly Media Inc.
Bracci, A., Nadini, M., Aliapoulios, M., McCoy, D., Gray, I., Teytelboym, A., Gallo, A., & Baronchelli, A. (2021a). Dark Web Marketplaces and COVID-19: After the vaccines. ArXiv: 2102.05470 [Physics]. http://arxiv.org/abs/2102.05470. Accessed 28 Mar 2022.
Bracci, A., Nadini, M., Aliapoulios, M., McCoy, D., Gray, I., Teytelboym, A., Gallo, A., & Baronchelli, A. (2021b). Dark web marketplaces and COVID-19: Before the vaccine. EPJ Data Science, 10(1), 6. https://doi.org/10.1140/epjds/s13688-021-00259-w
Branwen, G., Christin, N., Décary-Hétu, D., Andersen, R. M., StExo, El Presidente, Anonymous, Lau, D., Sohhlz, Kratunov, D., Cakic, V., Whom, McKenna, M., & Goode, S. (2015). Dark Net Market archives, 2011–2015 (2015-07-12). https://www.gwern.net/DNM-archives. Accessed 3 July 2019.
Broadhurst, R., & Ball, M. (2020). Availability of COVID-19 related products on Tor darknet markets. Australian Institute of Criminology. https://doi.org/10.52922/sb04534
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. https://doi.org/10.1613/jair.953
Christin, N. (2013). Traveling the silk road: A measurement analysis of a large anonymous online marketplace. Proceedings of the 22nd International Conference on World Wide Web - WWW ’13, 213–224. https://doi.org/10.1145/2488388.2488408
Clarke, R. V. (1995). Situational crime prevention. Crime and Justice, 19, 91–150. https://doi.org/10.1086/449230
Clarke, R. V., & Cornish, D. B. (1985). Modeling offenders’ decisions: A framework for research and policy. Crime and Justice, 6, 147–185. https://doi.org/10.1086/449106
Cohen, L. E., & Felson, M. (1979). Social change and crime rate trends: A routine activity approach. American Sociological Review, 44(4), 588–608. https://doi.org/10.2307/2094589
Corsearch. (2023). Content Protection: Investigation Services. Corsearch. https://corsearch.com/investigation-services/. Accessed 23 May 2023.
Décary-Hétu, D., & Giommoni, L. (2017). Do police crackdowns disrupt drug cryptomarkets? A longitudinal analysis of the effects of operation onymous. Crime, Law and Social Change, 67(1), 55–75. https://doi.org/10.1007/s10611-016-9644-4
Dellarocas, C. (2006). Reputation mechanisms. In T. Hendershott (Ed.), Handbook on economics and information systems (pp. 629–660). Elsevier.
Du, P.-Y., Zhang, N., Ebrahimi, M., Samtani, S., Lazarine, B., Arnold, N., Dunn, R., Suntwal, S., Angeles, G., Schweitzer, R., & Chen, H. (2018). Identifying, Collecting, and Presenting Hacker Community Data: Forums, IRC, Carding Shops, and DNMs. 2018 IEEE International Conference on Intelligence and Security Informatics (ISI), 70–75. https://doi.org/10.1109/ISI.2018.8587327
ElBahrawy, A., Alessandretti, L., Rusnac, L., Goldsmith, D., Teytelboym, A., & Baronchelli, A. (2020). Collective dynamics of dark web marketplaces. Scientific Reports, 10(1), 18827. https://doi.org/10.1038/s41598-020-74416-y
EMCDDA-Europol. (2017). Drugs and the darknet: Perspectives for enforcement, research and policy. Publications Office of the European Union.
EUIPO. (2019). 2019 Status Report on IPR infringement. European Union, Intellectual Property Office. https://euipo.europa.eu/tunnel-web/secure/webdav/guest/document_library/observatory/documents/reports/2019_Status_Report_on_IPR_infringement/2019_Status_Report_on_IPR_infringement_en.pdf. Accessed 10 Nov 2020.
Europol. (2017). INTELLECTUAL PROPERTY CRIME ON THE DARKNET. Enforcement Cooperation. https://www.europol.europa.eu/publications-documents/intellectual-property-crime-darknet. Accessed 7 May 2019.
Federal Bureau of Investigation. (2014). 2014 Internet Crime Report. U.S Department of Justice. https://pdf.ic3.gov/2014_IC3Report.pdf. Accessed 10 Nov 2020.
Federal Bureau of Investigation. (2015). 2015 Internet Crime Report (p. 236). U.S Department of Justice.
Federal Bureau of Investigation. (2016). 2016 Internet Crime Report. U.S Department of Justice. https://pdf.ic3.gov/2016_IC3Report.pdf. Accessed 10 Nov 2020.
Feng, G. C. (2015). Mistakes and how to avoid mistakes in using intercoder reliability indices. Methodology, 11(1), 13–22. https://doi.org/10.1027/1614-2241/a000086
Freilich, J. D., & Newman, G. R. (2018). Situational crime prevention. Oxford Research Encyclopedia of Criminology. https://doi.org/10.1093/acrefore/9780190264079.013.3
Garg, V., Afroz, S., Overdorf, R., & Greenstadt, R. (2015). Computer-supported cooperative crime. In R. Böhme & T. Okamoto (Eds.), Financial cryptography and data security (pp. 32–43). New York: Springer.
Gayialis, S. P., Kechagias, E. P., Papadopoulos, G. A., & Masouras, D. (2022). A review and classification framework of traceability approaches for identifying product supply chain counterfeiting. Sustainability, 14(11), 6666. https://doi.org/10.3390/su14116666
Ghosh, S., Porras, P., Yegneswaran, V., Nitz, K., & Das, A. (2017). ATOL: A Framework for Automated analysis and categorization of the darkweb ecosystem. In Workshops at the Thirty-First AAAI Conference on Artificial Intelligence.
Hayes, A. F., & Krippendorff, K. (2007). Answering the call for a standard reliability measure for coding data. Communication Methods and Measures, 1(1), 77–89. https://doi.org/10.1080/19312450709336664
Hollis, M. E., & Wilson, J. (2014). Who are the guardians in product counterfeiting? A theoretical application of routine activities theory. Crime Prevention and Community Safety, 16(3), 169–188. https://doi.org/10.1057/cpcs.2014.6
Hutchings, A. (2018). Leaving on a jet plane: The trade in fraudulently obtained airline tickets. Crime, Law and Social Change, 70(4), 461–487. https://doi.org/10.1007/s10611-018-9777-8
IP Crime Group. (2015). IP Crime Report 2014/15 (p. 52). Intellectual Property Office UK.
Krippendorff, K. (1970). Estimating the reliability, systematic error and random error of interval data. Educational and Psychological Measurement, 30(1), 61–70. https://doi.org/10.1177/001316447003000105
Labrador, V., & Pastrana, S. (2022). Examining the trends and operations of modern Dark-Web marketplaces. 2022 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW), 163–172. https://doi.org/10.1109/EuroSPW55150.2022.00022
Laferrière, D., & Décary-Hétu, D. (2022). Examining the uncharted dark web: Trust signalling on single vendor shops. Deviant Behavior. https://doi.org/10.1080/01639625.2021.2011479
Lemaître, G., Nogueira, F., & Aridas, C. K. (2017). Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. Journal of Machine Learning Research, 18(17), 1–5.
Lenaerts-Bergmans, B. (2023, April 27). What is Dark Web Monitoring? [Beginner’s Guide] - CrowdStrike. Crowdstrike.Com. https://www.crowdstrike.com/cybersecurity-101/dark-web-monitoring/. Accessed 23 May 2023.
Männistö, T., Morini, C., & Hintsa, J. (2021). Customs Innovations for Fighting Fraud and Trafficking in Cross-border Parcel Flows
Marin, E., Diab, A., & Shakarian, P. (2016). Product Offerings in Malicious Hacker Markets. ArXiv:1607.07903 [Cs]. http://arxiv.org/abs/1607.07903. Accessed 6 Mar 2020.
Marucheck, A., Greis, N., Mena, C., & Cai, L. (2011). Product safety and security in the global supply chain: Issues, challenges and research opportunities. Journal of Operations Management, 29(7–8), 707–720. https://doi.org/10.1016/j.jom.2011.06.007
Nadini, M., Bracci, A., ElBahrawy, A., Gradwell, P., Teytelboym, A., & Baronchelli, A. (2021). Emergence and structure of decentralised trade networks around dark web marketplaces. ArXiv:2111.01774 [Physics]. http://arxiv.org/abs/2111.01774. Accessed 28 Mar 2022.
OECD. (2018). Trade in counterfeit goods and free trade zones: Evidence for recent trends. OECD Publishing.
OECD, EUIPO. (2019). Trends in trade in counterfeit and pirated goods. OECD Publishing.
Paul, K. (2018). Ancient artifacts vs. Digital artifacts: new tools for unmasking the sale of illicit antiquities on the dark web. Arts, 7(2), 12. https://doi.org/10.3390/arts7020012
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., & Cournapeau, D. (2011). Scikit-learn: Machine learning in python. Journal of Machine Learning Research, 12, 2825–2830.
Platzer, F., Brenner, F., & Steinebach, M. (2022). Similarity analysis of single-vendor marketplaces in the tor-network. Journal of Cyber Security and Mobility. https://doi.org/10.13052/jcsm2245-1439.1124
Roberts, D. L., & Hernandez-Castro, J. (2017). Bycatch and illegal wildlife trade on the dark web. Oryx, 51(3), 393–394. https://doi.org/10.1017/S0030605317000679
Schafer, M., Fuchs, M., Strohmeier, M., Engel, M., Liechti, M., & Lenders, V. (2019). BlackWidow: Monitoring the Dark Web for Cyber Security Information. 2019 11th International Conference on Cyber Conflict (CyCon), 1–21. https://doi.org/10.23919/CYCON.2019.8756845
Sergi, A. (2022). Playing Pac-Man in Portville: Policing the dilution and fragmentation of drug importations through major seaports. European Journal of Criminology, 19(4), 674–691. https://doi.org/10.1177/1477370820913465
Soska, K., & Christin, N. (2015). Measuring the longitudinal evolution of the online anonymous marketplace ecosystem. Proceedings of the 24th USENIX Security Symposium, 33–48.
Spink, J., Moyer, D. C., Park, H., & Heinonen, J. A. (2013). Defining the types of counterfeiters, counterfeiting, and offender organizations. Crime Science, 2(1), 8. https://doi.org/10.1186/2193-7680-2-8
Spink, J., Moyer, D. C., Park, H., & Heinonen, J. A. (2014). Development of a product-counterfeiting incident cluster tool. Crime Science. https://doi.org/10.1186/s40163-014-0003-4
Sullivan, B. A., Chan, F., Fenoff, R., & Wilson, J. M. (2017). Assessing the developing knowledge-base of product counterfeiting: A content analysis of four decades of research. Trends in Organized Crime, 20(3), 338–369. https://doi.org/10.1007/s12117-016-9300-5
Tang, C. S. (2006). Perspectives in supply chain risk management. International Journal of Production Economics, 103(2), 451–488. https://doi.org/10.1016/j.ijpe.2005.12.006
UNICRI, & ICC BASCAP. (2013). Confiscation of the Proceeds of Crime: A Modern Tool for Deterring Counterfeiting and Piracy and Executive Summary. United Nations Interregional Crime and Justice Research Institute, International Chamber of Commerce ‘Business Action to Stop Counterfeiting and Piracy’
UNODC. (2014). The illicit trafficking of counterfeit goods and transnational organized crime. United Nations Office on Drugs and Crime. https://www.unodc.org/documents/counterfeit/FocusSheet/Counterfeit_focussheet_EN_HIRES.pdf. Accessed 1 June 2021.
Van Buskirk, J., Naicker, S., Bruno, R. B., Breen, C., & Roxburgh, A. (2016). Drugs and the Internet. https://www.drugsandalcohol.ie/20369/1/NDARC_Drugs&TheInternet_Bulletin1.pdf. Accessed 1 Oct 2019.
van Wegberg, R., Tajalizadehkhoob, S., Soska, K., Akyazi, U., Ganan, C., Klievink, B., Christin, N., & van Eeten, M. (2018). Plug and Prey? Measuring the Commoditization of Cybercrime via Online Anonymous Markets. Proceedings of the 27th USENIX Security Symposium, 1009–1026
WTO. (1994). TRIPS: Agreement on trade-related aspects of intellectual property rights. New York: WTO.

Acknowledgements

Not applicable.

Funding

This work was funded by the Dawes Centre for Future Crime.

Author information

Authors and Affiliations

Department of Security and Crime Science & Dawes Centre for Future Crime, University College London, London, UK
Felix Soldner, Bennett Kleinberg & Shane D. Johnson
GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany
Felix Soldner
Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands
Bennett Kleinberg

Contributions

FS, BK, and SJ contributed to the conception and design of the study. FS curated and analyzed the data and drafted the first version of the manuscript. FS, BK, and SJ reviewed and edited the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Felix Soldner.

Ethics declarations

Ethics approval and consent to participate

The study was reviewed by the ethics committee of the UCL Department of Security and Crime Science and was exempted from requiring approval by the central UCL Research Ethics Committee.

Consent for publication

Participants involved in this study provided written informed consent online by clicking all consent statement boxes affirming their consent before taking part in the study.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Appendix A: List of considered markets. Appendix B: Categories included for keyword searches. Appendix C: Synonyms used for keyword search. Appendix D: Synonyms of authentic. Appendix E: Keywords used to exclude listings. Appendix F: Full lists of percentage counterfeits by OECD/EUIPO (2019) and by IP Crime Group (2015). Appendix G: Product price differences for 10 products in each category between Darknet markets and the surface web.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Soldner, F., Kleinberg, B. & Johnson, S.D. Counterfeits on dark markets: a measurement between Jan-2014 and Sep-2015. Crime Sci 12, 18 (2023). https://doi.org/10.1186/s40163-023-00195-2

Received: 17 January 2023
Accepted: 27 August 2023
Published: 17 October 2023
DOI: https://doi.org/10.1186/s40163-023-00195-2
