Detecting Fake Content Publishers in BitTorrent: Performance & User Security Threats | Lecture notes Statistics

arXiv:1105.3671v3 [cs.CR] 19 Apr 2012

TorrentGuard: stopping scam and malware

distribution in the BitTorrent ecosystem

Michal Kryczka∗† , Ruben Cuevas†, Roberto Gonzalez†Angel Cuevas‡and Arturo Azcorra†

∗Institute IMDEA Networks

†University Carlos III Madrid

‡Telecom SudParis

Abstract—In this paper we conduct a large scale measurement

study in order to analyse the fake content publishing phenomenon

in the BitTorrent Ecosystem. Our results reveal that fake content

represents an important portion (35%) of those files shared in

BitTorrent and just a few tens of users are responsible for 90%

of this content. Furthermore, more than 99% of the analysed

fake files are linked to either malware or scam websites. This

creates a serious threat for the BitTorrent ecosystem. To address

this issue, we present a new tool named TorrentGuard for the

early detection of fake content. Based on our evaluation this tool

may prevent end users from downloading more than 35 millions

of fake files per year. This could help to reduce the number

of computer infections and scams suffered by BitTorrent users.

TorrentGuard is already available and it can be accessed through

both a webpage or a Vuze plugin.

I. INTRODUCT IO N

BitTorrent is one of the most popular applications in the

current Internet. It is daily utilised by millions of users and

is responsible for a major portion of the Internet traffic [26].

This success motivated the research community to investigate

different aspects of BitTorrent covering performance [20][25],

economics [10][13][30] and incentives [17][27] issues. How-

ever, to the best of the author knowledge, the research com-

munity has put less attention to BitTorrent security aspects.

Some previous works have analysed the vulnerabilities of

the BitTorrent protocol to free-riders [21][22][29] whereas

some others address the lack of privacy offered by BitTorrent

[8]. More recently, in a previous work [12] we demonstrated

that the BitTorrent ecosystem is suffering from a continuous

poisoning index attack resulting in 30% of published torrents

associated to fake content. Furthermore, this fake content

produces 25% of the download events, which means that

every fourth content download in BitTorrent is fake. These

initial results highlight a serious issue that, to the best of the

authors knowledge, has still not been covered by the research

community.

In this paper we thoroughly analyse the fake publishing

phenomenon in BitTorrent in order to understand its real

impact on the system performance as well as the potential

risks of fake content for BitTorrent users. Furthermore, we

propose a practical solution to mitigate this problem. We base

our study on data collected from torrents published in The

Pirate Bay portal during a period of 14 days from 30-04-2011

to 13-05-2011. The 35% of almost 30K analysed torrents are

associated to fake content. This depicts a 5% increment in the

presence of fake content within the BitTorrent ecosystem in a

period of one year between our two measurement studies. This

justifies (even more) the necessity of the research conducted

in this paper.

In order to fight the fake publishing phenomenon, the first

step is to properly characterise the fake publishers and their be-

haviour. The current BitTorrent portals solutions identify fake

publishers through the user account that they use to upload

fake torrents to the portal. We show in the paper that this

technique is inefficient since the fake publisher can generate

as many user accounts as needed in those portals. Instead, the

parameter that uniquely identifies the fake publisher is the IP

address it uses to perform its activity. Surprisingly, our data

reveals that just 20 fake publishers (whose IP we identify) are

responsible for injecting 90% of fake content in the BitTorrent

ecosystem. Moreover, most of these IP addresses belong to

Hosting Providers where the fake publishers rent dedicated

high-resource servers to perform their activity.

The fake publishing activity is time consuming since a fake

publisher needs to manually create the user accounts used in

the different portals (in some cases up to 4 accounts per day).

Furthermore, this activity requires dedicated resources (e.g.

rented servers). This investment in time and resources can be

only justified by a strong motivation behind the distribution of

fake content. We have downloaded and manually inspected a

large number of fake content published during our measure-

ment period and found 3 different profiles among the fake

publishers: (i)a first group of fake publishers aims to spread

malware using the popular BitTorrent system; (ii)a second

set of users tries to attract BitTorrent users to scam websites

in order to get economical benefit from the victims by using

different scam techniques; (iii)the last group is formed by

antipiracy agencies that upload fake versions of those content

that they want to protect.

Our data shows that more than 99% of the published fake

content is associated with the two first profiles. This supposes

a very serious threat for the BitTorrent ecosystem since the

activity of these publishers may lead to thousands of unde-

sirable episodes of scammed users and computer infections.

These findings suggest that new solutions need to be proposed

in order to eliminate or at least reduce the number of fake

content available in the BitTorrent ecosystem. Towards this

Detecting Fake Content Publishers in BitTorrent: Performance & User Security Threats, Lecture notes of Statistics

Related documents

Partial preview of the text

Download Detecting Fake Content Publishers in BitTorrent: Performance & User Security Threats and more Lecture notes Statistics in PDF only on Docsity!

arXiv:1105.3671v3 [cs.CR] 19 Apr 2012

TorrentGuard: stopping scam and malware

distribution in the BitTorrent ecosystem

Michal Kryczka∗†, Ruben Cuevas†, Roberto Gonzalez†^ Angel Cuevas‡^ and Arturo Azcorra†

VIII. RELATED WORK