36 items found

Licenses: Academic Free License 3.0
Groups: sobigdata-eu

  • Dataset

    Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...

    Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
    • CSV
      Resource: 'Synthetic Datasets for ...' (login required)
  • Dataset

    Semantic Networks from news articles (Romanian sample)

    The Semantic Networks from news articles (Romanian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Romanian_sampleNet_anonymized' (login required)
  • Method

    Reducing radicalizism in social networks by feeds prioritization - Rebalancin...

    Code and a description of the methodology from the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement", funded by SoBigData++.
  • Dataset

    Lexical networks from Polish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Polish news articles extracted from the dataset described...
    • jsonl
      Resource: 'polish_egoNet_w4' (login required)
  • Dataset

    Lexical networks from Finnish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Finnish news articles extracted from the dataset...
    • jsonl
      Resource: 'finnish_egoNet_w4' (login required)
  • Dataset

    Lexical networks from Lithuanian news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Lithuanian news articles extracted from the dataset...
    • jsonl
      Resource: 'lithuanian_egoNet_w4' (login required)
  • Dataset

    Semantic Networks from news articles (Danish sample)

    The Semantic Networks from news articles (Danish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Danish_sampleNet_anonymized' (login required)
  • Dataset

    Semantic Networks from news articles (Italian sample)

    The Semantic Networks from news articles (Italian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Semantic Networks from ...' (login required)
  • Dataset

    Semantic Networks from news articles (German sample)

    The Semantic Networks from news articles (German sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'German_sampleNet_anonymized' (login required)
  • Dataset

    Semantic Networks from news articles (Dutch sample)

    The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Dutch_sampleNet_anonymized' (login required)
  • Dataset

    Semantic Networks from news articles (English sample)

    The Semantic Networks from news articles (English sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Semantic Networks from ...' (login required)
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • Dataset

    Facebook Wallpost

    Online interactions between users via the wall feature in the New Orleans regional network.
    • HTML
      Resource: 'Original data' (login required)
  • Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • Dataset

    Twitter fake followers

    Fake followers are fake accounts massively created to follow a target account and that can be bought from online markets. In other words, their goal is that of increasing the...