148 items found

Licenses: Academic Free License 3.0 Groups: sobigdata-it

Filter Results
  • Dataset

    EMAKG: Enhanced Microsoft Academic Knowledge Graph

    The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and...
    • The resource: 'Link to dataset.' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific ...' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Where do migrants and natives belong in a community: a Twitter case study and...

    Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours...
    • The resource: 'Link to paper' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Dataset

    Iperf K8s-based Power and Resource consumption dataset

    The data were collected in a Prometheus-like data format: each entry has a timestamp, a value and key-value labels containing additional information. Metrics were gathered...
    • CSV
      The resource: '5G_Power_and_Resource_consu ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Stroke and sepsi

    The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
    • The resource: 'Stroke and sepsi' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • ConferencePaper

    Characterising different communities of Twitter users: migrants and natives

    Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Origin and destination attachment: study of cultural integration on Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives’ attitudes towards globalisation in general and immigration in...
    • HTML
      The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Measuring the Salad Bowl: Superdiversity on Twitter

    Superdiversity refers to large cultural diversity in a population due to immigration. In this paper, we introduce a superdiversity index based on the changes in the emotional...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • Dataset

    Human and mouse gene regulatory networks

    The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available...
    • The resource: 'Human and mouse gene ...' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-E...

    Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders. However, the lack of verifiable...
    • The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • ConferencePaper

    Digital footprints of international migration on twitter

    Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • Method

    Online Learning of Order Flow and Market Impact (OLOFMI)

    This library performs regime detection in the aggregated order flow time-series and market impact analysis. The required input file is in the format of the message file of the...
  • Method

    Score-Driven Bayesian Online Change Point Detection (SD-BOCPD)

    This code deals with Bayesian online detection in univariate time-series of changepoints, i.e. abrupt variations in the generative parameters of a data, and regimes, i.e....
  • Access required...

    ×

    Dataset

    Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...

    "A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long...
  • Experiment

    Online polarization: enriching models with data, understanding data through m...

    Development of online polarization dynamics models and application to social media discussion data
    • The resource: 'GitHub repository' is not accessible as guest user. You must login to access it!
    • The resource: 'GitHub Repository Change ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Brexit dataset

    This dataset comprises a set of online footprints extracted from Twitter using the available APIs. It is centered around the Brexit debate on Twitter from the 2nd until the...
    • RAR
      The resource: 'BrexitDataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!