252 items found

Groups: sobigdata-it

Filter Results
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
      The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • ConferencePaper

    Characterising different communities of Twitter users: migrants and natives

    Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Origin and destination attachment: study of cultural integration on Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives’ attitudes towards globalisation in general and immigration in...
    • HTML
      The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Multi-Start Optimization Neural Networks

    In this repository, we publish the codes necessary to implement the Multi-Start Optimization Neural Networks (MSO-NNs), presented fin the paper: Automatic...
  • Access required...

    ×

    Dataset

    Private Vegetation of a basin of the Po river Dataset

    We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which...
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Word-in-Context task for Italian

    The general goal of the WiC-ITA task is to establish whether a word w occurring in two different sentences, s_1 and s_2, has the same meaning or not. In particular, our task...
  • JournalArticle

    Measuring the Salad Bowl: Superdiversity on Twitter

    Superdiversity refers to large cultural diversity in a population due to immigration. In this paper, we introduce a superdiversity index based on the changes in the emotional...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • Dataset

    Human and mouse gene regulatory networks

    The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available...
    • The resource: 'Human and mouse gene ...' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-E...

    Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders. However, the lack of verifiable...
    • The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Alternate Training for Multi-Task Neural Networks

    In this repository, we publish the code used to implement the Alternate Training through the Epochs (ATE) procedure for training Multi-Task Neural Networks (MTNN) presented in...
  • Access required...

    ×

    Dataset

    Private Highway driving simulation

    The SUMO simulator is used to model scenarios with diferent road topologies and traffc intensities, randomizing the fow of vehicles, to ensure the generation of sufciently...
  • ConferencePaper

    Digital footprints of international migration on twitter

    Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Dynamical Linear Upper Confidence Bound (DynLin-UCB)

    The repository contains the code to run DynLin-UCB (Dynamical Linear Upper Confidence Bound). DynLin-UCB is an optimistic regret-minimization algorithm that can be used to...
  • Method

    Online Learning of Order Flow and Market Impact (OLOFMI)

    This library performs regime detection in the aggregated order flow time-series and market impact analysis. The required input file is in the format of the message file of the...
  • Method

    Score-Driven Bayesian Online Change Point Detection (SD-BOCPD)

    This code deals with Bayesian online detection in univariate time-series of changepoints, i.e. abrupt variations in the generative parameters of a data, and regimes, i.e....
  • Method

    Debiaser for Multiple Variables (DEMV)

    DEMV is a Debiaser for Multiple Variables that aims to increase Fairness in any given dataset, both binary and categorical, with one or more sensitive variables, while keeping...
    • ipynb
      The resource: 'Tutorial Notebook' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...

    "A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long...