379 items found

Licenses: Academic Free License 3.0 Organisations: SoBigData Catalogue

Filter Results
  • Access required...

    ×

    Dataset

    Private Air Traffic Data International Mobility Indicators for the UK

    The Air Traffic Data International Mobility Indicators for the UK results from the investigation on air passenger data. Starting from air passenger traffic volumes from each...
  • Dataset

    Stroke and sepsi

    The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
    • The resource: 'Stroke and sepsi' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Dataset

    Human and mouse gene regulatory networks

    The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available...
    • The resource: 'Human and mouse gene ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Application

    Private mamtorch library

    Python Multiply-And-Max/min (MAM) torch-compatible kernel library (https://github.com/SSIGPRO/mamtorch). With this library, it is possible to substitute standard neurons with...
  • Method

    Online Learning of Order Flow and Market Impact (OLOFMI)

    This library performs regime detection in the aggregated order flow time-series and market impact analysis. The required input file is in the format of the message file of the...
  • Method

    Score-Driven Bayesian Online Change Point Detection (SD-BOCPD)

    This code deals with Bayesian online detection in univariate time-series of changepoints, i.e. abrupt variations in the generative parameters of a data, and regimes, i.e....
  • Dataset

    Common Crawl Financial News Dataset

    This dataset contains financial articles related to companies in the S&P500 index for the period from September 2016 to February 2020. The articles were extracted from the...
    • CSV
      The resource: 'Common_Crawl_Financial_News' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...

    "A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long...
  • Dataset

    Last.Fm UK User Graph Dataset: A Social Network and Music Listening Behavior ...

    The Last.Fm UK User Graph Dataset is a comprehensive collection of social network and music listening behavior data obtained from the Last.Fm platform. The dataset includes user...
    • Folder
      The resource: 'Link to the folder ...' is not accessible as guest user. You must login to access it!
  • Experiment

    Online polarization: enriching models with data, understanding data through m...

    Development of online polarization dynamics models and application to social media discussion data
    • The resource: 'GitHub repository' is not accessible as guest user. You must login to access it!
    • The resource: 'GitHub Repository Change ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Brexit dataset

    This dataset comprises a set of online footprints extracted from Twitter using the available APIs. It is centered around the Brexit debate on Twitter from the 2nd until the...
    • RAR
      The resource: 'BrexitDataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental outdoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in outdoor of a smart domestic room located in...
    • The resource: 'IoT_dataset_outdoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    The subTHz regime, first Results on channel measurement: 500-750 GHz

    The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 500-750 GHz (W-band). IF bandwidth has...
    • s2p
      The resource: '500-750 GHz 30 cm' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '500-750 GHz 60 cm' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '500-750 GHz 90 cm' is not accessible as guest user. You must login to access it!
  • Dataset

    RAN and NWDAF data from Cellular Network in Catania

    Dataset containing various RAN and UEs metrics collected from 4 BSs deployed at Piazza D'Uomo, Catania. Metrics can be used for machine learning-based studies for physical...
    • The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Solar power generation in Tuscany

    The dataset contains the expected power generation profile of a set of photovoltaic generators positioned in Tuscany with a nominal power of 4 MWp. The generation profile has...
    • XLSX
      The resource: 'Solar power Tuscany' is not accessible as guest user. You must login to access it!
  • Dataset

    UWB RADAR dataset of human activity detection in smart office

    The UWB RADAR dataset consists of time series data acquired from UWB RADAR deployed in a smart office room located in ICAR-CNR, for monitoring human activity detection. Raw...
    • RAR
      The resource: 'IoT_UWB_RADAR_dataset_for_s ...' is not accessible as guest user. You must login to access it!